Site Reliability Engineer, L5 – CORE

November 7

Apply Now
Logo of Netflix

Netflix

B2C • eCommerce • Media

Netflix is a global streaming service that offers a wide variety of award-winning television shows, movies, anime, documentaries, and more on thousands of internet-connected devices. It allows users to watch instantly and provides original content produced by Netflix itself, catering to diverse tastes. As a pioneer in original programming since its first series in 2013, Netflix aims to entertain audiences worldwide through immersive storytelling and innovative technology.

- employees

Founded 1997

👥 B2C

🛍️ eCommerce

📱 Media

💰 $20M Post-IPO Equity on 2022-01

📋 Description

• Design, implement, and maintain scalable and reliable infrastructure to support Netflix Streaming Suite • Collaborate with engineering and product teams to integrate observability, reliability, and security considerations into the entire software development lifecycle • Develop and implement automation tools for monitoring, deployment, and incident response • Participate in on-call rotations to ensure the 24/7 health of the Netflix Streaming and contribute to incident response, diagnosis, and resolution • Implement and maintain a robust incident response framework • Proactively identify sources of instability in distributed systems • Champion and embed a culture of reliability across the Ads organization

🎯 Requirements

• 5+ years of experience as a Site Reliability Engineer (SRE), Production Engineer, or similar role supporting business-critical, high-traffic services • Write code to solve problems • Proficient in one or more languages like Python, Go, or Java • Hands-on experience with cloud providers such as AWS/Azure/GCP • Infrastructure as Code such as Terraform • Container orchestration systems like Kubernetes • Understand large-scale distributed systems, their common failure modes and edge cases • Experience with incident management and response • Must be fluent in English

🏖️ Benefits

• Health Plans • Mental Health support • 401(k) Retirement Plan with employer match • Stock Option Program • Disability Programs • Health Savings and Flexible Spending Accounts • Family-forming benefits • Life and Serious Injury Benefits • Paid leave of absence programs • Full-time hourly employees accrue 35 days annually for paid time off • Full-time salaried employees are immediately entitled to flexible time off

Apply Now

Similar Jobs

November 7

RT²

51 - 200

Site Reliability Engineer ensuring system reliability for Realtime's innovative retail tech solutions. Collaborating to enhance performance and operational efficiency in cloud-based infrastructures.

Ansible

Azure

Cloud

Grafana

Python

Terraform

VMware

November 6

MAK-SYSTEM

201 - 500

Platform Engineer positioned within 2 Tech at MAK-SYSTEM, enhancing deployment and infrastructure management on AWS. Collaborating with teams on innovative solutions and system improvements.

Ansible

AWS

Chef

Docker

Java

Jenkins

Kubernetes

Linux

MySQL

Oracle

Postgres

Puppet

Subversion

Terraform

Unix

November 6

Lead Site Reliability Engineer focused on developing automation for Akamai's compute infrastructure. Mentoring SRE team and collaborating with project teams to enhance operational excellence.

November 6

Senior Microsoft 365 Deployment Engineer overseeing Microsoft 365 solutions for Red River customers. Collaborating with clients to design and deploy tailored solutions while resolving technical challenges.

November 6

Guidehouse

10,000+ employees

DevOps Engineer supporting development, QA, and operations across multiple applications. Enabling fast, secure software delivery through automation and cloud-native infrastructure.

Ansible

Apache

Cloud

DNS

Firewalls

Grafana

Java

Jenkins

Kubernetes

Linux

NGINX

OpenShift

Postgres

Prometheus

Splunk

Terraform

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com