Lead Site Reliability Developer

🕒 May 1

Ticketmaster

10,000+ employees

Founded 1976

🛍️ eCommerce

⚽ Sports

eCommerce • Entertainment • Sports

Ticketmaster is a leading ticketing platform that facilitates the sale of tickets for concerts, sports events, theater performances, and other live entertainment. The platform offers a user-friendly experience for purchasing tickets, as well as managing events and finding popular shows and games. Ticketmaster serves as the official ticket marketplace for many major sports leagues and artist events, making it a key player in the live entertainment industry.

📋 Description

• Lead consulting work from discovery through delivery
• Establish working cadence and facilitate decision forums
• Align stakeholders on reliability targets and trade-offs
• Identify systemic risks and coordinate remediation
• Drive change adoption by embedding reliability mechanisms
• Design and implement reusable reliability mechanisms
• Lead complex incident investigations and ensure learnings translate into durable fixes

🎯 Requirements

• Deep practical understanding of SRE principles
• Proven ability to lead cross-team technical work
• Strong experience designing and troubleshooting distributed systems
• Strong Kubernetes and AWS experience
• Ability to design reliability automation and tooling
• Excellent communication skills

🏖️ Benefits

• Inclusive work environment
• Professional development opportunities
• Opportunities to work with talented people

Similar Jobs

🕒 May 1

Live Nation Entertainment

10,000+ employees

📱 Media

Lead Site Reliability Engineer leading consulting work at Ticketmaster for reliability improvements across multiple teams. Aligning stakeholders and driving adoption of SRE principles.

AWS

Distributed Systems

Kubernetes

🕒 April 30

Civica US

51 - 200

🏛️ Government

☁️ SaaS

📚 Education

Senior Site Reliability Engineer ensuring the reliability, performance and security of Civica’s cloud platform. Collaborating with teams to drive automation and best practices in cloud environments.

Ansible

AWS

Azure

Cloud

Google Cloud Platform

Grafana

Java

Kubernetes

OpenShift

Packer

Prometheus

Python

Terraform

VMware

Go

.NET

🕒 April 25

Atos

10,000+ employees

🔒 Cybersecurity

DevOps Engineer supporting cloud transformation and application portfolios for clients. Collaborating with stakeholders and developers to improve technology and infrastructure in a remote-first environment.

AWS

Azure

Cloud

Cyber Security

Docker

Kubernetes

🕒 April 24

GitLab

1001 - 5000

🤖 Artificial Intelligence

🏢 Enterprise

☁️ SaaS

Cloud Cost Utilization SRE responsible for making cloud spending actionable. Collaborating with Finance and Engineering at GitLab to optimize resource usage.

Ansible

AWS

Cloud

Google Cloud Platform

Grafana

Prometheus

Terraform

🕒 April 24

Lyrebird Health

11 - 50

⚕️ Healthcare Insurance

☁️ SaaS

🤖 Artificial Intelligence

Senior SRE at Lyrebird tasked with managing the reliability and scalability of production systems. Build infrastructure and deployment patterns to support AI-powered healthcare tools.

AWS

Cloud

Distributed Systems

Docker

EC2

Kubernetes