Lead Site Reliability Developer

🕒 1 month ago

🗣️🇺🇸🇬🇧 English required

Live Nation Entertainment

10,000+ employees

Founded in 1996

📱 Media

💰 Post-IPO Debt in 2023-01

Media • Entertainment

Live Nation Entertainment is the world's leading live entertainment company, delivering unforgettable experiences around the globe. Artist-focused and fan-driven, Live Nation works with musicians to bring their creativity to life on stages around the world. As the leading concert producer, ticket seller, and connector of brands to music, Live Nation's platform leads the market in all three of these key industries. Its mission extends beyond entertainment, aiming to elevate, inspire, and create memories through the power of live music.

Description

• Lead consulting work from discovery through delivery by aligning stakeholders on priorities, sequencing work, and communicating measurable outcomes.
• Establish working cadence and facilitate decision forums to surface risks, map dependencies, and drive clear ownership and timelines.
• Align product, platform, and engineering stakeholders on reliability targets and trade-offs using SLOs and error budgets.
• Partner regularly with Engineering Managers, product managers, Staff and Principal engineers, and platform leads to keep dependencies, decisions, and delivery aligned.
• Identify systemic risks across shared dependencies and coordinate remediation across multiple teams to reduce recurring incidents.
• Drive change adoption by embedding reliability mechanisms into partner team routines such as planning, PRRs, and on-call practices.
• Design and implement reusable reliability mechanisms, templates, and tooling that can be adopted across teams.
• Establish and evolve production readiness review practices with partner teams to improve launch quality and change safety.
• Drive observability strategy for partner domains by improving signal quality, alerting philosophy, and operational dashboards.
• Lead complex incident investigations and ensure learnings translate into durable fixes with clear owners and verification.
• Lead reliability-focused design and code reviews and guide teams toward simpler, safer architectures.
• Mentor Senior engineers and other consultants through pairing, reviews, and structured coaching to multiply impact.
• Partner with internal platform engineering to influence roadmaps and deliver shared capabilities that accelerate SRE adoption.
• Improve CSRE Consulting playbooks and operating practices based on repeated patterns observed across teams.

🎯 Requirements

• Deep practical understanding of SRE principles, including SLO governance and error budget policy in practice.
• Proven ability to lead cross-team technical work and influence without authority.
• Strong experience designing and troubleshooting distributed systems with cross-service failure modes.
• Experience shaping observability and alerting strategy and improving operational signal quality.
• Strong Kubernetes and AWS experience, including governance and cost trade-offs.
• Ability to design reliability automation and tooling that is reusable and adopted by multiple teams.
• Experience leading production readiness and resilience practices, including DR validation and controlled testing.
• Strong software engineering fundamentals with the ability to deliver and review high-quality changes in enterprise codebases.
• Advanced incident analysis skills focused on systemic risk reduction and organizational learning.
• Excellent communication skills, including exec-ready summaries and clear technical diagrams.

🏖️ Benefits

• Health: Medical, vision, dental and mental health benefits for you and your family, with access to a health care concierge, and Flexible or Health Savings Accounts (FSA or HSA)
• Yourself: Free concert tickets, generous paid time off including paid holidays, sick time, and personal days
• Wealth: 401(k) program with company match, stock reimbursement program
• Family: New parent programs including caregiver leave, plus fertility, adoption, foster, or surrogacy support
• Career: Career and skill development programs with School of Live, tuition reimbursement, and student loan repayment
• Others: Volunteer time off, crowdfunding match

Similar Jobs

🕒 1 month ago

Meduit | Driving Revenue Cycle Performance

1001 - 5000

⚕️ Health Insurance

🤖 Artificial Intelligence

☁️ SaaS

DevOps Software Configuration Engineer building and maintaining CI/CD pipelines for Java-based applications at Meduit. Collaborating with Engineering, QA, and Application Support teams to ensure reliable software delivery.

🇺🇸 United States – Remote

💵 $130,000 - $145,000 / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & SRE Engineer

🗣️🇺🇸🇬🇧 English required

🕒 1 month ago

Meduit | Driving Revenue Cycle Performance

1001 - 5000

🤝 B2B

🤖 Artificial Intelligence

☁️ SaaS

DevOps Engineer responsible for automated build and deployment in healthcare revenue cycle management. Collaborating with cross-functional teams to support modern deployment practices across AWS and Azure.

🇺🇸 United States – Remote

💵 $130,000 - $145,000 / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & SRE Engineer

🗣️🇺🇸🇬🇧 English required

🕒 1 month ago

BakerHostetler

1001 - 5000

📋 Compliance

🏢 Enterprise

🤝 B2B

Database Reliability Engineer at BakerHostetler, enhancing the firm’s data ecosystem across hybrid environments. Ensuring availability, performance, security, and disaster recovery readiness for critical database systems.

🇺🇸 United States – Remote

💵 $120,000 - $140,000 / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & SRE Engineer

🗣️🇺🇸🇬🇧 English required

🕒 1 month ago

Juul Labs

1001 - 5000

👥 B2C

🛒 Retail

🧘 Wellness

Senior Site Reliability Engineer managing operational stability and performance of Juul's hybrid cloud infrastructure. Leading automation efforts and architecting for reliability in critical incidents.

🗣️🇺🇸🇬🇧 English required

🕒 1 month ago

Prompt Therapy Solutions Inc

11 - 50

⚕️ Health Insurance

⚡ Productivity

☁️ SaaS

Senior DevOps Engineer managing infrastructure and deployment processes for healthcare tech company Prompt Therapy. Leading a team and ensuring scalability, security, and reliability in cloud environments.

🇺🇸 United States – Remote

💵 $230,000 - $250,000 / year

⏰ Full Time

🟠 Senior

⛑ DevOps & SRE Engineer

🗣️🇺🇸🇬🇧 English required