Lead Site Reliability Engineer

🕒 April 9

🇺🇸 United States – Remote

💵 $136k - $177k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

Alteryx

1001 - 5000 employees

🤖 Artificial Intelligence

🤝 B2B

Analytics • Artificial Intelligence • B2B

Alteryx is a leading provider of enterprise analytics solutions that empower organizations to unlock valuable insights from their data. With its AI-driven analytics platform, Alteryx automates the processes of data preparation, analysis, and reporting, making analytics accessible to users at all skill levels. The platform offers features such as data enrichment, predictive modeling, and location intelligence, enabling businesses to improve operational efficiency and drive informed decision-making across various sectors, including financial services, retail, healthcare, and manufacturing.

📋 Description

• Define and drive reliability strategy across control-plane and data-plane systems, including multi-region resilience, BCDR, and failover design
• Establish and operationalize SLOs, SLAs, and error budgets, ensuring they inform planning and engineering tradeoffs
• Lead initiatives that measurably improve MTTR, incident prevention, and overall service health
• Own incident management end-to-end, driving systemic fixes and long-term reliability improvements beyond immediate response
• Lead architecture and design reviews to ensure systems meet scalability, reliability, and cost efficiency goals
• Champion automation and modernization, including AI-driven reliability improvements
• Establish and enforce code quality and review standards
• Lead cross-functional initiatives and align engineering with product priorities
• Mentor senior engineers and act as a technical leader across teams

🎯 Requirements

• 6+ years leading delivery of complex, distributed systems or SaaS platforms
• Strong experience with multi-region, split-plane architectures (control-plane / data-plane)
• Proven track record improving SLOs, MTTR, and system reliability at scale
• Proficiency in languages like Python, Java, C++, or JavaScript
• Deep experience with:
  • Kubernetes (multi-cluster), CI/CD, and GitOps (ArgoCD)
  • SLO/SLA design, observability, and incident management
  • Infrastructure as Code and cloud platforms
  • Disaster recovery, resilience, and security best practices
• Strong leadership skills with experience mentoring senior engineers and influencing cross-team decisions

Nice to Have

• Experience with chaos engineering and large-scale reliability automation
• Background in enterprise SaaS platforms or split-plane architectures
• Expertise in navigating, understanding and leveraging modern Observability platforms (Datadog, Grafana, etc)

🏖️ Benefits

• bonus or commission
• medical
• retirement
• financial
• wellness
• time off
• employee discounts
