Senior Site Reliability Engineer

August 25

🐻 Alaska – Remote

info

🌺 Hawaii – Remote

info

+7 more states

info

💵 $167.2k - $216k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

Apply Now
Logo of Virta Health

Virta Health

Healthcare Insurance • Wellness • Health Tech

Virta Health is a healthcare company focused on reversing type 2 diabetes and promoting sustainable weight loss through a nutrition-first approach. The company offers personalized treatment plans that help individuals reduce or eliminate the need for diabetes medications. Virta collaborates with organizations and healthcare providers to deliver transformative outcomes in metabolic care. Their approach is evidence-backed, emphasizing the importance of lifestyle and dietary changes to achieve lasting health improvements and weight management.

201 - 500 employees

⚕️ Healthcare Insurance

🧘 Wellness

📋 Description

• Virta Health is on a mission to transform diabetes care and reverse the type 2 diabetes epidemic. Current treatment approaches aren’t working—over half of US adults have either type 2 diabetes or prediabetes. • As an SRE on the Infrastructure team at Virta, you will be building the foundation that will help our company move as fast as possible while meeting security and compliance requirements. • Key projects for the team over the next two quarters include: Implement an AI‑driven observability and metrics platform that automatically detects anomalies and highlights SLO risks, enabling product teams to make data‑driven decisions. • Enhancing system observability, reliability, and efficiency using off-the-shelf technology combined with internal tools developed in Python and Go to increase transparency and visibility into our systems as well as centralizing data. • Building out more products for our Product Development teams like observability (SLOs, alerting, dashboards) modules to allow them to spin up an MVP out of the box. • Improving incident readiness with better tooling and the right hygiene practices such as game days. • Engage with feature development teams in toil reduction exercises, capacity planning, load testing, SLO process, and other best practices — partnering with product teams to replace manual capacity planning with predictive/AI-driven scaling models and to codify self-healing runbooks that minimize toil • Improving the velocity and quality of our developer platform and tooling • General AI fluency desired: comfortable with concepts like prompt engineering, operational chatbots, and AI-assisted workflows to accelerate incident response and reliability improvements • We are in the midst of re-defining our incident response tooling/strategy, improving test tooling, and developing a strategy to ensure all applications are performant and available. Joining Virta would make you one of the key people defining and driving the future vision of what reliability and observability should look like.

🎯 Requirements

• Highly proficient in shipping backend code in high-quality production environments, with strong hands-on coding and automation expertise, and a deep understanding of reliability and production readiness practices • Hands-on expertise with automation and infrastructure-as-code (Terraform modules preferred), ideally with experience in observability • Experience designing and implementing highly observable, scalable systems — with a proven track record configuring AIOps / ML-based monitoring platforms — that support large numbers of users while reducing operational burden • Applied and general AI fluency: ability to leverage AI/ML-assisted observability (e.g., anomaly detection, error-budget burn prediction) while also being comfortable with concepts like prompt engineering, operational chatbots, and AI-assisted workflows to accelerate incident response and reliability improvements • Growth mindset and craftsmanship: ability to coach, mentor, and evangelize AI-first insights while continually improving engineering practices and following best practices

🏖️ Benefits

• Information about Virta’s benefits is on our Careers page at: https://www.virtahealth.com/careers

Apply Now

Similar Jobs

August 22

NVIDIA

10,000+ employees

🤖 Artificial Intelligence

🎮 Gaming

Designs, builds and maintains large-scale Observability and Telemetry platforms at NVIDIA. Drives reliability, automation and incident response.

🇺🇸 United States – Remote

💵 $168k - $333.5k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

August 20

Gov Services Hub

51 - 200

🏛️ Government

🔒 Cybersecurity

🎯 Recruiter

Salesforce DevOps Architect providing leadership for multiple Salesforce teams. Managing CI/CD pipelines and enforcing development standards in a remote role.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

August 20

TensorWave

11 - 50

🤖 Artificial Intelligence

🏢 Enterprise

☁️ SaaS

Senior SRE building scalable, secure infra for AI compute at TensorWave. Designs low-level systems and automates infrastructure.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

August 20

Atolio

11 - 50

🤖 Artificial Intelligence

🏢 Enterprise

☁️ SaaS

Deployment Engineer at Atolio: ensure secure, scalable deployments of enterprise search across environments; build automation and collaborate with success teams.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

August 19

Syniti

1001 - 5000

🤝 B2B

🏢 Enterprise

Senior DevOps Engineer at Syniti builds CI/CD pipelines and cloud automation; mentors teams and optimizes DevOps practices for scalable data platform.

🇺🇸 United States – Remote

💰 Private Equity Round on 2017-08

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com