Site Reliability Engineer

🔥 0 minutes ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Ad Hoc LLC

Ad Hoc LLC

501 - 1000 employees

Founded 2014

🏛️ Government

🤖 Artificial Intelligence

🔌 API

Government • Artificial Intelligence • API

Ad Hoc LLC is a technology consulting firm that modernizes digital government services through human-centered design, platform and mobile development, and systems transformation. The company specializes in data interoperability, API infrastructure, DevOps and cybersecurity practices, and AI-driven/agentic automation to help federal, state, and local agencies scale services, reduce costs, and improve user experiences. Notable work includes large-scale Medicare enrollment processing, the VA Health and Benefits mobile app, and enabling access to billions of health claims via modern APIs.

📋 Description

• Help ensure the availability, performance, and reliability of a large federal enterprise cloud platform • Monitor platform health and support service level objectives (SLOs), service level indicators, and error budgets • Build and maintain observability tooling, including metrics, logging, alerting, and dashboards • Participate in on-call rotations and incident response, helping restore service and reduce time to recovery • Contribute to blameless postmortems and drive follow-up actions • Automate repetitive operational tasks to reduce toil • Support capacity planning and performance tuning across cloud infrastructure (AWS) and Kubernetes (Amazon EKS) • Implement reliability improvements as infrastructure as code (Terraform) • Work with government partners and application teams to meet security, SLA, and performance requirements • Support recruiting efforts by evaluating exercises and assisting with interviews

🎯 Requirements

• Bachelor's and 5+ years of experience; relevant experience may be substituted for education • Experience with monitoring and observability tooling and on-call operations • Proficient with at least one infrastructure-as-code tool (Terraform preferred) • Background in key DevOps concepts: containerization, networking, and cloud infrastructure • Must be able to obtain and maintain a U.S. Public Trust / suitability determination

🏖️ Benefits

• Company-subsidized health, dental, and vision insurance • Flexible PTO • 401K with employer match • Paid parental leave after one year of service • Employee Assistance Program

Apply Now

Similar Jobs

🔥 7 hours ago

Site Reliability Engineer assuring enterprise cloud platform performance across Azure and AWS at Core Specialty. Implementing reliability practices, observability frameworks, and automation for business-critical applications.

🔥 7 hours ago

CVS Health

10,000+ employees

⚕️ Healthcare Insurance

🛒 Retail

🧘 Wellness

Data DevOps Platform Engineer supporting data platforms' stability and reliability at CVS Health. Collaborating with cross-functional teams on Dev, QA, and Production environments in both on-prem and cloud settings.

🔥 7 hours ago

Manulife

10,000+ employees

💸 Finance

⚕️ Healthcare Insurance

Lead Power Platform Reliability Engineer enhancing enterprise-level applications for Manulife. Engage with stakeholders and mentor teammates on Power Platform solutions.

🔥 9 hours ago

Casa Inc.

11 - 50

💸 Finance

₿ Crypto

🔐 Security

DevSecOps Engineer securing customer data and overseeing security practices for Casa's Bitcoin solutions. Collaborating with the Security team to develop effective security programs and processes.

🔥 9 hours ago

Granicus

501 - 1000

🏛️ Government

☁️ SaaS

📋 Compliance

DevOps Engineer I role focused on cloud platform reliability and efficiency, supporting software delivery and infrastructure automation. Collaborating with teams to resolve technical issues and improve systems.