Site Reliability Engineer

🔥 0 minutes ago

Core Specialty Insurance Holdings, Inc.

501 - 1000 employees

Insurance

Core Specialty Insurance Holdings, Inc. offers, through its subsidiary insurers, a diversified range of specialty insurance products tailored for small to mid-sized companies. The company specializes in niche markets and provides solutions such as property and short-tail agriculture insurance, equine, marine and energy, specialty casualty, and healthcare professional liability insurance, among others. Core Specialty focuses on local distribution and superior underwriting knowledge, executing its business model through multiple divisions that operate with a high degree of autonomy. It has the capital, the underwriting talent, and a proven track record to meet its clients' toughest insurance needs quickly and effectively.

📋 Description

• Ensure the availability, scalability, performance, and resiliency of enterprise cloud platforms across Azure and AWS environments
• Combine software engineering, automation, and infrastructure expertise to operationalize reliability engineering practices
• Drive cloud-native resiliency patterns and enable business-critical applications to meet defined SLAs, SLOs, and compliance requirements
• Partner with engineering, security, and operations teams to implement observability and incident response frameworks
• Design and implement highly available, fault-tolerant architectures using cloud-native services (microservices, containers, serverless)
• Define and operationalize SLOs, SLIs, and error budgets for critical applications and platforms
• Build and maintain Infrastructure as Code (IaC) (Terraform)
• Develop automated remediation and self-healing capabilities to reduce MTTR and improve system resilience
• Establish enterprise-level monitoring, logging, and observability frameworks (Datadog, Azure Monitor)
• Drive cost optimization (FinOps) initiatives
• Support DR/BCP strategy execution, including failover testing and regional isolation validation
• Collaborate with application teams to embed reliability engineering practices into CI/CD pipelines

🎯 Requirements

• 5+ years experience in Site Reliability Engineering, DevOps, or Cloud Engineering
• Strong expertise in cloud platforms (Azure, AWS)
• Deep understanding of cloud-native architecture patterns (microservices, containers, serverless)
• Proficiency in Infrastructure as Code (Terraform, ARM/Bicep)
• Experience with observability platforms (Datadog, Azure Monitor)
• Knowledge of CI/CD pipelines and GitOps practices
• Expertise in system reliability concepts: SLI / SLO / SLA management
• Familiarity with security, compliance, and regulatory controls (SOC, ISO)
• Proven experience supporting mission-critical production systems at scale
• Hands-on experience with incident management and on-call operations
• Experience implementing automated monitoring, alerting, and remediation frameworks
• Exposure to regulated environments (insurance, financial services) preferred
• Demonstrated ability to work across cross-functional architecture, engineering, and operations teams

🏖️ Benefits

• Medical, dental, vision, and life insurance
• Short- and long-term disability
• 401(k) plan with a 100% company match on a 6% contribution
• Employee Assistance Plan
• Health Savings Account
• Flexible Spending Account
• Health Reimbursement Account
• Wellness program

Similar Jobs

🔥 5 minutes ago

CVS Health

10,000+ employees

⚕️ Healthcare Insurance

🛒 Retail

🧘 Wellness

Data DevOps Platform Engineer supporting data platforms' stability and reliability at CVS Health. Collaborating with cross-functional teams on Dev, QA, and Production environments in both on-prem and cloud settings.

🔥 13 minutes ago

Manulife

10,000+ employees

💸 Finance

⚕️ Healthcare Insurance

Lead Power Platform Reliability Engineer enhancing enterprise-level applications for Manulife. Engage with stakeholders and mentor teammates on Power Platform solutions.

🔥 1 hour ago

Casa Inc.

11 - 50 employees

💸 Finance

₿ Crypto

🔐 Security

DevSecOps Engineer securing customer data and overseeing security practices for Casa's Bitcoin solutions. Collaborating with the Security team to develop effective security programs and processes.

🔥 1 hour ago

Granicus

501 - 1000 employees

🏛️ Government

☁️ SaaS

📋 Compliance

DevOps Engineer I role focused on cloud platform reliability and efficiency, supporting software delivery and infrastructure automation. Collaborating with teams to resolve technical issues and improve systems.

🔥 1 hour ago

66degrees

501 - 1000 employees

🤖 Artificial Intelligence

Workspace Engineer providing technical support and managing Google Workspace deployments. Empowering organizations through successful migrations and technical solutions in cloud environments.