Senior Site Reliability Engineer

Datavail

1001 - 5000 employees

Founded 2007

Cloud • Data Management • Analytics

Datavail is a comprehensive IT solutions provider specializing in analytics, application development, database administration, and cloud services. Their offerings span advanced analytics consulting, cloud migration and management, and application modernization, ensuring clients can leverage their data for competitive advantage. Datavail focuses on delivering tailored solutions to enhance data management and operational efficiency across various sectors.

📋 Description

• Define and maintain SLIs/SLOs, monitor alignment and error budget usage
• Lead incident response and postmortems, implement corrective measures
• Automate operations tasks via tooling (e.g. auto-remediation, scaling rules)
• Build, improve, and maintain CI/CD pipelines, canary deployments, blue/green strategies
• Lead technical discussions with customers to align on reliability, scalability, and performance requirements
• Drive continuous platform improvements across the service lifecycle, including architecture, monitoring, and operational processes
• Implement and extend observability systems (metrics, tracing, log aggregation)
• Optimize performance and cost by tuning cloud services, autoscaling, resource rightsizing
• Design, deploy, and operate containerized workloads using Docker and Kubernetes in production environments
• Collaborate with dev teams to integrate resilience patterns (circuit breakers, bulkheading)
• Participate in architecture discussions around high availability, disaster recovery
• Mentor mid and junior SREs; conduct reliability design reviews

🎯 Requirements

• 5–8 years of experience in a reliability or operations role
• Cloud-agnostic certification: Terraform Associate, Certified Kubernetes Administrator (CKA), or SRE Foundation
• Cloud provider certification: Professional-level certification in AWS (Solutions Architect), Azure (Solutions Architect Expert), GCP (Professional Cloud Architect), or Oracle Cloud (Architect Professional)
• Solid coding skills (Python, Go, or equivalent)
• Experience with IaC, CI/CD pipelines, and monitoring/observability stacks (Prometheus, Grafana, OpenTelemetry, ELK, Jaeger)
• Experience working with distributed systems and production-scale services

Similar Jobs

🔥 13 hours ago

Applaudo

501 - 1000

☁️ SaaS

🤖 Artificial Intelligence

🔒 Cybersecurity

Senior Azure DevOps Engineer managing enterprise-scale Azure environments. Designing and implementing cloud platforms and automation solutions for operational excellence.

Azure

Cloud

Python

Terraform

🕒 2 days ago

Jalasoft

1001 - 5000

☁️ SaaS

📚 Education

Mid Senior DevOps Engineer at Jalasoft designing and implementing cloud infrastructure using GCP, Terraform, and GitLab. Collaborating with teams to enhance DevOps best practices and workflows.

Cloud

Docker

Google Cloud Platform

Kubernetes

Python

Terraform

🕒 3 days ago

Devsu

51 - 200

🤝 B2B

🏢 Enterprise

☁️ SaaS

Senior DevOps Engineer joining a major bank in Latin America's tech team. Focusing on automation, CI/CD, and secure cloud infrastructure.

🗣️🇪🇸 Spanish Required

Ansible

AWS

Azure

Cloud

Docker

Jenkins

Kubernetes

Linux

OpenShift

Terraform

🕒 June 2

CSG

5001 - 10000

DevOps Engineer supporting and scaling cloud and infrastructure operations across AWS and Azure. Seeking hands-on experience with configuration management, automation, and scripting environments.

Ansible

AWS

Azure

Chef

Cloud

Linux

Prometheus

Puppet

Python

Terraform

🕒 June 2

CSG

5001 - 10000

DevOps Engineer enabling SSO, UX, and AI-driven functionality across CSG products. Building and operating scalable cloud-native systems end-to-end with a hands-on approach.

AWS

Cloud

Distributed Systems

Kubernetes

Python

Terraform