Site Reliability Engineer

🕒 il y a 3 mois

🇺🇸 États-Unis – Télétravail

💵 $156 000 - $288 000 / an

⏰ Temps Plein

🟡 Intermédiaire

🟠 Senior

⛑ Ingénieur DevOps & SRE

🦅 Parrain de Visa H1B

info

🗣️🇺🇸🇬🇧 Anglais requis

Postuler Maintenant
Trouver des Emplois à Distance Similaires

📊 Vérifiez votre score de CV pour ce poste

Améliorez vos chances d'obtenir un entretien en vérifiant votre score de CV avant de postuler.

Logo of Ditto

Ditto

11 - 50 employés

Fondée en 2018

🔌 API

📡 Télécommunications

API • Software • Telecommunications

Ditto est une plateforme qui permet la synchronisation de données en peer-to-peer entre divers appareils, y compris mobiles et IoT, même en mode hors ligne. Elle propose un SDK flexible qui peut être intégré dans des applications existantes pour un flux de données fluide et des mises à jour en temps réel. En supportant plusieurs langages de programmation et en offrant une résolution automatique des conflits, Ditto permet aux développeurs de moderniser rapidement les applications tout en maintenant une haute fiabilité des données et de la connectivité.

Description

• Develop and maintain observability solutions using platforms like Datadog, Prometheus and Grafana • Take a leading role in incident management, including coordinating response efforts, troubleshooting issues, and identifying follow-up actions • Partner with product engineering teams to architect reliable systems, recover from incidents, and learn from mistakes • Work with teams to implement and maintain SLOs, monitoring, and alerting strategies that ensure reliability at scale • Design and implement automation and support tooling to improve system resilience, maintain operational safety and reduce operational overhead • Lead the development and maintenance of runbooks, alert definitions, and incident response procedures • Participate in on-call rotations to provide 24/7 support for critical production systems

🎯 Exigences

• 4+ years of experience in Site Reliability Engineering or similar DevOps roles focused on system reliability and incident management • 2+ years of hands-on experience architecting applications for Kubernetes, and managing Kubernetes infrastructure • Strong experience with modern monitoring stacks including Prometheus, Grafana, and Datadog • Experience in at least one systems programming language, such as Go, Rust, C, or Java • Expertise with Infrastructure as Code tools, like Terraform and Helm • Expertise with at least one major cloud service provider (AWS, GCP, Azure) • Strong communication skills, with the ability to lead incident response and effectively collaborate across teams • Willingness and experience engaging with on-call rotations and emergency response procedures • A high degree of agency and bias towards action. Identify problems and work autonomously to solve them • Excellent problem-solving skills and a methodical approach to troubleshooting complex issues

🏖️ Avantages

• Health insurance • Dental insurance • Vision insurance • Life insurance • Disability insurance • 401(k) • Flexible spending accounts • Flexible time off

Postuler Maintenant

Emplois Similaires

🕒 il y a 3 mois

CAKE.com

201 - 500

⚡ Productivité

☁️ SaaS

🏢 Entreprise

SRE managing scalable infrastructure for CAKE.com, ensuring seamless user experience and high traffic handling. Involves automation, monitoring, and incident resolution processes.

🇺🇸 États-Unis – Télétravail

⏰ Temps Plein

🟡 Intermédiaire

🟠 Senior

⛑ Ingénieur DevOps & SRE

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 3 mois

Moonlite

1 - 10

📚 Éducation

🏪 Place de marché

👥 B2C

Build and operate production-grade AI infrastructure for organizations running intensive computational research. Leverage deep Kubernetes expertise for high-performance workloads.

🇺🇸 États-Unis – Télétravail

💵 $165 000 - $225 000 / an

⏰ Temps Plein

🟠 Senior

⛑ Ingénieur DevOps & SRE

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 3 mois

Owner.com

201 - 500

☁️ SaaS

🤝 B2B

🏪 Place de marché

Senior DevOps Engineer evolving and operating Owner’s cloud platform. Design systems for reliability, security, and developer productivity as we scale.

🇺🇸 États-Unis – Télétravail

💵 $190 000 - $240 000 / an

💰 €120 000 000 Series C - Owner en 2025-05

⏰ Temps Plein

🟠 Senior

⛑ Ingénieur DevOps & SRE

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 3 mois

Vytwo Technologies Inc

201 - 500

🤝 B2B

🏢 Entreprise

🎯 Recrutement

Meanstack Architect with DevOps expertise for TCoE, designing scalable applications and leading technical teams in a fully remote environment.

🇺🇸 États-Unis – Télétravail

💵 $45 - $50 / heure

⏰ Temps Plein

🟡 Intermédiaire

🟠 Senior

⛑ Ingénieur DevOps & SRE

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 3 mois

Truv

51 - 200

Senior DevOps Engineer architecting and scaling AWS infrastructure and building observability platforms. Leading compliance projects and optimizing CI/CD pipelines in a remote setup.

🗣️🇺🇸🇬🇧 Anglais requis