Site Reliability Engineer

Ähnliche Remote-Jobs finden

11 - 50 Mitarbeiter

Gegründet 2018

🔌 API

📡 Telekommunikation

API • Software • Telecommunications

Ditto ist eine Plattform, die Peer-to-Peer-Datensynchronisation über verschiedene Geräte, einschließlich mobiler Geräte und IoT, selbst im Offline-Modus ermöglicht. Sie bietet ein flexibles SDK, das nahtlos in bestehende Anwendungen integriert werden kann, um einen reibungslosen Datenfluss und Echtzeitaktualisierungen zu gewährleisten. Durch die Unterstützung mehrerer Programmiersprachen und die Bereitstellung automatischer Konfliktlösungen stellt Ditto sicher, dass Entwickler Anwendungen schnell modernisieren können, während sie eine hohe Datenzuverlässigkeit und Konnektivität aufrechterhalten.

Site Reliability Engineer

🕒 vor 2 Monaten

🇺🇸 Vereinigte Staaten – Remote

💵 $156.000 - $288.000 / Jahr

⏰ Vollzeit

🟡 Mittelstufe

🟠 Senior

⛑ DevOps- und Site Reliability Engineer (SRE)

🦅 H1B-Visum-Sponsor

🗣️🇺🇸🇬🇧 Englisch erforderlich

AWS

Azure

Cloud

Google Cloud Platform

Grafana

Java

Kubernetes

Prometheus

Rust

Terraform

Jetzt Bewerben

📊 Überprüfen Sie Ihre Lebenslauf-Bewertung für diese Stelle

Verbessern Sie Ihre Chancen auf ein Vorstellungsgespräch, indem Sie Ihre Lebenslauf-Bewertung vor der Bewerbung überprüfen.

Ditto

11 - 50 Mitarbeiter

Gegründet 2018

🔌 API

📡 Telekommunikation

API • Software • Telecommunications

Beschreibung

• Develop and maintain observability solutions using platforms like Datadog, Prometheus and Grafana • Take a leading role in incident management, including coordinating response efforts, troubleshooting issues, and identifying follow-up actions • Partner with product engineering teams to architect reliable systems, recover from incidents, and learn from mistakes • Work with teams to implement and maintain SLOs, monitoring, and alerting strategies that ensure reliability at scale • Design and implement automation and support tooling to improve system resilience, maintain operational safety and reduce operational overhead • Lead the development and maintenance of runbooks, alert definitions, and incident response procedures • Participate in on-call rotations to provide 24/7 support for critical production systems

🎯 Anforderungen

• 4+ years of experience in Site Reliability Engineering or similar DevOps roles focused on system reliability and incident management • 2+ years of hands-on experience architecting applications for Kubernetes, and managing Kubernetes infrastructure • Strong experience with modern monitoring stacks including Prometheus, Grafana, and Datadog • Experience in at least one systems programming language, such as Go, Rust, C, or Java • Expertise with Infrastructure as Code tools, like Terraform and Helm • Expertise with at least one major cloud service provider (AWS, GCP, Azure) • Strong communication skills, with the ability to lead incident response and effectively collaborate across teams • Willingness and experience engaging with on-call rotations and emergency response procedures • A high degree of agency and bias towards action. Identify problems and work autonomously to solve them • Excellent problem-solving skills and a methodical approach to troubleshooting complex issues

🏖️ Vorteile

• Health insurance • Dental insurance • Vision insurance • Life insurance • Disability insurance • 401(k) • Flexible spending accounts • Flexible time off

Jetzt Bewerben

Ähnliche Jobs

Site Reliability Engineer, SRE

🕒 vor 2 Monaten

CAKE.com

201 - 500

⚡ Produktivität

☁️ SaaS

🏢 Unternehmen

SRE managing scalable infrastructure for CAKE.com, ensuring seamless user experience and high traffic handling. Involves automation, monitoring, and incident resolution processes.

🇺🇸 Vereinigte Staaten – Remote

⏰ Vollzeit

🟡 Mittelstufe

🟠 Senior

⛑ DevOps- und Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 Englisch erforderlich

Ansible

AWS

Docker

Jenkins

Linux

Packer

Puppet

Terraform

Unix

Senior Site Reliability Engineer – SRE

🕒 vor 2 Monaten

Moonlite

1 - 10

📚 Bildung

🏪 Marktplatz

👥 B2C

Build and operate production-grade AI infrastructure for organizations running intensive computational research. Leverage deep Kubernetes expertise for high-performance workloads.

🇺🇸 Vereinigte Staaten – Remote

💵 $165.000 - $225.000 / Jahr

⏰ Vollzeit

🟠 Senior

⛑ DevOps- und Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 Englisch erforderlich

Ansible

DNS

Grafana

Kubernetes

Linux

Prometheus

Python

Terraform

Senior DevOps Engineer

🕒 vor 2 Monaten

Owner.com

201 - 500

☁️ SaaS

🤝 B2B

🏪 Marktplatz

Senior DevOps Engineer evolving and operating Owner’s cloud platform. Design systems for reliability, security, and developer productivity as we scale.

🇺🇸 Vereinigte Staaten – Remote

💵 $190.000 - $240.000 / Jahr

💰 €120.000.000 Series C - Owner im 2025-05

⏰ Vollzeit

🟠 Senior

⛑ DevOps- und Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 Englisch erforderlich

AWS

Cloud

Kubernetes

Terraform

Meanstack Architect, DevOps

🕒 vor 2 Monaten

Vytwo Technologies Inc

201 - 500

🤝 B2B

🏢 Unternehmen

🎯 Rekrutierung

Meanstack Architect with DevOps expertise for TCoE, designing scalable applications and leading technical teams in a fully remote environment.

🇺🇸 Vereinigte Staaten – Remote

💵 $45 - $50 / Stunde

⏰ Vollzeit

🟡 Mittelstufe

🟠 Senior

⛑ DevOps- und Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 Englisch erforderlich

Angular

AWS

Azure

Cloud

Docker

JavaScript

Kubernetes

Microservices

MongoDB

Node.js

Senior DevOps Engineer

🕒 vor 2 Monaten

Truv

51 - 200

Senior DevOps Engineer architecting and scaling AWS infrastructure and building observability platforms. Leading compliance projects and optimizing CI/CD pipelines in a remote setup.

🇺🇸 Vereinigte Staaten – Remote

💵 $100.000 - $140.000 / Jahr

⏰ Vollzeit

🟠 Senior

⛑ DevOps- und Site Reliability Engineer (SRE)

🦅 H1B-Visum-Sponsor

🗣️🇺🇸🇬🇧 Englisch erforderlich

AWS

Cloud

Kubernetes

Postgres

Redis

Terraform