Site Reliability Engineer

Stelle nicht auf LinkedIn

🕒 vor 1 Monat

🇺🇸 Vereinigte Staaten – Remote

💵 $180.000 - $250.000 / Jahr

⏰ Vollzeit

🟡 Mittelstufe

🟠 Senior

⛑ DevOps- und Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 Englisch erforderlich

Jetzt Bewerben
Ähnliche Remote-Jobs finden

📊 Überprüfen Sie Ihre Lebenslauf-Bewertung für diese Stelle

Verbessern Sie Ihre Chancen auf ein Vorstellungsgespräch, indem Sie Ihre Lebenslauf-Bewertung vor der Bewerbung überprüfen.

Logo of AceHack 4.0

AceHack 4.0

11 - 50 Mitarbeiter

Gegründet 2022

⚡ Produktivität

☁️ SaaS

Software • Productivity • SaaS

AceHack 4. 0 ist ein Technologieunternehmen, das sich auf die Entwicklung von Softwareanwendungen spezialisiert hat, insbesondere auf Lösungen zur Fehlerbehebung und Handhabung von Fehlern. Das Unternehmen hat das Ziel, Unternehmen dabei zu unterstützen, client-seitige Ausnahmen zu identifizieren und zu lösen, um die Benutzererfahrung und die Anwendungsleistung zu verbessern.

Beschreibung

• Own reliability, availability, and performance of production systems running in cloud environments • Define and monitor SLIs/SLOs and help manage error budgets across the platform • Lead incident response efforts including detection, triage, mitigation, and postmortems • Improve observability through logging, monitoring, alerting, and dashboards • Automate operational workflows and reduce manual toil wherever possible • Partner closely with engineering teams to improve system resiliency and scalability • Assist with capacity planning, infrastructure optimization, and performance tuning • Build internal tooling, runbooks, and operational best practices • Support Kubernetes-based infrastructure and distributed systems at scale • Act as an escalation point for complex production and platform issues

🎯 Anforderungen

• 5+ years of experience in Site Reliability Engineering, DevOps, Platform Engineering, or related infrastructure roles • Strong experience with cloud platforms such as AWS, GCP, or Azure • Hands-on experience with Kubernetes and containerized environments • Strong understanding of distributed systems and microservices architecture • Experience with observability tools such as Prometheus, Grafana, Datadog, ELK, or OpenTelemetry • Proficiency with infrastructure automation and scripting (Terraform, Python, Bash, etc.) • Experience managing CI/CD pipelines and deployment automation • Strong troubleshooting and incident management skills • Ability to work cross-functionally and communicate effectively during high-pressure situations

🏖️ Vorteile

• Comprehensive health coverage including medical, dental, and vision • Flexible PTO • Support for personal development

Jetzt Bewerben

Ähnliche Jobs

🕒 vor 1 Monat

NVIDIA

10.000+ Mitarbeiter

🤖 Künstliche Intelligenz

🎮 Gaming

Senior Network Reliability Engineer maintaining NVIDIA's cloud and datacenter networks. Engaging in global support and driving operational improvements across teams.

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 1 Monat

NetBox Labs

11 - 50

🤝 B2B

☁️ SaaS

🏢 Unternehmen

Senior DevOps Engineer joining NetBox Labs Cloud Delivery team to enhance AWS infrastructure. Leading projects and mentorship within a fast-paced DevOps environment.

🇺🇸 Vereinigte Staaten – Remote

💵 $165.000 - $185.000 / Jahr

⏰ Vollzeit

🟠 Senior

⛑ DevOps- und Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 1 Monat

Launch Potato

51 - 200

📱 Medien

👥 B2C

Lead Engineer overseeing Launch Potato's cloud infrastructure and SRE function. Evolving CI/CD platform, compliance posture, and leading AWS multi-account migration.

🇺🇸 Vereinigte Staaten – Remote

⏰ Vollzeit

🟠 Senior

⛑ DevOps- und Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 1 Monat

Launch Potato

51 - 200

📱 Medien

👥 B2C

Lead SRE/DevOps Engineer at Launch Potato evolving cloud infrastructure and CI/CD platform. Owning SRE function development for faster product team performance without compromising reliability or security.

🇺🇸 Vereinigte Staaten – Remote

⏰ Vollzeit

🟠 Senior

⛑ DevOps- und Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 Englisch erforderlich

🕒 vor 1 Monat

Launch Potato

51 - 200

📱 Medien

👥 B2C

Lead DevOps/SRE Engineer evolving cloud infrastructure at Launch Potato. Building an SRE function to enable faster shipping of products while maintaining reliability and cost control.

🇺🇸 Vereinigte Staaten – Remote

⏰ Vollzeit

🟠 Senior

⛑ DevOps- und Site Reliability Engineer (SRE)

🗣️🇺🇸🇬🇧 Englisch erforderlich