Senior Site Reliability Engineer

Emploi pas sur LinkedIn

🕒 il y a 1 mois

🇺🇸 États-Unis – Télétravail

💵 $140 000 - $170 000 / an

⏰ Temps Plein

🟠 Senior

⛑ Ingénieur DevOps & SRE

🦅 Parrain de Visa H1B

info

🗣️🇺🇸🇬🇧 Anglais requis

Postuler Maintenant
Trouver des Emplois à Distance Similaires

📊 Vérifiez votre score de CV pour ce poste

Améliorez vos chances d'obtenir un entretien en vérifiant votre score de CV avant de postuler.

Logo of Coterie

Coterie

11 - 50 employés

👥 B2C

🛍️ eCommerce

🛒 Commerce de détail

B2C • eCommerce • Retail

Coterie est une entreprise dédiée à fournir des solutions de couchage haut de gamme pour les parents modernes. Leurs produits sont conçus pour offrir une douceur exceptionnelle, une haute absorption, et réduire les risques de fuites, débordements et éruptions cutanées. Les offres de Coterie incluent différentes options de couchage telles que The Diaper et The Pant, ainsi que des lingettes, pour assurer à la fois le confort des bébés et la tranquillité d'esprit des parents. L'entreprise met l'accent sur la sécurité, offrant des produits testés dermatologiquement, hypoallergéniques, fabriqués à partir de matériaux de qualité vestimentaire. Coterie propose également la commodité d'un service d'abonnement Auto-Renew, permettant aux clients de recevoir des livraisons régulières et de réaliser des économies. La marque est reconnue avec plusieurs récompenses pour ses solutions de couchage, soulignant son engagement envers la qualité et l'innovation dans le soin des bébés.

Description

• Manage and maintain cloud infrastructure on Azure, including Azure Kubernetes Service (AKS) clusters and supporting resources • Build, improve, and maintain CI/CD pipelines using GitHub Actions to support reliable and repeatable deployments • Own and enhance our Grafana implementation; designing dashboards, configuring alerts, and supporting incident management workflows • Monitor system health, triage incidents, and drive root cause analysis to prevent recurrence • Collaborate with development teams to define and track SLIs, SLOs, and error budgets that align with business goals • Contribute to infrastructure-as-code practices using Pulumi • Identify and resolve reliability risks through capacity planning, performance tuning, and proactive system improvements • Participate in an on-call rotation to support production systems and respond to incidents • Document runbooks, operational procedures, and architectural decisions to support team knowledge sharing

🎯 Exigences

• 5+ years of experience in a Site Reliability Engineering, DevOps, or Infrastructure role • 3+ years experience working with infrastructure as code • 2+ years of experience architecting CI/CD pipelines and cloud-based infrastructure • Strong hands-on experience with: Azure Cloud services and resource management • Kubernetes and AKS administration, including deployments, networking, and troubleshooting • GitHub Actions for CI/CD pipeline development and maintenance • 3+ experience with Grafana or similar tooling, including dashboard creation, alerting configuration, and incident management • Hands-on experience with Prometheus, Loki, or other observability tools in the Grafana ecosystem • Proficiency in at least one scripting or programming language such as Python or Bash • Understanding of networking fundamentals, DNS, load balancing, and container orchestration concepts • Strong analytical and communication skills; able to diagnose complex system issues and clearly communicate findings • Demonstrated ability to collaborate across teams and contribute to a culture of reliability • Experience working in an agile environment with modern DevOps practices

🏖️ Avantages

• 100% remote • Health insurance through Aetna (we pay 100% of premiums) • Dental and vision insurance through Guardian (we pay 100% of premiums) • Basic life insurance (we pay 100% of premiums) • Access to flexible spending account (FSA) or health savings account (HSA) (for those using HSA eligible plans) • 401K plan (up 4% match with immediate vest). • Must be 21 years of age or older to participate • Flexible PTO policy offering employees up to 4 weeks of PTO in their first 12 months. Thereafter, PTO usage aligns with company standards and typically does not exceed 5 weeks per calendar year. • 12 company-paid holidays each year • Continuing education annual stipend

Postuler Maintenant

Emplois Similaires

🕒 il y a 1 mois

ARUP Laboratories

1001 - 5000

🧬 Biotechnologie

🤝 B2B

📚 Éducation

Delivering innovative products for developing teams at ARUP Laboratories. Engineering high-throughput, cloud-enabled systems for clinical genomics and patient diagnostics.

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 1 mois

Granicus

501 - 1000

🏛️ Gouvernement

☁️ SaaS

📋 Conformité

Site Reliability Engineer ensuring reliability, scalability, and performance of Granicus services. Leading efforts in automation, monitoring, and incident management for cloud-based solutions.

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 1 mois

FICO

1001 - 5000

💸 Finance

🤖 Intelligence artificielle

☁️ SaaS

DevOps Engineer at FICO focusing on secure cloud solutions and Kubernetes expertise. Collaborating with engineering teams to drive reliable and scalable software delivery.

🇺🇸 États-Unis – Télétravail

💵 $101 500 - $159 500 / an

⏰ Temps Plein

🟡 Intermédiaire

🟠 Senior

⛑ Ingénieur DevOps & SRE

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 1 mois

Senior DevOps Engineer designing, deploying, and scaling platforms with Kubernetes for aviation systems. Working in a fully remote, international team with a modern cloud-native technology stack.

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 1 mois

ImagineX

201 - 500

🤖 Intelligence artificielle

🔒 Cybersecurity

🏢 Entreprise

Senior Azure DevOps Engineer at ImagineX deploying Azure infrastructure and CI/CD pipelines. Collaborating with teams for secure and scalable solutions in a remote environment.

🇺🇸 États-Unis – Télétravail

💰 Private equity en 2023-11

⏰ Temps Plein

🟠 Senior

⛑ Ingénieur DevOps & SRE

🗣️🇺🇸🇬🇧 Anglais requis