Senior Site Reliability Engineer – Government, Sovereign Cloud

🕒 1 month ago

🏄 California – Remote

💵 $138,900 - $231,400 / year

⏰ Full Time

🟠 Senior

⛑ DevOps & SRE Engineer

🦅 H1B Visa Sponsor

🗣️🇺🇸🇬🇧 English required

Veeam Software

1001 - 5000 employees

Founded in 2006

☁️ SaaS

🔒 Cybersecurity

🏢 Enterprise

💰 €500,000,000 Private Equity Round in 2019-01

Veeam Software is a global leader in data resilience and data protection, offering self-managed data protection software for hybrid and multi-cloud environments. The Veeam Data Platform provides comprehensive backup, recovery, and data security solutions, incorporating Zero Trust principles and AI tools for data intelligence. Veeam's offering includes secure backup and storage services for platforms such as Microsoft 365, AWS, and Google Cloud, supporting a range of workloads, including virtual, physical, and SaaS environments. Known for its innovation and the trust of its customers, Veeam serves a wide range of industries, ensuring data resilience against disruptions such as ransomware attacks. Its solutions give businesses data freedom, secure storage, and efficient management, reinforcing its position as a leading provider of enterprise backup and recovery software worldwide.

Description

• Get up to speed on the full platform — all VDC workloads, dependencies, and risk areas. Much of this will happen through code, docs, and conversations rather than direct environment access.
• Work with SMEs across the org to fill knowledge gaps and build onboarding material for the team.
• Write and maintain runbooks, architecture docs, and operational guides.
• Design infrastructure for high availability and fault tolerance on Azure (including Azure Government).
• Define SLIs, SLOs, and error budgets where none exist today.
• Run incident response and blameless postmortems. Turn incidents into improvements.
• Identify reliability risks across modern and legacy workloads and build practical remediation plans that work within compliance constraints.
• Close observability gaps — define instrumentation requirements and drive implementation.
• Set alerting, telemetry, and monitoring standards with partner teams.
• Build automation to reduce toil and support fleet management.
• Participate in on-call rotations.
• Work with IaC, CI/CD, deployment automation, and config management — including in air-gapped or compliance-restricted environments.
• Build and maintain testing, canary deployment, and release validation pipelines.
• Integrate chaos engineering and monitoring tools, adapting choices to meet regulatory requirements.
• Work across product, platform, security, legal, compliance, and operations teams.
• Own problems end-to-end — identify gaps, drive solutions, don't wait for direction.
• Mentor other engineers and help spread SRE practices across the org.

🎯 Requirements

• 7+ years in Software Engineering, with 3+ years in SRE, Platform Engineering, or similar — across multi-service platforms, not just single-service environments.
• Experience with Government or Sovereign Cloud (e.g., Azure Government, AWS GovCloud).
• Experience in regulated compliance environments — government (FedRAMP, CMMC, IL2/IL4/IL5), financial (PCI-DSS, SOX), or healthcare (HIPAA, HITRUST). You understand how compliance shapes architecture and operations.
• Strong experience building and running production services on cloud infrastructure (Azure preferred, including Azure Government).
• Able to learn large, complex platforms quickly with limited guidance — comfortable building understanding from code, docs, and architecture artifacts when direct environment access is restricted.
• Can investigate systems independently and produce clear docs, risk assessments, and improvement plans.
• Comfortable working across teams — engineering, product, security, compliance, operations.
• Programming skills in one or more of: TypeScript/JS, Go, Java, C#, or similar.
• Experience with monitoring and observability tools (e.g., Prometheus, Grafana, OpenTelemetry, ELK stack).
• Experience with IaC (Terraform, Terragrunt, Pulumi) and container orchestration (Kubernetes).
• Experience with CI/CD and GitOps tooling — GitHub Actions, Azure DevOps, GitLab CI, ArgoCD, FluxCD, or Dagger.
• Solid grasp of distributed systems, networking, and cloud-native architecture.
• Clear written and verbal communication skills.

🏖️ Benefits

• Unlimited paid time off, 12 paid holidays, plus 4 extra global VeeaMe Days for self-care and 24 paid volunteer hours annually through Veeam Cares
• Paid parental leave: 8 weeks for all parents, 16 weeks for birthing parents
• Medical, dental, and vision coverage starting on your first day
• Mental health support, therapy sessions, and digital wellness tools via our Employee Assistance Program
• 401(k) retirement plan with company matching contributions
• Fertility, adoption, and surrogacy support through Maven, plus paid volunteer time
• AirVet: 24/7 virtual veterinary care at no cost
• Legal services, identity protection, and supplemental health insurance options
• Tax-advantaged spending accounts for healthcare, dependent care, and commuting
• Opportunities to learn and grow through on-demand libraries (LinkedIn Learning, O’Reilly), mentoring, workshops, and learning events like our annual Global Day of Learning
