Director, Data Reliability Engineering

Job not on LinkedIn

🕒 5 days ago

🚗 Michigan – Remote

💵 $128,500 - $276,000 / year

⏰ Full Time

🔴 Expert

⛑ DevOps & SRE Engineer

🦅 H1B Visa Sponsor

🗣️🇺🇸🇬🇧 English required

Rocket Mortgage

10,000+ employees

Founded in 1985

💸 Finance

💳 Fintech

🏠 Real Estate

Rocket Mortgage is a leading online mortgage lender that simplifies the home buying and refinancing process for consumers. It offers a variety of mortgage options, including fixed-rate, adjustable-rate, FHA, and VA loans, along with tools and calculators that help customers understand their financial needs. With a focus on a seamless user experience and a range of promotional offers, Rocket Mortgage is committed to making homeownership more accessible and affordable.

Description

• Lead Engineering teams responsible for improving the reliability, observability, recoverability, and operational maturity of enterprise data platforms
• Define reliability standards for databases, data warehouses, pipelines, jobs, storage, access patterns, and supporting infrastructure
• Establish operating expectations for monitoring, alerting, logging, incident response, change management, backup/recovery, disaster recovery, patching, access controls, service ownership, and operational readiness
• Create metrics that measure platform health, data freshness, data quality, recovery readiness, incident trends, operational risk, compliance alignment, and business impact
• Lead current-state assessments of systems, data flows, operational processes, observability, access patterns, and reliability gaps
• Convert assessment findings into executable roadmaps that improve platform stability, data trust, security alignment, and operational predictability
• Support migration and modernization programs involving on-premise platforms, AWS, Snowflake, and related enterprise data systems
• Build durable operating mechanisms, including reliability reviews, service health reviews, incident reviews, operational readiness reviews, risk reviews, roadmap reviews, and executive reporting
• Develop senior technical talent and create the leadership structure required to scale Data Reliability Engineering over time

🎯 Requirements

• 10+ years of experience in data infrastructure, database engineering, data platform engineering, cloud infrastructure, site reliability engineering, or related technical disciplines
• 5+ years of experience leading engineering teams responsible for production systems, databases, data platforms, infrastructure platforms, or reliability engineering
• Strong understanding of enterprise data infrastructure, including databases, data warehouses, pipelines, storage, compute, backup/recovery, resiliency, and production operations
• Experience improving reliability practices across complex production environments, including observability, monitoring, incident response, change management, disaster recovery, and lifecycle management
• Experience establishing service health metrics, data reliability metrics, operational maturity indicators, and executive-level reporting
• Strong understanding of enterprise security, compliance, access management, auditability, operational controls, and infrastructure standards
• Proven ability to create structure in ambiguous environments, set clear priorities, influence across teams, and translate technical reliability work into business outcomes

🏖️ Benefits

• Perks and health benefits for you and your family
• Support for individual needs
• Peace of mind with our offerings

Similar Jobs

🕒 6 days ago

Coinbase

1001 - 5000

₿ Crypto

💸 Finance

💳 Fintech

Staff Site Reliability Engineer driving AI transformation by ensuring reliability and automation at Coinbase. Collaborating with infrastructure teams and leading critical incident responses to maintain service excellence.

🇺🇸 United States – Remote

💵 $218,025 - $256,500 / year

💰 €21,400,000 Post-IPO Equity in 2022-11

⏰ Full Time

🔴 Expert

⛑ DevOps & SRE Engineer

🦅 H1B Visa Sponsor

🗣️🇺🇸🇬🇧 English required

🕒 6 days ago

Aya Healthcare

5001 - 10000

⚕️ Health Insurance

🎯 Recruiting

Lead the SRE team at Aya Healthcare to enhance product reliability and operational efficiency. Manage incident responses and AI-native operations for a top healthcare workforce solutions provider.

🇺🇸 United States – Remote

💵 $230,000 - $255,000 / year

⏰ Full Time

🟠 Senior

🔴 Expert

⛑ DevOps & SRE Engineer

🦅 H1B Visa Sponsor

🗣️🇺🇸🇬🇧 English required

🕒 6 days ago

MKS2 Technologies

201 - 500

🤝 B2B

🔒 Cybersecurity

Site Reliability Systems Engineer working with monitoring tools to enhance VA's infrastructure reliability. Collaborating across teams to resolve outages and improve service quality for veterans.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

🔴 Expert

⛑ DevOps & SRE Engineer

🗣️🇺🇸🇬🇧 English required

🕒 8 days ago

NVIDIA

10,000+ employees

🤖 Artificial Intelligence

🎮 Video Games

Site Reliability and Software Engineering leader managing NVIDIA's DGX Cloud computing services. Overseeing team operations and driving technical project success in an innovative environment.

🗣️🇺🇸🇬🇧 English required

🕒 8 days ago

Leidos

10,000+ employees

🔒 Cybersecurity

🔬 Science

DevSecOps Engineer automating delivery infrastructure for mission-critical software at Leidos. Building CI/CD pipelines and maintaining security compliance in cloud environments.

🇺🇸 United States – Remote

💵 $107,900 - $195,050 / year

⏰ Full Time

🟠 Senior

🔴 Expert

⛑ DevOps & SRE Engineer

🦅 H1B Visa Sponsor

🗣️🇺🇸🇬🇧 English required