Senior Site Reliability Engineer, Core Cloud Engineering

201 - 500 employees

Founded in 2014

🤖 Artificial Intelligence

🤝 B2B

🔧 Hardware

🔥 Funding within the last year

💰 €329,000,000 Debt Financing - Vultr, 2025-06

Artificial Intelligence • B2B • Hardware

Vultr is a global cloud infrastructure provider offering on-demand virtual machines, bare-metal servers, GPU-accelerated instances, managed databases, object and block storage, Kubernetes services, and networking. The platform emphasizes AI and high-performance computing (HPC) workloads, with a wide choice of AMD and NVIDIA GPUs, fast networking, more than 32 data center regions, a marketplace of deployable applications, and developer-friendly APIs. Vultr targets developers and enterprises looking for affordable, scalable, and compliant cloud alternatives to the hyperscalers for compute and storage.

🕒 3 months ago

🇺🇸 United States – Remote

💵 $120,000 - $130,000 / year

⏰ Full Time

🟠 Senior

⛑ DevOps & SRE Engineer

🗣️🇺🇸🇬🇧 English required

Distributed Systems

Grafana

Linux

MySQL

PHP

Puppet

Vultr

Description

• Operate and scale Vultr’s control plane, ensuring availability, correctness, and performance across global datacenters.
• Design, implement, and maintain automation to manage hypervisor fleets (KVM, QEMU, libvirt) and supporting infrastructure at scale.
• Develop tooling and automation for Open vSwitch (OVS), BGP routing, and other networking components to ensure resilient and self-healing network operations.
• Continuously analyze and improve system performance across compute, storage, and network layers, with an emphasis on reducing toil and eliminating single points of failure.
• Implement advanced monitoring, logging, and tracing solutions (Grafana, Sentry, SumoLogic) while leading incident response to minimize impact and drive postmortem culture.
• Maintain and evolve infrastructure pipelines (GitLab CI/CD, Puppet) to enable safe, fast, and reliable changes to both control plane and hypervisor infrastructure.
• Work closely with Software Engineers, Network Engineers, and Product teams to align platform reliability with business and user needs.
• Produce clear technical documentation for runbooks, operational procedures, and automation frameworks to improve team efficiency and reliability standards.
• Coach and mentor team members in best practices for site reliability, incident handling, automation, and low-level Linux systems debugging.

🎯 Requirements

• Proficiency in PHP with strong scripting and automation skills.
• Experience running large-scale distributed systems and control plane infrastructure in production.
• Strong background in hypervisor technologies (libvirt, QEMU, KVM) and Linux systems administration.
• Expertise in networking protocols and tools, particularly BGP and Open vSwitch (OVS), with automation experience.
• Deep knowledge of observability and monitoring frameworks (Grafana, Sentry, SumoLogic) and incident management.
• Advanced troubleshooting skills across compute, networking, and storage subsystems.
• Experience building and maintaining CI/CD pipelines (GitLab) and configuration management (Puppet).
• Familiarity with MySQL or similar databases, with an understanding of operational considerations for reliability and scale.
• Strong problem-solving abilities and the drive to tackle complex, low-level reliability challenges.
• Effective cross-team communication and collaboration skills.
• A commitment to continuous improvement and fostering a culture of operational excellence.

🏖️ Benefits

• Excellent Medical Benefits w/ 100% company paid premiums for employee only plan + 100% company paid dental & vision premiums
• 401(k) plan that matches 100% up to 4% with immediate vesting
• Professional Development Reimbursement of $2,500 each year
• 11 Holidays + Paid Time Off Accrual + Rollover Plan
• Increased PTO at 3 year & 10 year anniversary + 1 month paid sabbatical every 5 years + Anniversary Bonus each year
• $500 first year remote office setup + $400 each following year for new equipment
• Internet reimbursement up to $75 per month
• Gym membership reimbursement up to $50 per month
• Company paid Wellable subscription

Similar Jobs

DevOps Engineer – Mission-Critical Systems

🕒 3 months ago

Tactibit Technologies

11 - 50

🔒 Cybersecurity

🏛️ Government

DevOps Engineer working at Tactibit Technologies to modernize legacy architectures for mission-critical systems. Collaborate with teams on cloud migrations and automating business processes.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & SRE Engineer

🗣️🇺🇸🇬🇧 English required

AWS

Cloud

SRE – Platform Engineer

🕒 3 months ago

DroneUp

51 - 200

🚀 Aerospace

☁️ SaaS

🤝 B2B

SRE - Platform Engineer at DroneUp focusing on IT infrastructure reliability and scalability. Driving SRE best practices within the team and collaborating on cloud engineering solutions.

🇺🇸 United States – Remote

💵 $125,000 - $150,000 / year

💰 €241,201 Seed Round - DroneUp, 2022-07

⏰ Full Time

🟠 Senior

🔴 Expert

⛑ DevOps & SRE Engineer

🦅 H1B Visa Sponsor

🗣️🇺🇸🇬🇧 English required

AWS

Azure

Cloud

Google Cloud Platform

Grafana

Kubernetes

Linux

MacOS

Node.js

Prometheus

Python

Terraform

Unix

Senior Database Reliability Engineer

🕒 3 months ago

Filevine

201 - 500

☁️ SaaS

🤖 Artificial Intelligence

Senior DBRE managing performance and scalability of data platform at Filevine, a legal AI company. Focus on AI-driven automation, optimizing SQL Server and Postgres environments.

🇺🇸 United States – Remote

💵 $145,000 - $180,000 / year

💰 €108,000,000 Series D, 2022-04

⏰ Full Time

🟠 Senior

⛑ DevOps & SRE Engineer

🦅 H1B Visa Sponsor

🗣️🇺🇸🇬🇧 English required

AWS

Docker

DynamoDB

Entity Framework

Kubernetes

MS SQL Server

Postgres

Python

Redis

SQL

Terraform

DevSecOps Engineer

🕒 4 months ago

Agile Defense

501 - 1000

🏛️ Government

🔒 Cybersecurity

DevSecOps Engineer building secure software delivery systems for national security missions. Seeking a builder with 3–5 years of relevant experience and a proactive approach to integration challenges.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & SRE Engineer

🗣️🇺🇸🇬🇧 English required

Cloud

Kubernetes

SDLC

Terraform

Vault

Customer Reliability Engineer

🕒 4 months ago

Supabase

51 - 200

☁️ SaaS

🔌 API

🤖 Artificial Intelligence