Senior Site Reliability Engineer

51 - 200 employees

🤖 Artificial Intelligence

🔌 API

💰 €30,000,000 Series B in 2022-11

Artificial Intelligence • API • Automotive

Parallel Domain provides an API that lets machine learning, computer vision, and perception teams generate high-fidelity synthetic sensor data, including camera, lidar, and radar data. This data supports training and testing perception models by simulating scenarios in procedurally generated worlds or in replicas of any real-world location. The platform supplies high-quality synthetic data for analyzing, training, evaluating, and monitoring perception models, improving AI reliability while reducing risk, development time, and cost. Parallel Domain supports a range of perception use cases across multiple industries, such as automotive and drones, by offering varied datasets with edge cases and precise annotations, which improves the performance of machine learning models on tasks such as emergency vehicle detection and traffic light system classification.

Senior Site Reliability Engineer

🕒 1 month ago

🌲 Oregon, Washington – Remote

💵 $145,000 - $185,000 / year

⏰ Full Time

🟠 Senior

⛑ DevOps & SRE Engineer

🗣️🇺🇸🇬🇧 English required

AWS

Cloud

DNS

Grafana

Kubernetes

Linux

Node.js

Prometheus

Python

Terraform

Description

• Design, build, and maintain multi-region AWS infrastructure using Terraform.
• Operate and scale EKS clusters across production regions: autoscaling, node lifecycle, workload health.
• Manage networking across environments: VPC design, DNS, load balancing, and cross-region connectivity.
• Support infrastructure changes, migrations, and expansions into new regions.
• Help build and run incident management processes: severity definitions, escalation paths, on-call practices.
• Lead incident response, debugging, and root-cause analysis.
• Write postmortems and drive systemic reliability improvements from what they surface.
• Improve observability across metrics, logging, tracing, and dashboards.
• Provide security-conscious feedback on platform architecture decisions.
• Own cloud IAM governance: roles, policies, and access boundaries across accounts and services.
• Improve CI/CD pipelines and infrastructure validation.
• Support engineers with infrastructure debugging, environment setup, and performance issues.
• Contribute to tooling and automation in Python and Bash.

🎯 Requirements

• 5+ years in SRE, DevOps, or infrastructure engineering roles, with a track record of operating production systems across multiple regions.
• Terraform experience: Modules, state management, and multi-environment patterns.
• AWS depth: Solid experience across VPC, IAM, EKS, S3, and CloudWatch.
• Kubernetes expertise: Cluster operations, autoscaling, RBAC, and Helm.
• CI/CD and GitOps: Experience with GitHub Actions, ArgoCD, or similar workflows.
• Networking fundamentals: CIDR, DNS, load balancing, VPN, and cross-region connectivity.
• Observability: Experience with tooling such as Prometheus and Grafana.
• Scripting: Comfort with Python and Bash for tooling and automation.
• Cross-platform familiarity: Working knowledge of both Linux and Windows environments. Operational experience supporting Windows-based workloads is a meaningful advantage.
• Pragmatism and ownership: Comfortable in a fast-moving startup with evolving priorities. You take ownership of systems while collaborating closely with other teams, and you're pragmatic about tradeoffs between speed, reliability, and complexity.

🏖️ Benefits

• equity
• full health/dental/vision coverage
• learning stipend
• generous vacation

Similar Jobs

Senior Manager, Cloud & DevOps Engineering

🕒 1 month ago

Nomi Health

501 - 1000

⚕️ Health Insurance

💸 Finance

☁️ SaaS

Senior Manager of Cloud and DevOps Engineering managing daily operations of AWS and Kubernetes infrastructure across businesses. Leading a team and working closely with senior leadership for operational excellence.

🇺🇸 United States – Remote

💰 €110,000,000 Series A in 2021-12

⏰ Full Time

🟠 Senior

⛑ DevOps & SRE Engineer

🦅 H1B Visa Sponsor

🗣️🇺🇸🇬🇧 English required

AWS

Cloud

Docker

EC2

Kubernetes

Terraform

Senior DevOps Engineer

🕒 1 month ago

Sagent

201 - 500

☁️ SaaS

💳 Fintech

Cloud Infrastructure Engineer managing cloud resources for large-scale infrastructure. Supporting development teams in a microservices environment to streamline deployments and optimize performance.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & SRE Engineer

🦅 H1B Visa Sponsor

🗣️🇺🇸🇬🇧 English required

Airflow

Azure

BigQuery

Cloud

DNS

Google Cloud Platform

Grafana

Kafka

Kubernetes

Matillion

Microservices

Postgres

Prometheus

Python

Redis

Spark

SQL

Terraform

Vault

Senior Site Reliability Engineer – Government, Sovereign Cloud

🕒 1 month ago

Veeam Software

1001 - 5000

☁️ SaaS

🔒 Cybersecurity

🏢 Enterprise

Senior Site Reliability Engineer for Veeam's Government & Sovereign Cloud environments. Building a global SRE function with an emphasis on high availability and operational excellence.

🇺🇸 United States – Remote

💵 $138,900 - $231,400 / year

💰 €500,000,000 Private Equity Round in 2019-01

⏰ Full Time

🟠 Senior

⛑ DevOps & SRE Engineer

🦅 H1B Visa Sponsor

🗣️🇺🇸🇬🇧 English required

AWS

Azure

Cloud

Dagger

Distributed Systems

Grafana

Java

JavaScript

Kubernetes

Prometheus

Terraform

TypeScript

DevOps Engineer

🕒 1 month ago

ImmunityBio, Inc.

501 - 1000

🧬 Biotechnology

⚕️ Health Insurance

💊 Pharmaceutical

DevOps Engineer bridging software development and operations at ImmunityBio, involved in CI/CD and infrastructure automation. Collaborating across teams to support reliable and scalable services.

🇺🇸 United States – Remote

💵 $130,500 - $150,000 / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & SRE Engineer

🗣️🇺🇸🇬🇧 English required

Ansible

Grafana

Jenkins

Kubernetes

Linux

Prometheus

Python

Terraform

Senior DevOps Engineer – Infrastructure, MLOps

🕒 1 month ago

Prompt Therapy Solutions Inc

11 - 50

⚕️ Health Insurance

⚡ Productivity

☁️ SaaS