SRE – Infra

Emploi pas sur LinkedIn

🕒 il y a 2 mois

🇺🇸 États-Unis – Télétravail

⏰ Temps Plein

🟡 Intermédiaire

🟠 Senior

⛑ Ingénieur DevOps & SRE

🗣️🇺🇸🇬🇧 Anglais requis

Postuler Maintenant
Trouver des Emplois à Distance Similaires

📊 Vérifiez votre score de CV pour ce poste

Améliorez vos chances d'obtenir un entretien en vérifiant votre score de CV avant de postuler.

Logo of PostHog

PostHog

11 - 50 employés

Fondée en 2020

☁️ SaaS

⚡ Productivité

🏢 Entreprise

SaaS • Productivity • Enterprise

PostHog est une plateforme complète qui permet aux développeurs de créer des produits réussis grâce à des outils pour l'analyse de produits, l'analyse web, la relecture de session, les feature flags, les expérimentations et les sondages. Elle s'intègre parfaitement dans les flux de travail existants, offrant des solutions de pipelines de données et de stockage qui se synchronisent avec des plateformes populaires comme Stripe, Hubspot, Zendesk, et bien d'autres. Avec PostHog, les équipes peuvent déployer en toute sécurité de nouvelles fonctionnalités, réaliser des expériences avec une signification statistique, et recueillir des insights approfondis grâce à des produits IA et LLM. La plateforme est construite avec un accès complet à l'API, permettant un contrôle total sur les données des clients. PostHog évolue avec les entreprises, des startups aux stades de croissance, en faisant un outil polyvalent pour les équipes d'ingénierie cherchant à rationaliser leurs opérations de données tout en se concentrant sur le développement de produits.

Description

• You won’t be in a typical “keep the lights on” SRE role. The work is about turning a fast-growing, stateful system into a predictable, well-automated platform. • Operating EKS clusters across several environments with Karpenter autoscaling, Cilium networking, and ArgoCD-driven GitOps deployments • Managing and evolving a multi AWS account organization, provisioning, networking, access control, and cross-account connectivity • Maintaining the Terraform/Terragrunt IaC platform - modules, automated plan-on-PR / apply-on-merge pipelines, and safe patterns for shared infrastructure • Improving operational tooling around deploys, schema changes, backups, restores, and incident response • Reducing operational load by identifying repeat pain points and eliminating them through code and self-healing automation • Optimizing cloud spend as you go • Participating in on-call and incident response, with a strong focus on making incidents rarer over time.

🎯 Exigences

• Deep hands-on experience with Kubernetes in production (EKS preferred). You've debugged node pressure, networking issues, and deployment failures at scale (thousands of nodes) • Strong experience operating production infrastructure on AWS. Not just one account, but understanding organizational boundaries, IAM, and networking between many • Experience automating infrastructure using Terraform or Terragrunt at scale, including module design and state management • Solid understanding of Linux systems (disk, memory, networking, failure modes) • Experience supporting stateful systems (databases, queues, storage systems, etc.) • Ability to debug and reason about performance and reliability issues in production • You're comfortable owning systems end-to-end, including on-call responsibilities.

🏖️ Avantages

• Transparency: Everyone can read about our roadmap, how we pay (or even let go of) people, our strategy, and how we work, in our public company handbook. Internally, we share revenue, notes and slides from board meetings, and fundraising plans, so everyone has the context they need to make good decisions. • Autonomy: We don’t tell anyone what to do. Everyone chooses what to work on next based on what's going to have the biggest impact on our customers, and what they find interesting and motivating to work on. • Shipping fast: Why not now? We want to build a lot of products; we can't do that shipping at a normal pace. We prioritize heads down building time over perfect coordination. This will be the most productive job you've ever had. • Time for building: Nothing gets shipped in a meeting. We're a natively remote company. We default to async communication – PRs > Issues > Slack. Tuesdays and Thursdays are meeting-free days. • Ambition: We want to solve big problems. We strongly believe that aiming for the best possible upside, and sometimes missing, is better than never trying. We're optimistic about what's possible and our ability to get there. • Being weird: Doing weird stuff is a competitive advantage. And it's fun.

Postuler Maintenant

Emplois Similaires

🕒 il y a 2 mois

Cresta

51 - 200

☁️ SaaS

🤖 Intelligence artificielle

🏢 Entreprise

Senior Infrastructure Engineer/SRE responsible for building core infrastructure at AI-driven contact center company. Designing tools for developers and ensuring reliability across cloud platforms.

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 2 mois

Alteryx

1001 - 5000

🤖 Intelligence artificielle

🤝 B2B

Lead Site Reliability Engineer guiding reliability strategy and execution for modern multi-region SaaS platform. Focused on system design, incident management, and cross-team collaboration.

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 2 mois

Toast

1001 - 5000

☁️ SaaS

🤝 B2B

Staff Software Engineer, Tech Lead focused on mobile DevOps at Toast, specializing in Android development and CI/CD processes for restaurant technology.

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 2 mois

EITACIES Inc.

51 - 200

🏢 Entreprise

🔒 Cybersecurity

🤖 Intelligence artificielle

DevOps Architect leading platform engineering standards across a multi-cloud, hybrid environment at Eitacies Inc. Focus on automation, infrastructure, and cloud architecture.

🇺🇸 États-Unis – Télétravail

💵 $60 / heure

⏰ Temps Plein

🟠 Senior

🔴 Expert

⛑ Ingénieur DevOps & SRE

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 2 mois

Flywire

1001 - 5000

💸 Finance

💳 Fintech

Manager II, Site Reliability Engineering at Flywire driving reliability and performance in our cloud infrastructure. Lead SRE teams, collaborate across functions, and ensure production excellence.

🇺🇸 États-Unis – Télétravail

💵 $160 000 - $200 000 / an

💰 €60 000 000 Series F en 2021-03

⏰ Temps Plein

🟡 Intermédiaire

🟠 Senior

⛑ Ingénieur DevOps & SRE

🦅 Parrain de Visa H1B

info

🗣️🇺🇸🇬🇧 Anglais requis