Lead SRE

Job not on LinkedIn

🔥 0 minutes ago

🗣️🇫🇷 French Required

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Voodoo

Voodoo

501 - 1000 employees

🎮 Gaming

📱 Media

👥 B2C

Gaming • Media • B2C

Voodoo is a global tech company known for entertaining the world through its iconic apps and games. The company creates engaging puzzle and strategy games that are easy to pick up, yet challenging to master. With a focus on fun that meets real stakes, Voodoo turns favorite games into competitions with cash rewards. It boasts over 7 billion downloads and 200 million monthly active users, generating $600 million in revenue as of 2023. Voodoo empowers bold creators to bring their ideas to life, working alongside creative minds, content creators, and technical innovators to shape future hits.

📋 Description

• Define and drive SRE practices across the organization, including SLIs, SLOs, error budgets, incident management, postmortem processes, and long-term reliability improvements across the platform • Design, implement, and optimize infrastructure for availability, scalability, reliability, and cost efficiency • Own and evolve our observability stack, improving monitoring, alerting, logging, and distributed tracing • Drive automation of infrastructure and operational workflows (e.g., Terraform, Terragrunt, Kubernetes) • Lead FinOps initiatives, developing tools and insights to optimize cloud costs • Partner closely with development squads to improve service reliability, performance, and operational excellence • Influence architectural decisions and establish best practices for building resilient distributed systems • Mentor and support Infrastructure engineers, helping raise the bar on reliability, operational excellence, and technical execution • Analyze performance bottlenecks and work on solutions such as scaling strategies, service optimizations, and system debugging

🎯 Requirements

• Strong knowledge of Kubernetes • Experience with high traffic, distributed systems architectures, and related tools (service discovery, config/secret management, etc.) • Strong knowledge of one Cloud provider (AWS or GCP preferred) • Proven experience defining and operating SRE practices (SLOs, incident management, observability, reliability engineering) • Strong operational mindset with experience managing production incidents and driving reliability improvements • Leadership and mentoring experience, with the ability to influence technical decisions across teams • Ownership-driven – If something isn’t working, you don’t wait for instructions; you improve it • Pragmatic and impact-oriented – You balance reliability, delivery speed, and business priorities • Performance vs cost-conscious – You make decisions that align with both technical excellence and financial sustainability

🏖️ Benefits

• Competitive salary based on experience • Swile Lunch voucher • Gymlib (100% covered by Voodoo) • Premium healthcare coverage with SideCare, 100% covered for you and your family • Wellness activities in our Paris office

Apply Now

Similar Jobs

🕒 June 8

Alma

201 - 500

💳 Fintech

🛍️ eCommerce

🛒 Retail

Senior DevOps Engineer optimizing CI/CD workflows for Alma's fintech solutions. Join the Developer Experience squad to enhance the developer environment and tooling.

🗣️🇫🇷 French Required

Cloud

Kubernetes

PHP

Prometheus

Python

Terraform

TypeScript

🕒 May 21

Filigran

201 - 500

🔒 Cybersecurity

☁️ SaaS

Senior Platform Engineer - SRE joining Filigran to improve cyber threat management infrastructure. Collaborating with SRE engineers and development teams for platform stability.

Ansible

AWS

Azure

Cloud

ElasticSearch

Google Cloud Platform

Grafana

Java

Kubernetes

Linux

Open Source

Prometheus

Python

RabbitMQ

Redis

Terraform

Go

🕒 May 20

Replit

51 - 200

Site Reliability Engineer joining Replit to ensure reliability, scalability, and performance of infrastructure. Developing monitoring solutions and optimizing systems for developers worldwide.

Ansible

Cloud

Distributed Systems

Kubernetes

Python

Terraform

Go

🕒 May 20

Datadog

1001 - 5000

🔒 Cybersecurity

☁️ SaaS

🏢 Enterprise

Engineering Manager II leading reliability initiatives and managing engineering managers for Datadog. Driving system robustness through proactive risk mitigation and cross-functional partnerships.

Distributed Systems

🕒 May 13

N2JSoft, administrative and HR softwares

51 - 200

👥 HR Tech

☁️ SaaS

🤝 B2B

DevOps Engineer for N2JSoft, improving AWS infrastructure and CI/CD pipelines. Collaborate autonomously in a growing fintech environment.

🗣️🇫🇷 French Required

AWS

Cloud

Kubernetes

Python

Terraform

TypeScript

Go