
501 - 1000 employees
Founded 2005
B2C • Media • Social Impact
Reddit, Inc. is a social media platform that acts as a hub for thousands of communities, where users can engage in diverse conversations ranging from breaking news to niche interests. It enables users to post, comment, and vote on content, fostering a vibrant online community. Millions of people globally connect and share their passions on Reddit, creating a dynamic environment for authentic human interaction.
0 minutes ago
United Kingdom – Remote
Full Time
Lead
DevOps & Site Reliability Engineer (SRE)
⢠Lead reliability initiatives across multiple Ads domains including ad serving, auctions, targeting, reporting, measurement, and billing. ⢠Partner with engineering leadership to improve reliability, scalability, operational excellence, and engineering efficiency across the Ads organization. ⢠Drive architecture reviews and influence technical decisions impacting critical revenue-generating systems. ⢠Design and build platforms, tooling, and automation that improve reliability and developer productivity at scale. ⢠Participate in on-call rotations, lead complex incident investigations and coordinate cross-functional response efforts during major production events. ⢠Identify systemic reliability risks and drive long-term solutions that improve platform resilience. ⢠Establish reliability metrics around advertiser-critical user journeys such as campaign creation, ad delivery, auction participation, reporting, attribution, and billing. ⢠Mentor engineers and provide technical leadership across multiple teams. ⢠Influence roadmap planning and ensure reliability considerations are incorporated into product and infrastructure investments.
⢠8+ years of experience in Site Reliability Engineering, Infrastructure Engineering, or related roles operating large scale distributed systems. ⢠Strong experience supporting high traffic, user facing production environments. ⢠Deep understanding of distributed systems, networking, Linux systems, cloud native architectures. ⢠Experience designing highly available systems with strong operational and reliability practices. ⢠Strong understanding of observability systems including metrics, logging, tracing, and alerting. ⢠Good programming skills in languages such as Go, Python, or similar. ⢠Experience improving reliability through SLOs, automation, incident management, and performance optimization. ⢠Demonstrated ability to troubleshoot complex issues across a modern distributed system stack. ⢠Strong collaboration and communication skills with the ability to influence technical direction across teams.
⢠Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving support ⢠Family Planning Support ⢠Gender-Affirming Care ⢠Mental Health & Coaching Benefits ⢠Group Personal Pension Scheme with Employer match ⢠Private Medical and Dental Scheme ⢠Income Replacement Programs ⢠Bike to Work scheme ⢠Flexible Vacation & Paid Volunteer Time Off ⢠Generous Paid Parental Leave
June 11
DevOps Reliability Engineer ensuring performance, scalability, and reliability of an Azure-based SaaS platform at ASI. Collaborating with engineering teams to improve system efficiency and resilience.
United Kingdom – Remote
Venture Round on 2022-01
Full Time
Senior
Lead
DevOps & Site Reliability Engineer (SRE)
June 2
DevOps engineer optimizing deployment and CI/CD processes at Ohalo, a data protection startup. Collaborating with Engineering teams to enable better solutions and protect client data rights.
May 26
Principal DevOps Engineer serving as technical lead and architect for infrastructure, automation, and deployments at a cloud communications provider. Focused on reliability, standards, and cross-platform initiatives.
United Kingdom – Remote
Venture Round on 2017-02
Full Time
Lead
DevOps & Site Reliability Engineer (SRE)
May 12
Principal Platform Infrastructure Engineer designing and operating Menlo Security's infrastructure platform across multiple environments. Collaborating with global teams and leveraging cloud-native technologies like Google Kubernetes Engine and Terraform.
United Kingdom – Remote
$100M Series E on 2020-11
Full Time
Lead
DevOps & Site Reliability Engineer (SRE)
April 28
Staff/Senior DevOps Engineer at Runware enhancing infrastructure for real-time AI inference. Focus on automation, observability, and scaling to meet growing demands.