Senior Site Reliability Engineer

51 - 200 employees

Founded 2016

☁️ SaaS

🏢 Enterprise

🤖 Artificial Intelligence

Honeycomb.io is an observability platform designed to provide comprehensive insights into application performance. It unifies logs, metrics, and traces into a single data type, allowing engineers to quickly diagnose and resolve issues. Honeycomb.io offers features like distributed tracing, anomaly detection, and service maps to help teams enhance system visibility and operational efficiency. It integrates with popular cloud services like Amazon Web Services and Kubernetes, and supports technologies such as OpenTelemetry. Honeycomb.io aims to enable engineering teams to deploy confidently, reduce incident response times, and improve overall productivity.

🕒 June 5

🇬🇧 United Kingdom – Remote

💵 £127.7k - £150.2k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

AWS

Kafka

Kubernetes

Terraform

📋 Description

• Help Honeycomb scale our backend systems to support our highest-volume customers.
• Build organizational trust through transparent communication, giving and receiving direct and kind feedback.
• Work with other backend teams to dive deep into our stack to make sure we’re getting the most out of our infrastructure.
• Train as an Incident Commander, serve in the role, and then train others.
• Help SRE and Honeycomb develop a healthy cross-Atlantic engineering culture.
• Participate in the team’s on-call rotation as the EU side of a new follow-the-sun rotation.
• Help the organization navigate tradeoffs between reliability and its other goals and priorities.
• Optional: act as an external ambassador through blog posts, conference talks, and presentations with support from our DevRel team.

🎯 Requirements

• Strong experience in AWS and Kubernetes
• Experience performing cost analysis and reduction
• Solid Helm, Terraform, and CI/CD experience
• Project management skills
• Software engineering experience (Golang is a plus, and so is performance engineering)
• Experience with Kafka or another high-volume distributed system
• Excellent written and spoken communication skills, with the ability to tailor your communication for your audience and give direct feedback when you notice something wrong
• A curiosity to learn how people and systems work, and the willingness to make them partners in your initiatives
• Familiarity with observability concepts (SLOs, instrumentation) and data-driven decision making
• Comfort operating in ambiguity, with a bias for action and experimentation
• Interest in both the technical and human sides of reliability engineering
• Experience working in geographically distributed teams

🏖️ Benefits

• A stake in our success - generous equity with employee-friendly stock program
• It’s not about how strong of a negotiator you are - our pay is based on transparent levels relative to experience
• Time to recharge with unlimited PTO
• A distributed-first mindset and culture (really!)
• Home office, co-working, and internet stipend
• Full benefits coverage for employees, with additional coverage available for dependents
• Up to 16 weeks of paid parental leave, regardless of path to parenthood
• Annual development allowance
