Senior Site Reliability Engineer, HyperShift

October 21

Apply Now
Logo of Red Hat

Red Hat

Enterprise • Cloud

Red Hat is a leading provider of enterprise open source software solutions, helping companies worldwide to build and deploy applications across hybrid cloud infrastructures. With a strong focus on developing secure, stable, and innovative technologies, Red Hat offers a broad portfolio including products like Red Hat Enterprise Linux, Red Hat OpenShift, and Red Hat Ansible Automation Platform. These products support IT services on any infrastructure efficiently. Trusted by more than 90% of the U. S. Fortune 500, Red Hat empowers organizations to modernize their IT environments, leveraging open source communities to drive technological advancement.

10,000+ employees

Founded 1993

🏢 Enterprise

💰 Corporate Round on 1999-03

📋 Description

• Develop, scale, and operate OpenShift managed cloud services • Contribute code to increase scalability and reliability • Help develop peers’ capabilities through knowledge sharing and mentoring • Participate in a regular on-call schedule • Practice sustainable incident response and blameless postmortems • Resolve customer issues escalated from the support team • Work within a small agile team to improve SRE software

🎯 Requirements

• Bachelor’s degree in Computer Science or related technical field • Experience programming in Python, Golang, Java, C, C++, or another object-oriented language • Experience with public clouds such as AWS, GCP, or Azure • Ability to collaboratively troubleshoot and solve problems in a team setting • Experience troubleshooting SaaS/PaaS offerings • Experience with complex distributed systems • Basic understanding of Unix/Linux operating systems • 5+ years of experience managing Linux servers • 3+ years of experience with enterprise systems monitoring • 3+ years of experience with configuration management software like Ansible, Puppet, or Chef • Solid communications skills

🏖️ Benefits

• Health insurance • Retirement plans • Paid time off • Flexible work arrangements • Professional development • Equipment allowances

Apply Now

Similar Jobs

October 21

Focal

11 - 50

Site Reliability Engineer responsible for AWS infrastructure and CI/CD pipelines. Collaborating with product engineering to ensure system reliability and resilience strategies.

AWS

Java

Kubernetes

Python

Terraform

Go

October 13

DevOps/Platform Engineer specializing in CI/CD, security hygiene, and observability for deployment workflows in remote-first setting. Collaborate across teams to build safe, reliable environments.

Azure

Cloud

Terraform

October 11

DevOps Engineer managing and scaling AWS cloud infrastructure at Smart Working. Focusing on automation, observability, and collaborating with cross-functional teams to improve system reliability and performance.

AWS

Cloud

Docker

EC2

Jenkins

Kubernetes

Terraform

October 9

Site Reliability Engineer managing critical systems for data collaboration platform Atlan. Ensuring fast, reliable customer experiences while automating workflows and enhancing observability.

AWS

Azure

Cloud

Google Cloud Platform

Grafana

Kubernetes

Prometheus

Python

October 8

Senior Site Reliability Engineer analyzing system performance to shape product direction at Akamai. Collaborating on automation and network deployment for a global customer base.

Distributed Systems

DNS

Kubernetes

Linux

Python

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com