Senior Customer Reliability Engineer

September 22

🇺🇸 United States – Remote

💵 $149.5k - $192.5k / year

⏰ Full Time

🟠 Senior


Replicated

SaaS • Enterprise • B2B

Replicated is a software platform that helps companies distribute their software to enterprise customers in complex environments. It provides tools for developing, testing, releasing, licensing, installing, and supporting applications, with a focus on self-hosted and air-gapped deployments. Replicated enables its clients to manage the software lifecycle and compatibility across diverse on-premises infrastructure, including Kubernetes environments. Trusted by major enterprises, Replicated helps streamline installations, manage software releases, and support customer deployments efficiently.

51 - 200 employees

Founded 2017

💰 $50M Series C on 2021-07

📋 Description

• Provide expert support to customers, resolving issues related to Kubernetes, Linux, and Replicated products, including troubleshooting failures and identifying root causes
• Work proactively with customers to ensure successful deployment, management, and scaling of applications using Replicated, providing guidance, best practices, training, and onboarding assistance
• Collaborate closely with CREs and product engineers to share customer feedback, identify product improvements, and contribute to the product roadmap
• Contribute to tooling and best practices that empower internal teams and vendors; opportunities to develop coding skills and make code contributions over time
• Participate in on-call rotation to provide support coverage for Replicated products
• Build deep expertise in customer-managed deployments, including cluster installation scenarios, and help vendors operationalize Kubernetes applications
• Drive continuous learning and professional growth, leveraging company-provided training, certifications, and curiosity/professional development budgets
• Participate in documentation review, process improvement, and vendor interaction to improve support workflows and product usability

🎯 Requirements

• Preferably 3 or more years of professional experience
• Experience with Linux system administration and ability to troubleshoot complex system and network issues at an advanced level
• Experience with Kubernetes and Helm, including diagnosing complex issues on bare metal and developing/troubleshooting advanced Helm charts
• Exceptional technical and non-technical communication and interpersonal skills in English
• Strong problem-solving skills and ability to think critically and act quickly under pressure
• Customer-centric mindset and a genuine desire to help others succeed
• Experience working remotely with teams across various time zones
• Willingness to participate in on-call support coverage
• Nice to have: Experience with CNCF tools
• Nice to have: Familiarity with Go and ability to debug Go programs
• Nice to have: Customer-facing experience
• Preferred remote location: Australia or New Zealand (applicants must have legal right to work there)
• Note: Replicated cannot provide US sponsorship at this time (applicants must be legally authorized to work in the United States)

🏖️ Benefits

• Health/Dental/Vision
• Life/AD&D
• LTD/STD
• FSA
• 401K
• Stock options
• Partner perk programs
• Generous time off; we expect you to take a minimum of 3 weeks per year
• Laptop and the accessories you need to get set up
• Generous home office setup allowance or co-working space allowance - up to $10,000 per year!
• Curiosity Budget to help you keep learning and growing!
• Professional development budget


