Site Reliability Engineer

🔥 0 minutes ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Pythian

Pythian

201 - 500 employees

Founded 1997

🤖 Artificial Intelligence

🤝 B2B

🏢 Enterprise

Artificial Intelligence • B2B • Enterprise

<Pythian> Pythian is a global consulting firm specializing in data, analytics, cloud, and AI solutions. With decades of experience and a large team of domain experts, they provide strategy, custom AI development, advanced analytics, database and cloud consulting, migrations, and managed services (including AIOps and DBA services) to enterprise customers across industries. Pythian focuses on helping organizations modernize their data platforms, deploy generative AI use cases, and optimize cloud and database operations through partnerships with major cloud and data technology providers.

📋 Description

• Operate and optimize Kubernetes clusters, Istio service mesh, and Linux-based systems • Automate workflows using Go, Python, and Shell scripting • Build monitoring and observability solutions with Prometheus, Grafana, and Loki • Troubleshoot complex networking, storage, and system performance issues • Participate in on-call rotations and postmortem reviews to improve system resilience • Partner with AI/ML teams to ensure infrastructure readiness for model training and data pipelines

🎯 Requirements

• Experience with Google Cloud, plus IaC tools (Terraform) • Strong knowledge of microservices, containers (Kubernetes, Docker), and networking • SRE mindset with a focus on automation, scalability, and reliability • Hands-on experience with PKI, service mesh, and Linux systems administration

🏖️ Benefits

• Competitive total rewards package • Blog during work hours; take a day off and volunteer for your favorite charity • Flexibly work remotely from your home, there’s no daily travel requirement to an office! • All you need is a stable internet connection • Collaborate with some of the best and brightest in the industry! • Hone your skills or learn new ones with our substantial training allowance; participate in professional development days, attend training, become certified, whatever you like! • We give you all the equipment you need to work from home including a laptop with your choice of OS, and an annual budget to personalize your work environment! • Pythian cares about the health and well-being of our team. You will have an annual wellness budget to make yourself a priority (use it on gym memberships, massages, fitness and more) • Generous amount of paid vacation and sick days, as well as a day off to volunteer for your favorite charity

Apply Now

Similar Jobs

🕒 Yesterday

GE Vernova

10,000+ employees

⚡ Energy

🚀 Aerospace

🤖 Artificial Intelligence

Senior Embedded Reliability Engineer managing reliability initiatives for energy management and utility-scale software. Leading engineering methodologies to ensure system resilience and performance for utility customers.

🕒 Yesterday

Switzerland Global Enterprise

51 - 200

🤝 B2B

🛍️ eCommerce

Senior Embedded Reliability Engineer focusing on reliability of grid automation products at GE Vernova. Collaborating across disciplines to enhance system resilience and support utility customers.

🕒 Yesterday

Branch

501 - 1000

🔌 API

🤝 B2B

☁️ SaaS

AI DevOps & Reliability Engineer at Branch, focusing on software delivery and operational standards, enhancing DevOps with AI tools for reliability and efficiency.

🇨🇦 Canada – Remote

💵 $123k - $160k / year

💰 $282M Series F on 2022-02

⏰ Full Time

🟠 Senior

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 3 days ago

Rival Technologies

51 - 200

🤖 Artificial Intelligence

☁️ SaaS

DevOps Engineer leading a team to design and implement scalable infrastructure for a tech company. Collaborating with developers and QA to ensure efficient product releases.

🇨🇦 Canada – Remote

💵 $110k - $125k / year

💰 Venture Round on 2019-07

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 June 23

Yelp

1001 - 5000

Site Reliability Engineer managing scalable and self-healing distributed systems at Yelp. Collaborative role ensuring system reliability and performance while using automation and modern tools.