Site Reliability Engineer

🕒 February 26

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of EngFlow

EngFlow

11 - 50 employees

Founded 2020

☁️ SaaS

🤝 B2B

🏢 Enterprise

💰 $18M Series A - EngFlow on 2022-11

SaaS • B2B • Enterprise

EngFlow is a company that provides a cloud-native build acceleration platform for software development teams. It offers remote execution, remote caching, CI runners, build/test UIs, and tooling for Bazel migration and optimization to make large builds and tests faster and more efficient. EngFlow works as a scalable, deployable (on-prem or cloud) SaaS solution targeted at engineering teams in enterprises and marketplaces to reduce build times, improve developer productivity, and support complex build systems and CI integrations.

📋 Description

• Design, build, and maintain cloud infrastructure for our distributed build acceleration platform • Automate everything: from deployment pipelines to monitoring and recovery • Manage scalability and reliability for high-throughput, low-latency systems • Implement and maintain observability: logging, metrics, tracing, and alerting • Work closely with product and engineering teams to embed reliability into every feature • Diagnose and resolve production incidents quickly, and feed learnings back into systems design • Optimize cost, performance, and resilience across multi-cloud environments

🎯 Requirements

• 4+ years in SRE, DevOps, or Production Engineering roles • Experience managing Kubernetes in production • Strong background in cloud infrastructure (GCP or AWS) and IaC (Terraform preferred) • Solid knowledge of networking, security, and distributed systems • Track record of improving system availability and developer productivity • A knack for debugging complex, cross-system issues under pressure

🏖️ Benefits

• comprehensive medical, dental, vision benefits • 401k/pension • parental leave • generous vacation

Apply Now

Similar Jobs

🕒 February 26

Knox Systems, Inc.

201 - 500

🏛️ Government

🔒 Cybersecurity

📋 Compliance

Devops Security Engineer at Knox securing cloud-native environments for U.S. government missions. Focus on preventative security, automation, and continuous compliance within FedRAMP frameworks.

🇺🇸 United States – Remote

💵 $110k - $140k / year

🔥 Funding within the last year

💰 $6.5M Seed on 2025-08

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

AWS

Azure

Cloud

Google Cloud Platform

Kubernetes

Terraform

🕒 February 26

JFrog

1001 - 5000

🏢 Enterprise

☁️ SaaS

🔐 Security

Senior Professional Services DevOps Engineer designing CI/CD pipelines at JFrog. Collaborating with clients and teams to enhance DevOps experience.

Ansible

AWS

Azure

Chef

Cloud

Docker

Google Cloud Platform

Java

Jenkins

Kubernetes

Linux

Maven

Open Source

Puppet

🕒 February 26

Risk Labs Foundation

11 - 50

₿ Crypto

🌐 Web 3

Senior DevOps engineer driving evolution of Risk Labs operations and development processes. Work closely with platform engineers on internal tooling and vital protocol operations.

Cloud

Google Cloud Platform

Python

Terraform

Web3

🕒 February 25

Nick AI

1 - 10

🤖 Artificial Intelligence

₿ Crypto

☁️ SaaS

Backend/DevOps Engineer managing deployments and infrastructure for AI trading platform. Responsible for security, reliability, and scaling of systems across multiple venues.

AWS

Cloud

Docker

Google Cloud Platform

Grafana

Kubernetes

Prometheus

Python

Web3

🕒 February 25

WorkOS

51 - 200

🔌 API

🏢 Enterprise

🤝 B2B

Site Reliability Engineer ensuring reliability and performance at WorkOS across complex systems. Leading incident response and collaborating with cross-functional teams for operational excellence.

AWS

Cloud

Grafana

Kubernetes

Prometheus

TypeScript