Site Reliability Engineer

Job not on LinkedIn

September 4

Apply Now
Logo of EngFlow

EngFlow

SaaS • Productivity • Enterprise

EngFlow is a company specializing in optimizing software development processes through remote execution and caching technologies. They provide tools such as the Bazel Invocation Analyzer and CI Runners to accelerate builds and tests, support various continuous integration platforms, and enhance developer productivity. EngFlow partners with companies to increase development efficiency and has helped reduce build times significantly for many large-scale applications. They also offer flexible deployment options that integrate with existing infrastructure, including cloud services like AWS, Azure, and GCP.

📋 Description

• Design, build, and maintain cloud infrastructure for our distributed build acceleration platform • Automate everything: from deployment pipelines to monitoring and recovery • Manage scalability and reliability for high-throughput, low-latency systems • Implement and maintain observability: logging, metrics, tracing, and alerting • Work closely with product and engineering teams to embed reliability into every feature • Diagnose and resolve production incidents quickly, and feed learnings back into systems design • Optimize cost, performance, and resilience across multi-cloud environments

🎯 Requirements

• 4+ years in SRE, DevOps, or Production Engineering roles • Experience managing Kubernetes in production • Strong background in cloud infrastructure (GCP or AWS) and IaC (Terraform preferred) • Solid knowledge of networking, security, and distributed systems • Track record of improving system availability and developer productivity • A knack for debugging complex, cross-system issues under pressure

🏖️ Benefits

• Comprehensive medical, dental, vision benefits • 401k/pension • Parental leave • Generous vacation • Fully remote team with several in-person global meetups per year • Fun team events (chocolate, whisky, and tea tastings; monthly team games; escape rooms and other events)

Apply Now

Similar Jobs

September 3

Lead DevOps Engineer architecting M&S DevSecOps platform and secure pipelines for Global InfoTek. Supporting cloud and on-prem integration for defense customers.

Cloud

Cyber Security

Python

TypeScript

September 3

Public Sector SRE at Unstructured building secure, compliant cloud infrastructure for federal AI workloads focused on reliability and observability

Ansible

AWS

Azure

Cloud

Docker

Grafana

Kubernetes

Linux

Prometheus

Python

Terraform

TypeScript

Go

August 29

Senior SRE specializing in data infrastructure; ensures reliable, scalable data platform and collaborates with cross-functional teams to govern data, security, and streaming pipelines.

Airflow

Apache

AWS

Cloud

Docker

Kafka

Kubernetes

Python

Shell Scripting

Spark

Terraform

August 29

DevOps Engineer for Inco blockchain infra—manages testnet/mainnet nodes and Kubernetes. Builds monitoring, GitOps, and secure CI/CD pipelines.

Ansible

AWS

Azure

Cloud

Distributed Systems

Firewalls

Google Cloud Platform

Grafana

Kubernetes

Linux

Microservices

Open Source

Postgres

Prometheus

Python

Redis

SDLC

Terraform

Vault

Go

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com