Senior Site Reliability Engineer

Job not on LinkedIn

🕒 April 29

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Satsuma Technology Ltd

Satsuma Technology Ltd

1 - 10 employees

🔌 API

🤖 Artificial Intelligence

🛍️ eCommerce

API • Artificial Intelligence • eCommerce

Satsuma Technology Ltd. is an enterprise solution provider that harnesses AI technology to enhance chatbot interactions and drive orders for businesses. Through its advanced API integrations, Satsuma synchronizes over 1 billion products and has successfully facilitated more than 300,000 orders. The company's platform serves a multitude of partners, including Shipt, Vivino, and GoPuff.

📋 Description

• Own infrastructure across AWS, GCP, and Azure environments • Build and maintain CI/CD pipelines, observability stacks, and incident response workflows • Define and enforce SLOs/SLIs; lead postmortems • Author and maintain IaC (Terraform preferred) • Write internal tooling and automation using AI-assisted development workflows • Partner closely with engineering on reliability reviews and architecture decisions

🎯 Requirements

• 5-8 years in SRE, DevOps, or infrastructure engineering • Hands-on experience across at least two major cloud providers • Strong Kubernetes, Terraform, and observability tooling (Datadog, Grafana, or equivalent) • Comfortable reading and editing code; able to ship scripts and internal tools • Experience with AI-assisted development (Copilot, Cursor, Claude Code) • On-call maturity -- you've owned incidents end-to-end and made systems better afterward • Prior experience at a startup or high-growth SaaS company • Familiarity with API gateway infrastructure or commerce tech stacks • Hands-on experience with MCP or agentic AI infrastructure

🏖️ Benefits

• Unlimited PTO • 401(K) • Healthcare Stipend • Gym stipend

Apply Now

Similar Jobs

🕒 April 28

Parallel Domain

51 - 200

🤖 Artificial Intelligence

🔌 API

Senior Site Reliability Engineer managing AWS infrastructure and Kubernetes for autonomous systems testing. Collaborating across teams to ensure system reliability and security.

AWS

Cloud

DNS

Grafana

Kubernetes

Linux

Node.js

Prometheus

Python

Terraform

🕒 April 28

Nomi Health

501 - 1000

⚕️ Healthcare Insurance

💸 Finance

☁️ SaaS

Senior Manager of Cloud and DevOps Engineering managing daily operations of AWS and Kubernetes infrastructure across businesses. Leading a team and working closely with senior leadership for operational excellence.

AWS

Cloud

Docker

EC2

Kubernetes

Terraform

🕒 April 28

Sagent

201 - 500

☁️ SaaS

💳 Fintech

Cloud Infrastructure Engineer managing cloud resources for large-scale infrastructure. Supporting development teams in a microservices environment to streamline deployments and optimize performance.

Airflow

Azure

BigQuery

Cloud

DNS

Google Cloud Platform

Grafana

Kafka

Kubernetes

Matillion

Microservices

Postgres

Prometheus

Python

Redis

Spark

SQL

Terraform

Vault

Go

🕒 April 27

Veeam Software

1001 - 5000

☁️ SaaS

🔒 Cybersecurity

🏢 Enterprise

Senior Site Reliability Engineer for Veeam's Government & Sovereign Cloud environments. Building a global SRE function with an emphasis on high availability and operational excellence.

AWS

Azure

Cloud

Dagger

Distributed Systems

Grafana

Java

JavaScript

Kubernetes

Prometheus

Terraform

TypeScript

Go

🕒 April 27

ImmunityBio, Inc.

501 - 1000

🧬 Biotechnology

⚕️ Healthcare Insurance

💊 Pharmaceuticals

DevOps Engineer bridging software development and operations at ImmunityBio, involved in CI/CD and infrastructure automation. Collaborating across teams to support reliable and scalable services.

Ansible

Grafana

Jenkins

Kubernetes

Linux

Prometheus

Python

Terraform