Senior SRE

Cloud infrastructure platform enabling companies to excel with cloud native tech stacks, focused on tech stack evaluation, planning, and evolution.

2 - 10 employees

October 1

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

AWS

Cloud

Grafana

Kubernetes

Prometheus

Splunk

Terraform

Catio

📋 Description

• Establish the foundations of AWS-based cloud operations and infrastructure-as-code strategy.
• Design, implement, and administer secure, scalable, and cost-effective AWS infrastructure.
• Develop infrastructure-as-code using tools like Terraform or Helm to manage and evolve cloud environments.
• Define and deploy observability pipelines and dashboards across metrics, logs, and traces (CloudWatch, Prometheus, Grafana, etc.), with Splunk as our preferred tool.
• Write internal documentation and structured reports on architectural decisions and infrastructure health.
• Collaborate with the product and engineering teams to align infrastructure capabilities with evolving product needs.
• Operate independently and propose scalable, secure, and production-ready solutions with minimal guidance.

🎯 Requirements

• 5+ years of experience in SRE, DevOps, or Cloud Infrastructure roles with a strong AWS and Kubernetes focus.
• Advanced expertise in cloud architecture design and administration of core AWS services (VPC, IAM, ECS/EKS, RDS, CloudWatch, etc.).
• Strong understanding of monitoring, logging, and observability frameworks (preferably Splunk) and ability to set up custom dashboards.
• Ability to analyze current traffic patterns and recommend infrastructure choices appropriate to the business context.
• Proficiency in infrastructure-as-code frameworks such as Terraform, Pulumi, or AWS CDK.
• Proven track record of owning production infrastructure and driving operational excellence at high-growth startups or SaaS companies.
• Experience setting up and managing CI/CD pipelines and applying security best practices in cloud environments.
• Excellent communication skills, with the ability to distill complex infrastructure topics into clear written reports and dashboards.
• Self-starter mindset and the ability to thrive in fast-paced, early-stage environments with limited structure.

🏖️ Benefits

• Top-tier compensation for startups
• Significant equity in a rapidly growing, VC-backed company
• Commitment to fostering an inclusive and diverse workplace
