Senior SRE

October 1

Apply Now
Logo of Catio

Catio

Cloud infrastructure platform enabling companies to excel with cloud native tech stacks, focused on tech stack evaluation, planning, and evolution.

2 - 10 employees

📋 Description

• Establish the foundations of AWS-based cloud operations and infrastructure-as-code strategy. • Design, implement, and administer secure, scalable, and cost-effective AWS infrastructure. • Develop infrastructure-as-code using tools like Terraform or Helm to manage and evolve cloud environments. • Define and deploy observability pipelines and dashboards across metrics, logs, and traces (CloudWatch, Prometheus, Grafana, etc) with Splunk being our preference and the tool of choice. • Write internal documentation and structured reports on architectural decisions and infrastructure health. • Collaborate with the product and engineering teams to align infrastructure capabilities with evolving product needs. • Operate independently and propose scalable, secure, and production-ready solutions with minimal guidance.

🎯 Requirements

• 5+ years of experience in SRE, DevOps, or Cloud Infrastructure roles with a strong AWS and Kubernetes focus. • Advanced expertise in cloud architecture design and administration of core AWS services (VPC, IAM, ECS/EKS, RDS, CloudWatch, etc.). • Strong understanding of monitoring, logging and observability frameworks (preferably Splunk) and ability to set up custom dashboards. • Ability to analyze current traffic patterns and make technical recommendations for infrastructure choice appropriate for the business context. • Proficient in infrastructure-as-code frameworks such as Terraform, Pulumi, or AWS CDK. • Proven track record of owning production infrastructure and driving operational excellence at high-growth startups or SaaS companies. • Experience setting up and managing CI/CD pipelines and security best practices in cloud environments. • Excellent communication skills with the ability to distill complex infrastructure topics into clear written reports and dashboards. • Self-starter mindset and thrives in fast-paced, early-stage environments with limited structure.

🏖️ Benefits

• top-tier compensation for startups • significant equity in a rapidly growing, VC-backed company • commitment to fostering an inclusive and diverse workplace

Apply Now

Similar Jobs

October 1

Senior Cluster Site Reliability Engineer at Voleon scaling research compute clusters using machine learning techniques in finance. Ensuring uptime, reliability, and performance of HPC platforms.

Ansible

AWS

Cloud

Google Cloud Platform

Grafana

Prometheus

Python

Ruby

Terraform

October 1

Senior DevOps Engineer at Domyn managing cloud and on-prem infrastructure for enterprise AI. Optimize deployments across GCP, Azure, AWS and ensure security, reliability, and high availability.

AWS

Azure

Cloud

Docker

Google Cloud Platform

Java

JavaScript

Kubernetes

Linux

Postgres

Python

Terraform

September 30

Talent-pool for DevOps-specialist roles at Mission Box Solutions. Connecting veteran-owned recruiting agency candidates with hiring companies across DevOps specializations.

September 30

DevOps Engineer building and operating application servers and IaC for Cutsforth's power-generation monitoring systems. Supports customers, deployments, cybersecurity, and LabVIEW-integrated solutions.

Cloud

Cyber Security

Terraform

September 30

Senior DevOps Architect designing scalable, secure AWS/Kubernetes infrastructure and CI/CD for CrowdStrike's AI-native cybersecurity platform.

AWS

Cloud

Cyber Security

Distributed Systems

Google Cloud Platform

Kubernetes

Terraform

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com