
Cloud infrastructure platform enabling companies to excel with cloud native tech stacks, focused on tech stack evaluation, planning, and evolution.
2 - 10 employees
October 1

Cloud infrastructure platform enabling companies to excel with cloud native tech stacks, focused on tech stack evaluation, planning, and evolution.
2 - 10 employees
• Establish the foundations of AWS-based cloud operations and infrastructure-as-code strategy. • Design, implement, and administer secure, scalable, and cost-effective AWS infrastructure. • Develop infrastructure-as-code using tools like Terraform or Helm to manage and evolve cloud environments. • Define and deploy observability pipelines and dashboards across metrics, logs, and traces (CloudWatch, Prometheus, Grafana, etc) with Splunk being our preference and the tool of choice. • Write internal documentation and structured reports on architectural decisions and infrastructure health. • Collaborate with the product and engineering teams to align infrastructure capabilities with evolving product needs. • Operate independently and propose scalable, secure, and production-ready solutions with minimal guidance.
• 5+ years of experience in SRE, DevOps, or Cloud Infrastructure roles with a strong AWS and Kubernetes focus. • Advanced expertise in cloud architecture design and administration of core AWS services (VPC, IAM, ECS/EKS, RDS, CloudWatch, etc.). • Strong understanding of monitoring, logging and observability frameworks (preferably Splunk) and ability to set up custom dashboards. • Ability to analyze current traffic patterns and make technical recommendations for infrastructure choice appropriate for the business context. • Proficient in infrastructure-as-code frameworks such as Terraform, Pulumi, or AWS CDK. • Proven track record of owning production infrastructure and driving operational excellence at high-growth startups or SaaS companies. • Experience setting up and managing CI/CD pipelines and security best practices in cloud environments. • Excellent communication skills with the ability to distill complex infrastructure topics into clear written reports and dashboards. • Self-starter mindset and thrives in fast-paced, early-stage environments with limited structure.
• top-tier compensation for startups • significant equity in a rapidly growing, VC-backed company • commitment to fostering an inclusive and diverse workplace
Apply NowOctober 1
Senior Cluster Site Reliability Engineer at Voleon scaling research compute clusters using machine learning techniques in finance. Ensuring uptime, reliability, and performance of HPC platforms.
🇺🇸 United States – Remote
💵 $205k - $235k / year
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
Ansible
AWS
Cloud
Google Cloud Platform
Grafana
Prometheus
Python
Ruby
Terraform
October 1
Senior DevOps Engineer at Domyn managing cloud and on-prem infrastructure for enterprise AI. Optimize deployments across GCP, Azure, AWS and ensure security, reliability, and high availability.
AWS
Azure
Cloud
Docker
Google Cloud Platform
Java
JavaScript
Kubernetes
Linux
Postgres
Python
Terraform
September 30
Talent-pool for DevOps-specialist roles at Mission Box Solutions. Connecting veteran-owned recruiting agency candidates with hiring companies across DevOps specializations.
September 30
DevOps Engineer building and operating application servers and IaC for Cutsforth's power-generation monitoring systems. Supports customers, deployments, cybersecurity, and LabVIEW-integrated solutions.
🇺🇸 United States – Remote
💵 $103k - $148k / year
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
Cloud
Cyber Security
Terraform
September 30
Senior DevOps Architect designing scalable, secure AWS/Kubernetes infrastructure and CI/CD for CrowdStrike's AI-native cybersecurity platform.
🇺🇸 United States – Remote
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
AWS
Cloud
Cyber Security
Distributed Systems
Google Cloud Platform
Kubernetes
Terraform