Staff Consultant – DevOps

51 - 200 employees

☁️ SaaS

🏢 Enterprise

🤝 B2B

SaaS • Enterprise • B2B

Test Double is a software consultancy firm that focuses on improving the way the world builds software. They provide a range of services including software delivery, product management, legacy system modernization, DevOps, and technical recruitment. Test Double embeds with client teams to solve tough software problems, emphasizing strategic advice and hands-on involvement. They aim to accelerate software investment returns by balancing speed and agility with thorough testing and maintainable code. Additionally, Test Double engages in open source contributions and is committed to community building and diversity.

Staff Consultant – DevOps

Job not on LinkedIn

🕒 May 28

🏄 California – Remote

💵 $170k - $190k / year

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

Ansible

AWS

Chef

Cloud

Docker

Grafana

JavaScript

Jenkins

Kubernetes

Puppet

Python

Ruby

Terraform

TypeScript

Apply Now

Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Test Double

51 - 200 employees

☁️ SaaS

🏢 Enterprise

🤝 B2B

SaaS • Enterprise • B2B

📋 Description

• We help client teams use DevOps practices to create more observable, sustainable, and predictable environments by integrating operations capabilities into development teams. • Delivering primary DevOps solutions to clients across: Cloud Architecture and Deployment in at least one major cloud provider • Hands-on experience running production services on Kubernetes and managed container platforms, including rollout strategies, autoscaling, and observability • Able to weigh tradeoffs between container orchestration (k8s vs. ECS) and serverless container platforms when advising clients • Comfortable with event-driven serverless (Lambda, Cloud Functions) and knowing when it's the right tool versus a long-running container • Infrastructure as Code • Configuration Management • CI/CD Pipelines • Monitoring and Observability • Creating high-quality infrastructure to meet the needs of its users and businesses • Applying security best practices in deployment pipelines and cloud environments • Helping clients achieve Service Level Agreements and Service Level Objectives by providing observable infrastructure • Implementing high-availability and disaster recovery architecture • Identifying technology, communication, and process issues and proposing improvements • Sharing best practices for cloud architecture that are fault-tolerant, highly available, and cost-effective for the client’s business • Mentoring by sharing experience and knowledge with client developers and operations teams so they are well-positioned to succeed, even long after we're gone • Collaborating internally with other Test Double agents on infrastructure best practices • Learn new frameworks, languages, tech, and techniques to adapt to changing client needs • Communicate openly and honestly with everyone, even if the news will not be positively received

🎯 Requirements

• 8+ years of experience in software development • 3+ years of experience in DevOps, cloud computing, or operations • 3+ years of experience in consulting • Strong understanding of Configuration Management tools like Ansible, Chef, or Puppet • Strong understanding of Infrastructure as Code tools like Terraform • CI/CD Pipelines like Jenkins, CircleCI, GitHub Actions, GitLab CI/CD • Demonstrated ability to direct AI in delivery—defining problems, applying quality checks, and producing consistent results, with examples of improving team workflows • Containerized deployment strategies like Kubernetes, AWS Elastic Container Service, Docker • Observability and monitoring tools like CloudWatch, Grafana, and DataDog • Low ego, high emotional intelligence (EQ), and a mindset of continuous improvement • Experience leading teams in decomposing work and maintaining a healthy backlog that is valuable to the business • Experience balancing competing priorities and influencing teams towards high-quality software development practices • Ability to communicate effectively across different levels or positions within an organization • Proficiency in designing, architecting, and refactoring systems of moderate complexity worked on by teams of 10+ • Ability to resolve conflicts and issues within the delivery team • Experience in mentoring and leading the technical direction of software engineers • Expertise in designing and delivering systems to production in the use of one or more of the following: Ruby, Go, Python, JavaScript/Typescript.

🏖️ Benefits

• Remote First - Work from anywhere, travel required for critical client and company functions • Time off: 5 weeks flexible time off (vacation and sick time) + 10 Paid Holidays, 2 week sabbatical after 5 years • Company Ownership: ESOP Employee stock ownership program - Test Double is 100% employee owned • Family Support: 8 weeks paid parental leave at 100% of salary, plus additional unpaid • Retirement: Company Contribution of 3% of salary to (401k) • Continuing Education: 1 week of conference attendance (and up to $3,000 of expenses) • Health: Premium health/dental/vision insurance (80-100% covered) • New computer hardware purchase every 3 years • Co-working space reimbursement (1/2 rent up to $500 monthly) • Company-wide in-person retreat every ~2years • Short and Long Term Disability • Life Insurance

Apply Now

Similar Jobs

VP of Site Reliability

🕒 May 27

Titan AI

1 - 10

VP of Site Reliability managing SRE and operational functions for banking AI software company. Leading engineering practices and ensuring reliable platform deployment for financial institutions.

🇺🇸 United States – Remote

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

AWS

Google Cloud Platform

MongoDB

ServiceNow

Director of Application and DevSecOps Security

🕒 May 26

Gainwell Technologies

10,000+ employees

⚕️ Healthcare Insurance

Director of Application & DevSecOps Security leading secure software development practices at Gainwell. Overseeing application and API security while guiding engineering teams in best security practices.

🇺🇸 United States – Remote

💵 $150.2k - $214.5k / year

💰 Grant on 2023-06

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

AWS

Azure

Cloud

Cyber Security

Google Cloud Platform

Kubernetes

Microservices

SDLC

Principal Service Reliability Engineer

🕒 May 23

Prescryptive Health, Inc.

201 - 500

⚕️ Healthcare Insurance

☁️ SaaS

🤝 B2B

Principal Service Reliability Engineer at Prescryptive ensuring platform reliability across healthcare technology systems. Focusing on technical leadership and operational excellence for cloud-based infrastructures.

🇺🇸 United States – Remote

💵 $150k - $205k / year

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

Azure

Cloud

Kubernetes

Staff Site Reliability Engineer

🕒 May 22

SimSpace

201 - 500

🔒 Cybersecurity

☁️ SaaS

🏛️ Government

Staff Site Reliability Engineer at SimSpace defining technical vision and leading architecture for a cyber range platform. Seeking experienced professional to address complex infrastructure challenges.

🇺🇸 United States – Remote

💵 $165k - $230k / year

🔥 Funding within the last year

💰 $39M Venture Round - SimSpace on 2025-10

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

Distributed Systems

Grafana

Kubernetes

Python

VMware

Staff SRE, AI Infrastructure

🕒 May 21

Andromeda

11 - 50

🤖 Artificial Intelligence

🤝 B2B

🔧 Hardware

Staff SRE at Andromeda responsible for the reliability of AI infrastructure. Leading incident responses and collaborating with engineering on solutions.

🇺🇸 United States – Remote

🔥 Funding within the last year

💰 $15.1M Series A - Andromeda Robotics on 2025-09

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

Linux

Python

PyTorch

Rust