
51 - 200 employees
☁️ SaaS
🏢 Enterprise
🤝 B2B
Test Double is a software consultancy firm that focuses on improving the way the world builds software. They provide a range of services including software delivery, product management, legacy system modernization, DevOps, and technical recruitment. Test Double embeds with client teams to solve tough software problems, emphasizing strategic advice and hands-on involvement. They aim to accelerate software investment returns by balancing speed and agility with thorough testing and maintainable code. Additionally, Test Double engages in open source contributions and is committed to community building and diversity.
🕒 May 28
🏄 California – Remote
💵 $170k - $190k / year
⏰ Full Time
🔴 Lead
⛑ DevOps & Site Reliability Engineer (SRE)

• We help client teams use DevOps practices to create more observable, sustainable, and predictable environments by integrating operations capabilities into development teams.
• Delivering primary DevOps solutions to clients across:
• Cloud Architecture and Deployment in at least one major cloud provider
• Hands-on experience running production services on Kubernetes and managed container platforms, including rollout strategies, autoscaling, and observability
• Able to weigh tradeoffs between container orchestration (k8s vs. ECS) and serverless container platforms when advising clients
• Comfortable with event-driven serverless (Lambda, Cloud Functions) and knowing when it's the right tool versus a long-running container
• Infrastructure as Code
• Configuration Management
• CI/CD Pipelines
• Monitoring and Observability
• Creating high-quality infrastructure to meet the needs of its users and businesses
• Applying security best practices in deployment pipelines and cloud environments
• Helping clients achieve Service Level Agreements and Service Level Objectives by providing observable infrastructure
• Implementing high-availability and disaster recovery architecture
• Identifying technology, communication, and process issues and proposing improvements
• Sharing best practices for cloud architecture that are fault-tolerant, highly available, and cost-effective for the client’s business
• Mentoring by sharing experience and knowledge with client developers and operations teams so they are well-positioned to succeed, even long after we're gone
• Collaborating internally with other Test Double agents on infrastructure best practices
• Learn new frameworks, languages, tech, and techniques to adapt to changing client needs
• Communicate openly and honestly with everyone, even if the news will not be positively received
• 8+ years of experience in software development
• 3+ years of experience in DevOps, cloud computing, or operations
• 3+ years of experience in consulting
• Strong understanding of Configuration Management tools like Ansible, Chef, or Puppet
• Strong understanding of Infrastructure as Code tools like Terraform
• CI/CD Pipelines like Jenkins, CircleCI, GitHub Actions, GitLab CI/CD
• Demonstrated ability to direct AI in delivery: defining problems, applying quality checks, and producing consistent results, with examples of improving team workflows
• Containerized deployment strategies like Kubernetes, AWS Elastic Container Service, Docker
• Observability and monitoring tools like CloudWatch, Grafana, and Datadog
• Low ego, high emotional intelligence (EQ), and a mindset of continuous improvement
• Experience leading teams in decomposing work and maintaining a healthy backlog that is valuable to the business
• Experience balancing competing priorities and influencing teams towards high-quality software development practices
• Ability to communicate effectively across different levels or positions within an organization
• Proficiency in designing, architecting, and refactoring systems of moderate complexity worked on by teams of 10+
• Ability to resolve conflicts and issues within the delivery team
• Experience in mentoring and leading the technical direction of software engineers
• Expertise in designing and delivering systems to production using one or more of the following: Ruby, Go, Python, JavaScript/TypeScript.
• Remote First - Work from anywhere, travel required for critical client and company functions
• Time off: 5 weeks flexible time off (vacation and sick time) + 10 Paid Holidays, 2-week sabbatical after 5 years
• Company Ownership: ESOP (Employee Stock Ownership Program) - Test Double is 100% employee owned
• Family Support: 8 weeks paid parental leave at 100% of salary, plus additional unpaid leave
• Retirement: Company contribution of 3% of salary to 401(k)
• Continuing Education: 1 week of conference attendance (and up to $3,000 of expenses)
• Health: Premium health/dental/vision insurance (80-100% covered)
• New computer hardware purchase every 3 years
• Co-working space reimbursement (1/2 rent up to $500 monthly)
• Company-wide in-person retreat every ~2 years
• Short and Long Term Disability
• Life Insurance
🕒 May 27
1 - 10
VP of Site Reliability managing SRE and operational functions for banking AI software company. Leading engineering practices and ensuring reliable platform deployment for financial institutions.
🕒 May 26
Director of Application & DevSecOps Security leading secure software development practices at Gainwell. Overseeing application and API security while guiding engineering teams in best security practices.
🇺🇸 United States – Remote
💵 $150.2k - $214.5k / year
💰 Grant on 2023-06
⏰ Full Time
🔴 Lead
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
🕒 May 23
Principal Service Reliability Engineer at Prescryptive ensuring platform reliability across healthcare technology systems. Focusing on technical leadership and operational excellence for cloud-based infrastructures.
🇺🇸 United States – Remote
💵 $150k - $205k / year
⏰ Full Time
🔴 Lead
⛑ DevOps & Site Reliability Engineer (SRE)
🕒 May 22
Staff Site Reliability Engineer at SimSpace defining technical vision and leading architecture for a cyber range platform. Seeking experienced professional to address complex infrastructure challenges.
🇺🇸 United States – Remote
💵 $165k - $230k / year
🔥 Funding within the last year
💰 $39M Venture Round - SimSpace on 2025-10
⏰ Full Time
🔴 Lead
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
🕒 May 21
Staff SRE at Andromeda responsible for the reliability of AI infrastructure. Leading incident responses and collaborating with engineering on solutions.
🇺🇸 United States – Remote
🔥 Funding within the last year
💰 $15.1M Series A - Andromeda Robotics on 2025-09
⏰ Full Time
🔴 Lead
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor