
1001 - 5000 employees
Founded 2012
đČ Gambling
đź Gaming
đ„ B2C
Gambling âą Gaming âą B2C
DraftKings Inc. is a digital sports entertainment company operating a leading online sportsbook, daily fantasy sports, and casino platform that delivers real-money betting and gaming experiences via web and mobile apps. It combines sports data, analytics, and content to engage fans, provides marketing and VIP/loyalty programs, and maintains global teams across engineering, product, compliance, and customer experience while emphasizing responsible gaming.
đ„ 2 minutes ago
đșđž United States â Remote
đ” $200k - $250k / year
â° Full Time
đŽ Lead
â DevOps & Site Reliability Engineer (SRE)
Improve your chances of getting an interview by checking your resume score before you apply.

1001 - 5000 employees
Founded 2012
đČ Gambling
đź Gaming
đ„ B2C
Gambling âą Gaming âą B2C
DraftKings Inc. is a digital sports entertainment company operating a leading online sportsbook, daily fantasy sports, and casino platform that delivers real-money betting and gaming experiences via web and mobile apps. It combines sports data, analytics, and content to engage fans, provides marketing and VIP/loyalty programs, and maintains global teams across engineering, product, compliance, and customer experience while emphasizing responsible gaming.
âą Define and execute the long-term strategy for our Kubernetes platform across Google Kubernetes Engine, Amazon Elastic Kubernetes Service, RKE2, and on-premise environments, ensuring reliability, scalability, and operational consistency. âą Drive architectural decisions across critical infrastructure, including cluster lifecycle management, networking, identity and access management, observability, autoscaling, capacity planning, and cost optimization. âą Lead large-scale platform initiatives across multiple engineering teams, establishing technical direction, engineering standards, and measurable outcomes that improve platform reliability and developer experience. âą Establish and evolve reliability practices by defining service level objectives, service level indicators, and error budget frameworks that align platform performance with business priorities. âą Build automation-first infrastructure through Infrastructure as Code, GitOps workflows, self-healing systems, and internal platform tooling that improve engineering velocity and reduce operational overhead. âą Champion the responsible adoption of AI-powered engineering capabilities that improve operational efficiency, accelerate incident response, and enhance developer productivity. âą Lead critical platform incidents, drive post-incident improvements, and strengthen platform resilience through automation, capacity planning, and operational excellence. âą Mentor senior engineers, influence technical strategy across the organization, and elevate engineering excellence through architecture reviews, coaching, and technical leadership.
âą A Bachelor's Degree in Computer Science or a related technical field. âą At least 8 years of experience designing, operating, and scaling distributed cloud and on-premise infrastructure, including at least 3 years operating at the Staff, Principal, or equivalent technical leadership level. âą Proven experience leading large-scale infrastructure or platform initiatives that require cross-functional alignment and long-term technical ownership. âą Deep expertise with Kubernetes, including cluster architecture, networking, storage, security, operators, lifecycle management, and large-scale production operations. âą Extensive experience building and operating production infrastructure in AWS and Google Cloud Platform using Infrastructure as Code technologies such as Terraform, Pulumi, or similar tools. âą Strong software development experience in Go, Python, or both, with expertise in GitOps, continuous integration and continuous delivery, observability, distributed systems, Linux, and reliability engineering principles. âą Experience incorporating AI-powered tools into engineering workflows while applying sound judgment around reliability, security, and operational risk. âą Exceptional communication and leadership skills with a proven ability to mentor engineers, influence technical strategy, and drive engineering excellence. âą Experience working in regulated industries, hybrid cloud environments, contributing to open-source projects, or holding cloud certifications is preferred.
âą bonus âą equity âą benefits as applicable
Apply Nowđ„ 2 hours ago
Principal Site Reliability Engineer shaping the strategy for Kubernetes platform and driving architectural decisions. Leading platform initiatives at DraftKings with a focus on reliability and automation.
đșđž United States â Remote
đ” $200k - $250k / year
â° Full Time
đŽ Lead
â DevOps & Site Reliability Engineer (SRE)
AWS
Cloud
Distributed Systems
Google Cloud Platform
Kubernetes
Linux
Python
Terraform
Go
đ„ 6 hours ago
Director of DevOps leading a team of engineers at Convoso, an AI-powered contact center platform. Responsible for developing and optimizing the platform and ensuring service reliability.
đșđž United States â Remote
đ” $220k - $260k / year
â° Full Time
đŽ Lead
â DevOps & Site Reliability Engineer (SRE)
Ansible
AWS
Chef
Cloud
Docker
Google Cloud Platform
Jenkins
Kubernetes
Linux
Puppet
SaltStack
SDLC
đ„ 10 hours ago
Principal Operations Engineer overseeing critical operations in data centers for Fluidstack. Leading on-call escalation, root cause analysis, and operational excellence in real-time situations.
đșđž United States â Remote
đ” $150k - $250k / year
â° Full Time
đŽ Lead
â DevOps & Site Reliability Engineer (SRE)
đ 2 days ago
10,000+ employees
đ Cybersecurity
đ€ Artificial Intelligence
Site Reliability Engineer blending software engineering, automation, and operations expertise. Building scalable platforms and enabling high-velocity delivery for critical Defense systems.
đșđž United States â Remote
đ” $164.4k - $215.1k / year
â° Full Time
đ Senior
đŽ Lead
â DevOps & Site Reliability Engineer (SRE)
đŠ H1B Visa Sponsor
Cloud
Distributed Systems
Grafana
Kubernetes
Linux
Prometheus
Python
Splunk
đ 4 days ago
Staff Software Engineer responsible for enhancing reliability and security in production environments. Collaborating on projects to scale systems at Coinbase.
đșđž United States â Remote
đ” $218k - $256.5k / year
đ° $21.4M Post-IPO Equity on 2022-11
â° Full Time
đŽ Lead
â DevOps & Site Reliability Engineer (SRE)
đŠ H1B Visa Sponsor
AWS
Azure
Cloud
Google Cloud Platform
Ruby
Terraform
Go