
10,000+ employees
Founded 1978
🛒 Retail
👥 B2C
💰 Debt Financing on 2007-07
Retail • B2C
The Home Depot is a leading home improvement retailer, offering a wide range of building materials, home improvement products, lawn and garden products, and related services. The company operates both physical stores and an online platform, providing comprehensive solutions for DIY enthusiasts, professional contractors, and homeowners. The Home Depot is committed to diversity, equity, and inclusion, providing employment opportunities and benefits to a diverse workforce. Additionally, the company places a high emphasis on customer service and associate engagement to maintain its position as a trusted leader in the home improvement industry.
🕒 April 29
🇺🇸 United States – Remote
💵 $90k - $180k / year
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)

• Develops, tests, deploys, and maintains software for internal platforms
• Designs, develops, and maintains tools for reliability engineering teams
• Extends internal reliability tools using Kubernetes and Terraform on Google Cloud Platform
• Deploys and maintains production logging, tracing, and profiling systems
• Identifies and automates repetitive operational tasks
• Maintains and extends SLO and Critical User Journey platforms
• Participates in on-call rotation and contributes to incident response
• 3-5 years of experience in Site Reliability Engineering, Platform Engineering, DevOps, or Infrastructure Engineering
• Hands-on experience with Google Cloud Platform (GCP), including GKE, GCS, BigQuery, Cloud Pub/Sub, Cloud Logging, IAM, and Workload Identity
• Strong Kubernetes experience: deploying and managing workloads on GKE or similar managed Kubernetes services, writing and debugging Helm charts, managing namespaces, RBAC, and service accounts, and troubleshooting issues
• Experience with infrastructure-as-code tools, particularly Terraform for cloud resource management
• Proficiency in one or more of: Go, Python, JavaScript/TypeScript, YAML
• Experience with observability platforms: deploying, configuring, or operating log aggregation, distributed tracing, metrics, dashboarding, or continuous profiling
• Practical understanding of SLOs, SLIs, and error budgets
• Experience with synthetic monitoring or performance testing frameworks (k6, Playwright, Selenium, Locust, or similar)
• Familiarity with incident management and on-call practices: blameless post-mortems, runbook development, and incident communication
• Experience with CI/CD pipelines using GitHub Actions, Spinnaker, ArgoCD, or similar
• Understanding of deployment strategies (blue/green, canary, rolling)
• Health insurance
• 401(k) matching
• Flexible work hours
• Paid time off
• Remote work options
🕒 April 29
Senior Site Reliability Engineer managing multi-cloud infrastructure at Satsuma. Ensuring reliability, scalability, and operational posture using AI-assisted development.
AWS
Azure
Cloud
Google Cloud Platform
Grafana
Kubernetes
Terraform
🕒 April 28
Senior Site Reliability Engineer managing AWS infrastructure and Kubernetes for autonomous systems testing. Collaborating across teams to ensure system reliability and security.
🇺🇸 United States – Remote
💵 $145k - $185k / year
💰 $30M Series B on 2022-11
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
AWS
Cloud
DNS
Grafana
Kubernetes
Linux
Node.js
Prometheus
Python
Terraform
🕒 April 28
Senior Manager of Cloud and DevOps Engineering managing daily operations of AWS and Kubernetes infrastructure across businesses. Leading a team and working closely with senior leadership for operational excellence.
🇺🇸 United States – Remote
💰 $110M Series A on 2021-12
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
AWS
Cloud
Docker
EC2
Kubernetes
Terraform
🕒 April 28
Cloud Infrastructure Engineer managing cloud resources for large-scale infrastructure. Supporting development teams in a microservices environment to streamline deployments and optimize performance.
🇺🇸 United States – Remote
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
Airflow
Azure
BigQuery
Cloud
DNS
Google Cloud Platform
Grafana
Kafka
Kubernetes
Matillion
Microservices
Postgres
Prometheus
Python
Redis
Spark
SQL
Terraform
Vault
Go
🕒 April 27
Senior Site Reliability Engineer for Veeam's Government & Sovereign Cloud environments. Building a global SRE function with an emphasis on high availability and operational excellence.
🇺🇸 United States – Remote
💵 $138.9k - $231.4k / year
💰 $500M Private Equity Round on 2019-01
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🦅 H1B Visa Sponsor
AWS
Azure
Cloud
Dagger
Distributed Systems
Grafana
Java
JavaScript
Kubernetes
Prometheus
Terraform
TypeScript
Go