Senior Software Engineer – Reliability Engineering

🕒 May 14

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of The Home Depot

The Home Depot

10,000+ employees

Founded 1978

🛒 Retail

👥 B2C

💰 Debt Financing on 2007-07

Retail • B2C

The Home Depot is a leading home improvement retailer, offering a wide range of building materials, home improvement products, lawn and garden products, and related services. The company operates both physical stores and an online platform, providing comprehensive solutions for DIY enthusiasts, professional contractors, and homeowners. The Home Depot is committed to diversity, equity, and inclusion, providing employment opportunities and benefits to a diverse workforce. Additionally, the company places a high emphasis on customer service and associate engagement to maintain its position as a trusted leader in the home improvement industry.

📋 Description

• Develops, tests, deploys, and maintains software, with a clear understanding of the value the software is to provide • Takes on new opportunities and tough challenges with a sense of urgency, high energy, and enthusiasm • Consistently achieves results, even under tough circumstances • Develops test suites (functional, destructive, etc) to enable success, rapid deployment of code to production • Takes a broad view when approaching issues; using a global lens • Learns through successful and failed experiments when tackling new problems • Actively seeks ways to grow and be challenged using both formal and informal development channels • Collaborates with other team members in agile processes • Creates new and better ways for the organization to be successful • Works with the Product Team to ensure user stories are valuable, developer ready, easy to understand, and testable • Delivers multi-mode communications that convey a clear understanding of the unique needs of different audiences • Adapts approach and demeanor in real-time to match the shifting demands of different situations • Relates openly and comfortably with diverse groups of people • Helps grow junior engineers by providing guidance on modern software development frameworks and leading technical discussions

🎯 Requirements

• Must be eighteen years of age or older. • Must be legally permitted to work in the United States. • GCP Cloud Infrastructure — BigQuery analytics, ADC auth, cloud-native services • Observability — Grafana, Prometheus, Kibana/Elasticsearch (WES logs), OCP Health Dashboards • Terraform Enterprise — Infrastructure as Code • GitHub — SCM • GH Copilot + AI Agents — AI-accelerated incident analysis, automated remediation workflows, prompt-engineered operational tooling • SRE Practices — Production Readiness Review, Capacity Planning, Change Validation, Prod Support, Post-Mortems, SLO Definition & Tracking • ServiceNow — Incident, Problem, and Change management; trend analysis; RCA grouping • BigQuery — Incident analytics, problem candidate identification, operational reporting • PagerDuty — On-call scheduling, escalation paths, push-button paging • Rundeck — Self-heal automation, push-button remediation jobs • Atlassian (Jira/Confluence) — RCA documentation, runbooks, architecture diagrams, onboarding • CyberArk — Privileged access for WMS/DFC log pulls and node access • Manhattan WMS — Warehouse Management System operations, RF/UI/LM node support • Python Automation — Operational scripting, BQ pipelines, alert correlation, report generation.

🏖️ Benefits

• Health insurance • 401(k) matching • Flexible work hours • Paid time off • Remote work options

Apply Now

Similar Jobs

🕒 May 14

Avaya

5001 - 10000

🤝 B2B

Site Reliability Engineer at Avaya driving stability and performance across Azure and GCP platforms. Collaborating with DevOps and Security teams to manage incidents and optimize operations.

Ansible

Azure

Cloud

Google Cloud Platform

Terraform

🕒 May 13

DMI (Digital Management, LLC)

1001 - 5000

☁️ SaaS

🏢 Enterprise

DevOps Engineer at DMI building and automating processes across environments. Supporting modernization and transition to operations with expertise in CI/CD, Linux, and DevOps.

Cloud

Linux

🕒 May 13

DMI (Digital Management, LLC)

1001 - 5000

☁️ SaaS

🏢 Enterprise

Mid-Level DevOps Engineer at DMI building and automating CI/CD processes across environments. Supporting modernization and operational transitions for various cloud and hosting platforms.

Cloud

Linux

🕒 May 13

Slingshot Aerospace

51 - 200

🚀 Aerospace

🤖 Artificial Intelligence

🔐 Security

Senior DevOps Engineer at Slingshot Aerospace designing cloud-native infrastructure and managing CI/CD pipelines for scalable space operations. Collaborating across teams to enhance deployment workflows.

AWS

Cloud

Docker

Kubernetes

Linux

Terraform

🕒 May 13

Wikimedia Foundation

501 - 1000

🤝 Non-profit

📚 Education

📱 Media

Senior Site Reliability Engineer with Wikimedia Foundation supporting platform for Wikipedia. Focus on operational tasks, collaboration, and continual improvement of infrastructure reliability.

Ansible

Kubernetes

Linux

Puppet

Python

Ruby

Go