Site Reliability Engineer

Job not on LinkedIn

🔥 18 minutes ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of General Dynamics Information Technology

General Dynamics Information Technology

10,000+ employees

Founded 1954

🔒 Cybersecurity

🤖 Artificial Intelligence

Defense • Cybersecurity • Artificial Intelligence

General Dynamics Information Technology is a company at the forefront of technological innovation, offering a wide range of services including consulting, digital modernization, and application services. The company is heavily involved in implementing solutions related to artificial intelligence, cloud computing, cybersecurity, high-performance computing, and quantum technologies. GDIT is committed to supporting government and defense sectors, providing mission-critical services such as logistics and supply chain management, intelligence, and homeland security. The company also focuses on diverse and inclusive hiring practices and actively promotes employee well-being. Through its digital accelerator solutions and pioneering use of emerging technologies, GDIT aims to propel agencies' missions forward and address complex technological challenges.

📋 Description

• Build/Design and maintain highly available, scalable systems across cloud and on-prem environments. • Develop automation solutions that improves observability, speeds recovery, and eliminates manual operational work. • Implement monitoring, alerting, and performance tuning strategies that ensure system health. • Collaborate with development and infrastructure teams to design reliable architectures and CI/CD pipelines. • Conduct root cause analysis and drive systemic improvements to prevent future incidents. • Champion SRE best practices such as SLIs/SLOs, error budgets, and automated incident response. • Provide inputs into proposal operations in area of subject matter expertise, collaborating on solution elements and providing written narratives that describe technical solution elements designed for a specific opportunity

🎯 Requirements

• 15+ years of related experience • Bachelor's with 15 years or an additional 4 years of work experience in lieu of degree • Strong scripting and automation skills (Python, Bash, PowerShell, etc.) • Hands-on experience with monitoring tools (Prometheus, Grafana, Splunk, ELK, Datadog, etc.) • Familiarity with Kubernetes, container orchestration, and modern CI/CD pipelines • Understanding of networking, Linux system internals, and distributed systems • Ability to troubleshoot complex technical issues across the stack • US Citizenship Required • Candidate must possess active secret to start, and ability to attain Top Secret/SCI

🏖️ Benefits

• variety of medical plan options • Health Savings Accounts • dental plan options • vision plan • 401(k) plan offering the ability to contribute both pre and post-tax dollars up to the IRS annual limits and receive a company match • full flex work weeks where possible • a variety of paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave • 15 days of paid leave per calendar year • an additional 10 paid holidays per year • GDIT Paid Family Leave program provides a total of up to 160 hours of paid leave in a rolling 12 month period for eligible employees • short and long-term disability benefits • life insurance • accidental death and dismemberment insurance • personal accident insurance • critical illness insurance • business travel and accident insurance

Apply Now

Similar Jobs

🔥 17 hours ago

NBCUniversal

10,000+ employees

📱 Media

DevOps Engineer at NBCUniversal working on video streaming automation solutions. Collaborating with teams for high-visibility projects in a fast-paced environment.

Ansible

AWS

Azure

Chef

Cloud

Cyber Security

Docker

Google Cloud Platform

Grafana

JavaScript

Kubernetes

Linux

Python

Splunk

Terraform

Unix

🔥 21 hours ago

Site Reliability Engineer designing, building, and maintaining highly available systems for health technology company. Collaborating with software developers to improve reliability and automate processes.

🇺🇸 United States – Remote

💵 $130k - $160k / year

💰 Venture Round on 2020-07

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

AWS

Azure

Cloud

Google Cloud Platform

Kubernetes

Python

Ruby

Go

🔥 23 hours ago

CACI International Inc

10,000+ employees

🔒 Cybersecurity

Cloud DevOps Engineer managing CI/CD pipelines and applications in AWS cloud. Collaborating on security initiatives and providing DevSecOps training with Agile teams.

Ansible

AWS

Cloud

Docker

EC2

Firewalls

Grafana

Java

JavaScript

Kubernetes

OpenShift

Prometheus

Python

SDLC

Splunk

Terraform

Go

🔥 23 hours ago

Alkami Technology

501 - 1000

🏦 Banking

💳 Fintech

☁️ SaaS

Site Reliability Engineer at Alkami developing and testing code for application releases. Collaborating with teams to improve delivery and participate in on-call rotations.

🇺🇸 United States – Remote

💵 $110k - $137.5k / year

💰 $300M Post-IPO Debt - Alkami Technology on 2025-03

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Ansible

Docker

Jenkins

Kubernetes

Postgres

Python

Redis

🔥 23 hours ago

Amgen

10,000+ employees

🧬 Biotechnology

💊 Pharmaceuticals

🔬 Science

DevOps Engineer responsible for designing, developing, and maintaining critical software applications for a biotech company. Collaborate with teams to deliver innovative solutions that impact patient care.

AWS

Cloud

EC2

Jenkins

Python