Site Reliability Engineering Technical Leader

🕒 Yesterday

Cisco

10,000+ employees

Founded 1984

🔧 Hardware

🔐 Security

🏢 Enterprise

Cisco is a multinational technology company that provides networking hardware, software, and services to enterprises, service providers, and governments. It builds routers, switches, optical transceivers, programmable silicon, and edge computing platforms, and offers security, collaboration (Webex), observability, and AI-enabled software and support services to help organizations design, operate, and secure large-scale networks and data centers. Cisco also delivers professional services, training, and cloud-managed solutions to support digital transformation and AI-ready infrastructure.

📋 Description

• Responsible for designing, developing, testing, and deploying advanced AI-driven software features for data center networks
• Strong interpersonal skills
• Comfortable collaborating with fellow engineers, cross-functional engineering teams, and internal clients
• Create and implement innovative, high-quality capabilities to provide our clients with the best possible experience

🎯 Requirements

• Bachelor of Engineering or Technology
• 10+ years of experience designing and building scalable, reliable networking solutions for AI/ML infrastructure and high-performance computing
• Strong expertise in Cisco Data Center Networking technologies and ACI networks, including Routing, Switching, Nexus, VPC, VDC, VLAN, VXLAN, and BGP
• Proven leadership in driving strategic automation initiatives
• Experience managing networking for GPU cluster environments
• Experience implementing AI-based observability tools
• Skilled in creating documentation and training materials
• Proficiency in Terraform and Ansible for Infrastructure as Code (IaC)
• Strong programming skills and a solid grasp of software engineering concepts, including common data structures and standard algorithms, object-oriented design, distributed computing, and cloud computing paradigms
• Expertise in AI Fabric and Networking, with a deep understanding of high-performance networking for AI/ML workloads
• Ability to implement and use AI-based observability tools

🏖️ Benefits

• Medical, dental and vision insurance
• A 401(k) plan with a Cisco matching contribution
• Paid parental leave
• Short and long-term disability coverage
• Basic life insurance
• 10 paid holidays per full calendar year, plus 1 floating holiday for non-exempt employees
• 1 paid day off for employee’s birthday
• Paid year-end holiday shutdown
• 4 paid days off for personal wellness determined by Cisco
• Non-exempt employees receive 16 days of paid vacation time per full calendar year
• Exempt employees participate in Cisco’s flexible vacation time off program
• 80 hours of sick time off provided on hire date and each January 1st thereafter
• Up to 80 hours of unused sick time carried forward from one calendar year to the next
• Additional paid time away may be requested for critical or emergency issues for family members
• Optional 10 paid days per full calendar year to volunteer
• Eligible to earn annual bonuses for non-sales roles

Similar Jobs

🕒 Yesterday

HBK - Hottinger Brüel & Kjær

1001 - 5000

🚀 Aerospace

⚡ Energy

Software Architect leading architectural direction on DevOps/AI/LLM technologies for ReliaSoft's cloud and desktop products. Collaborating with teams to enhance product capabilities and modernize systems.

Cloud

🕒 Yesterday

Mind Computing

11 - 50

🤖 Artificial Intelligence

DevOps Engineer at Mind Computing responsible for AWS infrastructure and automation. Implementing cloud architecture and CI/CD pipelines for project with Department of Veterans Affairs.

AWS

Cloud

EC2

Python

Terraform

🕒 Yesterday

Origami Risk

501 - 1000

⚕️ Healthcare Insurance

🏢 Enterprise

DevOps Engineer responsible for deploying product updates and addressing production issues. Supporting cloud-native applications and collaborating with Architecture and Engineering teams for Origami Risk.

ASP.NET

AWS

Azure

Cloud

DNS

Docker

DynamoDB

ElasticSearch

Kubernetes

Linux

NGINX

React

SQL

Terraform

TypeScript

.NET

🕒 Yesterday

VetsEZ

201 - 500

🤝 B2B

☁️ SaaS

🏛️ Government

Release Train Engineer & DevOps Systems Lead leading Agile Release Train for federal government project. Providing oversight for CI/CD pipelines and collaboration across multiple Agile teams.

AWS

Cloud

Docker

Jenkins

Kubernetes

🕒 Yesterday

Filevine

201 - 500

☁️ SaaS

🤖 Artificial Intelligence

Senior Site Reliability Engineer focused on Observability and Incident Management for Filevine, a Legal AI company. Partnering with engineering teams to enhance system visibility, reliability, and operational excellence.

AWS

Cloud

Grafana

Kubernetes

Prometheus

Python