Staff Site Reliability Operations Engineer

🔥 0 minutes ago

🏄 California – Remote

💵 $136k - $265.7k / year

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

Calix

1001 - 5000 employees

Founded 2000

📡 Telecommunications

☁️ SaaS

🏢 Enterprise

💰 $50M Venture Round on 2009-08

Calix is a comprehensive solutions provider that focuses on enabling broadband service providers (BSPs) to simplify, innovate, and grow their businesses. Through its advanced broadband platform, Calix offers technologies like 10G-PON, 5G, Wi-Fi 7, and more to improve network operations, reduce downtime, and enhance the subscriber experience. The company provides managed services such as SmartLife and SmartHome, which help subscribers operate, secure, and enhance their connected lifestyles. Calix serves diverse provider types including telcos, cable operators, and municipal utilities, helping them deliver critical broadband connectivity and transform community access to digital services. With a focus on cloud technology, analytics, and transformation guidance, Calix empowers service providers to thrive in the digital age.

📋 Description

• Architect, optimize, and troubleshoot complex networking infrastructure.
• Design, scale, and optimize our unified observability platform.
• Deploy machine learning models and automated anomaly detection.
• Drive the architecture, scaling, and security of production Google Kubernetes Engine (GKE) clusters.
• Tune and maintain high-throughput Apache Kafka clusters.
• Ensure performance, scalability, and disaster recovery readiness across PostgreSQL, AlloyDB, and BigQuery.
• Integrate AIOps insights with Grafana workflows to automate triage and analysis.
• Coach engineers on advanced debugging techniques and distributed systems.

🎯 Requirements

• 8+ years in SRE, Production Engineering, or Distributed Systems infrastructure roles.
• Deep technical knowledge and debugging mastery across all OSI layers.
• Expert-level mastery of Google Kubernetes Engine (GKE) internals.
• Proven track record managing high-throughput Apache Kafka pipelines.
• Deep, hands-on experience deploying and managing Grafana Enterprise/Cloud.
• Advanced, production-scale expertise utilizing HashiCorp Terraform.
• High proficiency in Go and Python.

🏖️ Benefits

• As part of the total compensation package, this role may be eligible for a bonus.

Similar Jobs

🕒 Yesterday

ClassWallet

11 - 50

💳 Fintech

📚 Education

🏛️ Government

DevOps Engineer optimizing cloud infrastructure and deployment pipelines for fintech company. Redefining public funds management and ensuring system reliability with high compliance standards.

🕒 Yesterday

Domino Data Lab

201 - 500

🤖 Artificial Intelligence

🏢 Enterprise

☁️ SaaS

Staff Site Reliability Engineer working on AI-assisted reliability tooling at Domino Data Lab. Leading incident response and enhancing system observability for critical services.

🕒 2 days ago

General Dynamics Information Technology

10,000+ employees

🔒 Cybersecurity

🤖 Artificial Intelligence

DevSecOps Software Developer SME designing and maintaining automation and integration capabilities for cloud and software delivery environments. Enhance software delivery and reduce manual work for mission-focused solutions.

🕒 2 days ago

TrueML

51 - 200

💳 Fintech

💸 Finance

👥 B2C

Sr. Security Engineer leading integration of security across the software development lifecycle at TrueML. Engaging in security automation, cloud security, and innovative AI solutions.

🕒 2 days ago

Stord

1001 - 5000

☁️ SaaS

🚗 Transport

🛍️ eCommerce

Staff Site Reliability Engineer focusing on security to enhance GCP and CI/CD processes. Join Stord in advancing tech solutions for better consumer experiences.