Staff AI Ops Engineer

🕒 April 21

Calix

1001 - 5000 employees

Founded 2000

📡 Telecommunications

☁️ SaaS

🏢 Enterprise

💰 $50M Venture Round on 2009-08

Calix is a comprehensive solutions provider that focuses on enabling broadband service providers (BSPs) to simplify, innovate, and grow their businesses. Through its advanced broadband platform, Calix offers technologies like 10G-PON, 5G, Wi-Fi 7, and more to improve network operations, reduce downtime, and enhance the subscriber experience. The company provides managed services such as SmartLife and SmartHome, which help subscribers operate, secure, and enhance their connected lifestyles. Calix serves diverse provider types including telcos, cable operators, and municipal utilities, helping them deliver critical broadband connectivity and transform community access to digital services. With a focus on cloud technology, analytics, and transformation guidance, Calix empowers service providers to thrive in the digital age.

📋 Description

• Design, implement, and maintain scalable infrastructure for ML and GenAI applications
• Deploy, operate, and troubleshoot production ML/GenAI pipelines and services
• Build and optimize CI/CD pipelines for ML model deployment and serving
• Scale compute resources across CPU/GPU architectures to meet performance requirements
• Implement container orchestration with Kubernetes
• Architect and optimize cloud resources on GCP for ML training and inference
• Set up and maintain runtime frameworks and job management systems (Airflow, Kubeflow, MLflow, etc.)
• Establish monitoring, logging, and alerting for system observability
• Optimize system performance and resource utilization for cost efficiency
• Develop and enforce AIOps best practices across the organization

🎯 Requirements

• Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience)
• 8+ years of overall software engineering experience
• 3+ years of focused experience in DevOps/AIOps or similar ML infrastructure roles
• Proficiency in infrastructure as code (IaC) using Terraform
• Strong experience with containerization and orchestration using Docker and Kubernetes
• Demonstrated expertise in cloud infrastructure management on GCP
• Proficiency with workflow management tools such as Airflow and Kubeflow
• Strong CI/CD expertise with experience implementing automated testing and deployment pipelines
• Experience scaling distributed compute architectures across CPU and GPU hardware
• Solid understanding of system performance optimization techniques
• Experience implementing comprehensive observability solutions for complex systems
• Knowledge of monitoring and logging tools (Prometheus, Grafana, ELK stack)
• Strong proficiency in Python
• Familiarity with ML frameworks such as PyTorch and ML platforms like Vertex AI
• Excellent problem-solving skills and ability to work independently
• Strong communication skills and ability to work effectively in cross-functional teams

🏖️ Benefits

• Health insurance
• 401(k) matching
• Flexible work arrangements
• Professional development
• Possible bonuses
