Senior AI/ML Engineer – RAG, Co-Pilot Systems

Data Centers • Telecommunications • Cloud Computing

Switch is a world-leading technology infrastructure company that specializes in exascale data center ecosystems and innovative telecommunications solutions. The company focuses on providing enterprise-class, emerging hybrid cloud technology solutions while emphasizing sustainability and social responsibility. Switch operates multiple data centers across the U.S., offering colocation and edge data center services tailored to the needs of its diverse clientele, including global logistics and e-commerce firms.

501 - 1000 employees

Founded 2000

📡 Telecommunications

October 31

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

🤖 Machine Learning Engineer

🦅 H1B Visa Sponsor

Cloud

Distributed Systems

Kubernetes

Microservices

Python

PyTorch

📋 Description

• Design and Build scalable AI/ML systems, with a focus on RAG architectures (retrieval pipelines, embeddings, vector stores, LLM orchestration).
• Prototype and Deploy generative AI features into production using PyTorch, LangChain, LlamaIndex, and other modern AI frameworks.
• Operationalize Models with MLflow for experiment tracking, model registry, deployment, and governance.
• Optimize Inference Pipelines for performance, efficiency, and cost, leveraging NVIDIA GPUs, NVIDIA NIM inference microservices, and the NVIDIA GPU Operator for Kubernetes-based cluster management.
• Implement Cloud-Native Patterns: containerization, microservices, sidecars, service mesh, and event-driven pipelines.
• Deploy in Secure Environments: architect and support systems in air-gapped, regulated, or on-premises environments with strict compliance requirements.
• Ensure Reliability with robust observability (monitoring, logging, alerting, tracing) and CI/CD automation.
• Collaborate Cross-Functionally to translate product and research requirements into technical specifications for data, models, and serving.
• Continuously evaluate and integrate emerging tools in LLM orchestration, vector databases, and distributed ML infrastructure.
• Drive next-level innovation in Product Development to support our mission-critical infrastructure.
• Work in a fast-paced, high-impact environment where execution is key.
• Leverage cutting-edge technology and sustainable design principles to create world-class solutions.
• Uphold Switch's Karma philosophy: leading with integrity and empowering those around you.

🎯 Requirements

• 8+ years of professional experience in AI/ML engineering or software engineering (or equivalent).
• Deep expertise with large language models and RAG systems (retrieval, embeddings, vector search, prompt control).
• Strong proficiency in Python for AI/ML workflows and integration.
• Hands-on experience with: PyTorch, MLflow, LangChain / LlamaIndex, NVIDIA NIM, NVIDIA GPU Operator.
• Kubernetes expertise: deploying, scaling, and securing ML workloads in containers.
• Familiarity with cloud-native design patterns (microservices, service mesh, sidecars, event-driven pipelines, serverless).
• Experience building and deploying systems in air-gapped, regulated, or hybrid on-prem/cloud environments.
• Solid understanding of distributed systems, ML system scaling, and performance tuning.
• Strong engineering practices (CI/CD, testing, code reviews, version control).
• Excellent communication skills and ability to explain complex systems and trade-offs.
• Preferred: advanced degree (MS/Ph.D.) in Computer Science, Machine Learning, or related field.
• Preferred: experience with vector databases (e.g., Qdrant, Weaviate, Milvus, Pinecone) or custom vector search solutions.
• Preferred: experience with large-scale LLM serving and optimization techniques (quantization, distillation, sharding).
• Preferred: experience in regulated industries (finance, healthcare, energy, government) with compliance frameworks (SOC 2, HIPAA, FedRAMP, etc.).
• Contributions to open-source AI/ML frameworks or cloud-native tooling.

🏖️ Benefits

• Generous Benefits Package - Switch provides comprehensive coverage for you and your family that can be tailored to fit your personal needs, and more!
