Senior AI/ML Engineer – RAG, Co-Pilot Systems

October 31

Apply Now
Logo of Switch

Switch

Data Centers • Telecommunications • Cloud Computing

Switch is a world-leading technology infrastructure company that specializes in exascale data center ecosystems and innovative telecommunications solutions. The company focuses on providing enterprise-class, emerging hybrid cloud technology solutions while emphasizing sustainability and social responsibility. Switch operates multiple data centers across the U. S. , offering colocation and edge data center services tailored to the needs of its diverse clientele, including global logistics and e-commerce firms.

501 - 1000 employees

Founded 2000

📡 Telecommunications

📋 Description

• Design and Build scalable AI/ML systems, with a focus on RAG architectures (retrieval pipelines, embeddings, vector stores, LLM orchestration). • Prototype and Deploy generative AI features into production using PyTorch, LangChain, LlamaIndex, and other modern AI frameworks. • Operationalize Models with MLflow for experiment tracking, model registry, deployment, and governance. • Optimize Inference Pipelines for performance, efficiency, and cost, leveraging NVIDIA GPUs, NVIDIA NIM inference microservices, and the NVIDIA GPU Operator for Kubernetes-based cluster management. • Implement Cloud-Native Patterns: containerization, microservices, sidecars, service mesh, and event-driven pipelines. • Deploy in Secure Environments: architect and support systems in air-gapped, regulated, or on-premises environments with strict compliance requirements. • Ensure Reliability with robust observability (monitoring, logging, alerting, tracing) and CI/CD automation. • Collaborate Cross-Functionally to translate product and research requirements into technical specifications for data, models, and serving. • Continuously evaluate and integrate emerging tools in LLM orchestration, vector databases, and distributed ML infrastructure. • Drive next-level innovation in Product Development to support our mission-critical infrastructure. • Work in a fast-paced, high-impact environment where execution is key. • Leverage cutting-edge technology and sustainable design principles to create world-class solutions. • Uphold Switchs Karma philosophy leading with integrity and empowering those around you.

🎯 Requirements

• 8+ years professional experience in AI/ML engineering or software engineering (or equivalent). • Deep expertise with large language models and RAG systems (retrieval, embeddings, vector search, prompt control). • Strong proficiency in Python for AI/ML workflows and integration. • Hands-on experience with: PyTorch, MLflow, LangChain / LlamaIndex, NVIDIA NIM, NVIDIA GPU Operator. • Kubernetes expertise: deploying, scaling, and securing ML workloads in containers. • Familiarity with cloud-native design patterns (microservices, service mesh, sidecars, event-driven pipelines, serverless). • Experience building and deploying systems in air-gapped, regulated, or hybrid on-prem/cloud environments. • Solid understanding of distributed systems, ML system scaling, and performance tuning. • Strong engineering practices (CI/CD, testing, code reviews, version control). • Excellent communication skills and ability to explain complex systems and trade-offs. • Preferred advanced degree (MS/Ph.D.) in Computer Science, Machine Learning, or related field. • Preferred experience with vector databases (e.g., Qdrant, Weaviate, Milvus, Pinecone) or custom vector search solutions. • Preferred experience with large-scale LLM serving and optimization techniques (quantization, distillation, sharding). • Preferred experience in regulated industries (finance, healthcare, energy, government) with compliance frameworks (SOC 2, HIPAA, FedRAMP, etc.). • Contributions to open-source AI/ML frameworks or cloud-native tooling.

🏖️ Benefits

• Generous Benefits Package - Switch provides comprehensive coverage for you and your family that can be tailored to fit your personal needs, and more!

Apply Now

Similar Jobs

October 31

Quantiphi

1001 - 5000

🤖 Artificial Intelligence

🏢 Enterprise

📚 Education

Machine Learning Engineer responsible for designing and deploying ML models using Google Cloud AI tools. Join Quantiphi to work on impactful AI and cloud solutions.

🇺🇸 United States – Remote

💰 Series A on 2019-12

⏰ Full Time

🟡 Mid-level

🟠 Senior

🤖 Machine Learning Engineer

🦅 H1B Visa Sponsor

October 31

Montauk Capital

1 - 10

💸 Finance

⚡ Energy

☁️ SaaS

AI / ML Engineer designing and deploying intelligent systems for grid analytics at Stealth Grid Co. Focusing on optimization of grid interconnections through advanced machine learning techniques.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

🤖 Machine Learning Engineer

October 30

Shopmonkey

51 - 200

🤝 B2B

☁️ SaaS

Senior ML Engineer building production-ready AI agents for automotive needs at Shopmonkey. Collaborating in a distributed team to shape the future of automotive care technology.

🇺🇸 United States – Remote

💵 $163k - $195k / year

⏰ Full Time

🟠 Senior

🤖 Machine Learning Engineer

October 30

Serve Robotics

51 - 200

🚗 Transport

🤖 Artificial Intelligence

Software Engineer developing and maintaining data processing pipelines for ML infrastructure at Serve Robotics. Collaborating with teams to refine data attributes and classifications for a rapidly expanding fleet.

🇺🇸 United States – Remote

💵 $155k - $190k / year

💰 $30M Venture Round on 2023-08

⏰ Full Time

🟡 Mid-level

🟠 Senior

🤖 Machine Learning Engineer

🦅 H1B Visa Sponsor

October 29

Scopic

201 - 500

Machine Learning Engineer at Scopic developing AI/ML services and pipelines in cloud environments. Seeking innovative engineers with expertise in Python and machine learning for a remote position.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

🤖 Machine Learning Engineer

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com