Senior MLOps, Generative AI Engineer

10,000+ employees

Founded 1890

⚕️ Healthcare Insurance

Healthcare Insurance

Sentara Health is a leading healthcare system that operates over 300 sites of care in Virginia and northeastern North Carolina, including 12 acute care hospitals. The company is dedicated to providing exceptional patient care, fostering professional development, and maintaining a diverse workforce. Sentara Health aims to improve health every day and has been recognized for its clinical and operational performance, being named one of the top 15 health systems by IBM Watson Health. The organization supports its employees in achieving their full potential and encourages growth and innovation in the healthcare sector.

Senior MLOps, Generative AI Engineer

Job not on LinkedIn

🕒 6 days ago

🏈 Alabama, Florida, +23 more states – Remote

💵 $91.4k - $152.4k / year

⏰ Full Time

🟠 Senior

🗣️ LLM Engineer

AWS

Azure

Cloud

Cyber Security

Distributed Systems

Google Cloud Platform

Kubernetes

Microservices

Python

PyTorch

Tensorflow

Apply Now

Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Sentara Health

10,000+ employees

Founded 1890

⚕️ Healthcare Insurance

Healthcare Insurance

📋 Description

• Design, build, and maintain scalable ML infrastructure and pipelines supporting model training, deployment, monitoring, governance, and lifecycle management. • Develop and optimize CI/CD pipelines for machine learning and AI workloads across development, staging, and production environments. • Build reusable ML platform capabilities including feature stores, model registries, experimentation frameworks, artifact management, and deployment automation. • Implement scalable orchestration and workflow solutions for batch and real-time ML inference workloads. • Create robust monitoring systems to measure model performance, detect model drift, monitor data quality, and ensure production reliability. • Develop automation tools and self-service capabilities to improve the efficiency, scalability, and reliability of MLOps processes. • Collaborate with Data Scientists and Software Engineers to streamline the ML lifecycle from experimentation through enterprise production deployment. • Apply software engineering best practices to AI/ML systems including testing, observability, resiliency, security, versioning, and infrastructure-as-code. • Identify gaps and improvement opportunities within the organization’s ML platform ecosystem and architect scalable solutions to address them. • Support enterprise AI governance, compliance, auditability, and model risk management requirements. • Ensure platform scalability, reliability, security, and operational excellence across AI/ML systems. • Lead the architecture, design, and deployment of enterprise Generative AI solutions leveraging LLMs, foundation models, and agentic AI systems. • Design and implement Retrieval-Augmented Generation (RAG) pipelines using vector databases, embeddings, semantic search, reranking, and retrieval optimization strategies. • Build scalable LLM orchestration frameworks using technologies such as LangChain, LlamaIndex, Semantic Kernel, or equivalent frameworks. • Develop advanced prompt engineering strategies, prompt chaining, context management, and agent workflows to improve LLM accuracy and reliability. • Evaluate and implement fine-tuning, parameter-efficient tuning, and prompt-based optimization approaches for domain-specific use cases. • Build AI evaluation and benchmarking frameworks to measure hallucination rates, response quality, grounding accuracy, toxicity, bias, latency, and business performance metrics. • Implement AI safety guardrails, governance controls, content filtering, and responsible AI practices for enterprise healthcare environments. • Design scalable GenAI APIs and microservices supporting high-throughput enterprise AI applications. • Optimize GenAI systems for cost, latency, throughput, and inference performance across cloud and hybrid environments. • Integrate enterprise data sources, healthcare systems, and knowledge repositories into secure GenAI workflows. • Research and evaluate emerging GenAI technologies, open-source frameworks, and foundation models to drive innovation and continuous improvement. • Develop architecture diagrams, technical roadmaps, implementation strategies, and executive-level documentation for enterprise AI initiatives. • Collaborate with cybersecurity, compliance, and infrastructure teams to ensure secure and compliant deployment of GenAI solutions involving PHI and sensitive healthcare data. • Contribute to the development of AI platform standards, reusable GenAI accelerators, templates, and engineering best practices.

🎯 Requirements

• 5+ years of experience building and deploying production software, ML systems, or AI platforms. • 1+ years of hands-on experience building production Generative AI or LLM-based applications. • Strong programming skills in Python and experience with software engineering best practices. • Experience with major deep learning and LLM frameworks such as PyTorch, Hugging Face Transformers, TensorFlow, or equivalent. • Hands-on experience implementing RAG architectures, vector search, embeddings, prompt engineering, and LLM orchestration frameworks. • Experience with vector databases such as Pinecone, Weaviate, Chroma, FAISS, Milvus, or equivalent technologies. • Experience deploying AI/ML systems in cloud environments including AWS, Azure, or GCP. • Strong understanding of APIs, distributed systems, microservices, and scalable backend architectures. • Experience with Kubernetes, containerization, orchestration, and cloud-native infrastructure. • Experience implementing CI/CD pipelines, infrastructure automation, and MLOps best practices. • Experience building monitoring, observability, and alerting solutions for ML and AI systems. • Strong understanding of AI/ML lifecycle management, governance, model versioning, and production operations. • Experience designing secure, scalable, production-ready AI platforms and services. • Strong communication and collaboration skills with the ability to work across technical and business teams.

🏖️ Benefits

• Medical, Dental, Vision plans • Adoption, Fertility and Surrogacy Reimbursement up to $10,000 • Paid Time Off and Sick Leave • Paid Parental & Family Caregiver Leave • Emergency Backup Care • Long-Term, Short-Term Disability, and Critical Illness plans • Life Insurance • 401k/403B with Employer Match • Tuition Assistance – $5,250/year and discounted educational opportunities through Guild Education • Student Debt Pay Down – $10,000 • Reimbursement for certifications and free access to complete CEUs and professional development • Pet Insurance • Legal Resources Plan • Colleagues have the opportunity to earn an annual discretionary bonus if established system and employee eligibility criteria is met.

Apply Now

Similar Jobs

Senior Account Executive, AI Infrastructure Sales

🕒 June 18

Vultr

201 - 500

🤖 Artificial Intelligence

🤝 B2B

🔧 Hardware

Senior Account Executive at Vultr driving growth in AI Infrastructure solutions for enterprises. Collaborating with stakeholders to leverage AI Infrastructure services for innovation.

🇺🇸 United States – Remote

💵 $110k - $125k / year

💰 $329M Debt Financing - Vultr on 2025-06

⏰ Full Time

🟠 Senior

🗣️ LLM Engineer

Cloud

Senior Product Manager, AI Infrastructure

🕒 June 8

Vultr

201 - 500

🤖 Artificial Intelligence

🤝 B2B

🔧 Hardware

Senior Product Manager managing AI infrastructure capabilities for Vultr and working directly with customers. Collaborating with engineering teams to deliver large-scale GPU workload solutions.

🇺🇸 United States – Remote

💵 $130k - $165k / year

💰 $329M Debt Financing - Vultr on 2025-06

⏰ Full Time

🟠 Senior

🗣️ LLM Engineer

Cloud

Kubernetes

Senior Software Engineer, DGX Cloud AI Infrastructure

🕒 June 4

NVIDIA

10,000+ employees

🤖 Artificial Intelligence

🎮 Gaming

Senior Software Engineer at NVIDIA leading the optimization of large-scale AI workloads across GPU platforms. Focused on benchmarking, performance tuning, and infrastructure resilience with a team of experts.

🇺🇸 United States – Remote

💵 $184k - $356.5k / year

⏰ Full Time

🟠 Senior

🗣️ LLM Engineer

🦅 H1B Visa Sponsor

Distributed Systems

Node.js

Python

PyTorch

AI Infrastructure TPM II

🕒 May 22

Astreya

1001 - 5000

🔒 Cybersecurity

🏢 Enterprise

☁️ SaaS

AI Infrastructure Project Manager leading complex infrastructure deployment for AI programs across multiple sites. Drive execution, coordination, and risk management initiatives in technical project management.

🇺🇸 United States – Remote

💵 $73k - $100.8k / year

⏰ Full Time

🟠 Senior

🔴 Lead

🗣️ LLM Engineer

🦅 H1B Visa Sponsor

PMP

AI Infrastructure TPM II

🕒 May 22

Astreya

1001 - 5000

🔒 Cybersecurity

🏢 Enterprise

☁️ SaaS

Technical Project Manager leading AI infrastructure deployment across multiple business units and sites. Delivering execution excellence through project management of GPU, compute, and storage systems.

🇺🇸 United States – Remote

💵 $73k - $100.8k / year

⏰ Full Time

🟠 Senior

🔴 Lead

🗣️ LLM Engineer

🦅 H1B Visa Sponsor

PMP