MLOps Engineer

Job not on LinkedIn

🔥 0 minutes ago

🇺🇸 United States – Remote

💵 $185k - $200k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

🤖 Machine Learning Engineer

🦅 H1B Visa Sponsor

info
Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of dv01

dv01

51 - 200 employees

Founded 2014

💸 Finance

💳 Fintech

☁️ SaaS

Finance • Fintech • SaaS

dv01 is a data management and analytics platform that serves as a crucial link between lenders and capital markets. It specializes in providing standardized loan-level data and integrated analytics tools to facilitate easier access and analysis of structured finance products. The platform supports various asset classes, including consumer unsecured loans, mortgages, auto loans, and student loans, and offers insights through features like ESG Data Enrichment and Portfolio Surveillance. dv01's offerings help transform raw, error-laden data into trustworthy, actionable insights, aiding investment banks, hedge funds, asset managers, and institutional investors in making smarter, data-driven financial decisions. The company addresses outdated technologies in the structured products market, offering a modern, cloud-based solution to enhance data integrity and optimize financial strategies.

📋 Description

• Build and operate the ML lifecycle platform. Own the tooling that makes model development reproducible and production-ready, with MLflow (or comparable systems) at the center: experiment tracking, model registry, artifact and metadata management, and versioned, repeatable training and inference pipelines. • Own CI/CD and deployment for ML workloads. Build automated pipelines that move models from notebook to production safely, including packaging, containerization, automated testing and validation, staged rollouts, and rollback. • Make models observable and reliable in production. Stand up monitoring for model and service health, including latency, drift, data-quality, and cost signals, with alerting and clear runbooks so issues surface and resolve quickly. • Build the cloud-native foundations. Contribute to and manage containerized workloads on Kubernetes and codify infrastructure with infrastructure-as-code tooling such as Terraform, keeping environments consistent, secure, and reproducible. • Establish sensible guardrails. Implement infrastructure-level governance for ML systems, including access controls, deployment policies, and auditability, partnering with security and compliance to align with our risk and regulatory requirements. • Enable and mentor the teams you support. Define repeatable patterns and shared services that reduce friction for data and application teams, provide technical guidance and mentorship to junior engineers, and contribute to the direction of dv01's MLOps practices.

🎯 Requirements

• 4–7 years of relevant experience in platform engineering, DevOps, or MLOps, with solid experience operating systems in production. • Hands-on experience with ML lifecycle tooling. You've built or operated experiment tracking, model registry, and pipeline workflows using MLflow or similar platforms (e.g., Weights & Biases, Kubeflow, SageMaker, Vertex AI Pipelines). This is core to the role. • Strength in cloud-native infrastructure. You're comfortable with Kubernetes, containerized workloads, and infrastructure-as-code tools such as Terraform. • CI/CD fluency. You've designed and maintained automated build, test, and deployment pipelines, ideally for ML or data workloads. • Solid Python/Go skills and comfort supporting PyTorch-based production systems (deploying, serving, and operating them, not necessarily authoring the models). • An operations and security mindset. You understand infrastructure security, IAM, secrets management, and operational risk, and you build with secure, reliable defaults. • Clear communication and collaboration. You work well cross-functionally, can mentor and provide technical guidance, and are comfortable making pragmatic decisions in ambiguous problem spaces.

🏖️ Benefits

• Unlimited PTO. Unplug and rejuvenate, however you want—whether that’s vacationing on the beach or at home on a mental-health day. • $1,000 Learning & Development Fund. No matter where you are in your career, always invest in your future. We encourage you to attend conferences, take classes, and lead workshops. We also host hackathons, brunch & learns, and other employee-led learning opportunities. • Remote-First Environment. People thrive in a flexible and supportive environment that best invigorates them. You can work from your home, cafe, or hotel. You decide. • Health Care and Financial Planning. We offer a comprehensive medical, dental, and vision insurance package for you and your family. We also offer a 401(k) for you to contribute. • Stay active your way! Get $138/month to put toward your favorite gym or fitness membership — wherever you like to work out. Prefer to exercise at home? You can also use up to $1,650 per year through our Fitness Fund to purchase workout equipment, gear, or other wellness essentials. • New Family Bonding. Primary caregivers can take 16 weeks off 100% paid leave, while secondary caregivers can take 4 weeks. Returning to work after bringing home a new child isn’t easy, which is why we’re flexible and empathetic to the needs of new parents.

Apply Now

Similar Jobs

🔥 4 hours ago

Motional

1001 - 5000

🚗 Transport

🤖 Artificial Intelligence

Machine Learning Engineer at Motional developing ML-powered tools for data mining in autonomous driving. Collaborate across teams using state-of-the-art models on multimodal sensor data.

🕒 2 days ago

OneStudyTeam

201 - 500

⚕️ Healthcare Insurance

🧬 Biotechnology

💊 Pharmaceuticals

Senior Machine Learning Engineer at OneStudyTeam developing AI solutions to accelerate clinical trials and improve patient outcomes. Collaborate with multidisciplinary teams leveraging cutting-edge technology.

🕒 3 days ago

Qohash

51 - 200

🔒 Cybersecurity

💳 Fintech

Senior Machine Learning Developer at Qohash building and deploying AI models for data security. Optimize models for on-device inference and own them in production environments.

🗣️🇫🇷 French Required

🕒 3 days ago

Mozilla

501 - 1000

👥 B2C

🔒 Cybersecurity

Senior Machine Learning Engineer developing AI-driven features for Firefox. Focusing on applied AI and LLM-based applications to enhance browser experiences.

🕒 3 days ago

Mercury

201 - 500

💳 Fintech

💸 Finance

☁️ SaaS

Senior Machine Learning Operations Engineer building real-time inference systems for risk decisioning at Mercury. Leading deployment infrastructures and ensuring high availability for ML models in production.