Staff ML Ops Engineer

🕒 February 4

Albert Invent

51 - 200 employees

Founded 2022

🤖 Artificial Intelligence

🧬 Biotechnology

🔬 Science

💰 Seed Round on 2023-06

Albert Invent is an end-to-end platform that digitalizes synthesis, formulation, and materials science for the age of AI. It combines capabilities like inventory management, Electronic Lab Notebooks (ELN), Lab Information Management Systems (LIMS), and regulatory intelligence to streamline research and development processes. Trusted by thousands of chemists across 36 countries, Albert Invent enhances productivity and accelerates innovation in chemical research through the use of advanced AI and machine learning technologies.

📋 Description

• Design, deploy, and maintain Kubernetes infrastructure supporting AI/ML workloads
• Manage containerized services, autoscaling, networking, and resource optimization
• Design and build high-performance Python APIs and services using FastAPI or similar frameworks
• Architect backend systems for scalability, reliability, and low latency
• Build integrations between AI/ML systems and the broader Albert platform
• Build and operate distributed systems that handle compute-intensive and high-throughput workloads
• Design for fault tolerance, graceful degradation, and horizontal scalability
• Implement async workflows, job queues, and task orchestration as needed
• Architect and maintain data pipelines and storage systems supporting AI/ML workflows
• Implement observability including logging, metrics, tracing, and alerting
• Own system reliability: troubleshoot issues, conduct post-mortems, and continuously improve
• Design CI/CD pipelines and promote automation best practices
• Partner closely with ML engineers to understand requirements and deliver production-ready infrastructure
• Translate ML prototypes and research code into scalable, maintainable systems

🎯 Requirements

• A degree in Computer Science or a related field with 7+ years of industry experience (Bachelor's) or 5+ years (Master's or PhD) in software engineering
• Experience supporting AI/ML teams or deploying ML systems in production
• Experience with GPU workloads and scheduling
• Advanced proficiency in Python including async programming and performance optimization
• Deep experience with Kubernetes: cluster management, networking, autoscaling, and troubleshooting
• Strong background in distributed systems and microservices architecture
• Experience with cloud platforms (AWS, GCP, or Azure) and infrastructure-as-code
• Proficiency in REST API development using FastAPI, Flask, or similar
• Experience with containerization and CI/CD pipelines
• Track record of operating production systems at scale

🏖️ Benefits

• Health insurance
• Flexible working hours
• Professional development opportunities

Similar Jobs

🕒 January 29

BJAK

51 - 200

🛍️ eCommerce

🏪 Marketplace

Staff Machine Learning Engineer responsible for building production-grade ML systems in the AI field. Collaborate with engineering teams to enhance ML capabilities for A1's proactive AI system operation.

🕒 January 23

Escape Velocity Entertainment

51 - 200

🎮 Gaming

AI/Machine Learning Director responsible for AI/ML research and development at Escape Velocity Entertainment. Collaborating with teams to drive innovative solutions and unlock new experiences.

Python

PyTorch

Tensorflow

🕒 January 23

BJAK

51 - 200

🛍️ eCommerce

🏪 Marketplace

Machine Learning Engineer focused on building and improving ML components. Collaborating within a small, motivated team to enhance AI systems under production constraints.

🕒 January 20

Hightouch

51 - 200

☁️ SaaS

Engineering leader at Hightouch driving machine learning efforts and overseeing product development for AI marketing solutions. Leading teams to enhance customer communication strategies utilizing data.

🕒 December 10, 2025

Airbnb

5001 - 10000

👥 B2C

🛍️ eCommerce

Staff Machine Learning Engineer at Airbnb working with ML models to enhance host and guest tools. Collaboration with data scientists and engineering teams to develop impactful solutions.

Airflow

Java

Kubernetes

Python

PyTorch

Scala

Spark

Tensorflow