Lead Machine Learning Engineer, Inference – Performance

🔥 1 hour ago

🇺🇸 United States – Remote

💵 $159.3k - $250.1k / year

⏰ Full Time

🟠 Senior

🤖 Machine Learning Engineer

🦅 H1B Visa Sponsor

info
Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Egen

Egen

501 - 1000 employees

Founded 2000

🤖 Artificial Intelligence

Artificial Intelligence • Healthcare • Public Sector

Egen is a company that specializes in engineering innovative solutions using platforms, data, and generative AI. They focus on harnessing the potential of data to empower organizations and individuals, providing services across various sectors including communications, healthcare, and public sector solutions. Their customizable platforms address critical challenges in urban management, customer engagement, clinical care, and operational excellence, aiming to drive impactful outcomes for clients.

📋 Description

• Optimize Inference: Build and tune production LLM serving with vLLM and SGLang • Profile & Accelerate Training: Instrument and profile training runs to find bottlenecks • Engineer for the Hardware: Apply a working understanding of GPU architecture • Serve at Scale: Deploy and operate multiple models within shared GPU clusters on GKE • Drive Efficiency: Own GPU utilization as a first-class metric • Collaborate & Consult: Work directly with clients to understand performance requirements

🎯 Requirements

• Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field • 5+ years of experience in ML/AI engineering, with a meaningful portion focused on performance, infrastructure, or systems • Proven track record of deploying and optimizing models in a production environment • Demonstrated experience profiling and improving GPU utilization for training and/or inference • Experience with Classic Machine Learning (neural nets, training, tuning) is a strong plus • Knowledge of Data Engineering and SQL

🏖️ Benefits

• Comprehensive Health Insurance • Paid Leave (Vacation/PTO) • Paid Holidays • Sick Leave • Parental Leave • Bereavement Leave • 401 (k) Employer Match • Employee Referral Bonuses

Apply Now

Similar Jobs

🔥 23 hours ago

Terray Therapeutics

11 - 50

🧬 Biotechnology

🤖 Artificial Intelligence

💊 Pharmaceuticals

ML Scientist at Terray Therapeutics extending structure informed models to causal models of in-vivo molecular interactions. Collaborating with teams to improve intermolecular interactions using machine learning.

🔥 23 hours ago

Terray Therapeutics

11 - 50

🧬 Biotechnology

🤖 Artificial Intelligence

💊 Pharmaceuticals

ML Scientist at Terray Therapeutics developing systems for discovering novel chemical matter using reinforcement learning techniques. Collaborative environment with a focus on machine learning and scientific problem-solving.

🕒 Yesterday

OpenTeams

11 - 50

☁️ SaaS

🤝 B2B

Machine Learning Engineer developing deep learning models for customer behavior prediction. Collaborating with data scientists to optimize model architecture and enhance business decisions.

🇺🇸 United States – Remote

💵 $145k - $250k / year

💰 $100k Pre Seed Round on 2019-08

⏰ Full Time

🟠 Senior

🤖 Machine Learning Engineer

🕒 Yesterday

Diligent Robotics

1 - 10

⚕️ Healthcare Insurance

🤖 Artificial Intelligence

ML Engineer building and deploying manipulation systems for robots interacting in dynamic environments. Focusing on perception-to-action models, datasets, and evaluation tooling.

🕒 Yesterday

Citrin Cooperman

1001 - 5000

🤝 B2B

Senior MLOps/LLMOps Engineer at Citrin Cooperman developing automated pipelines for generative AI applications. Leading the deployment and management of AI evaluation infrastructure and monitoring.