Forward Deployed Engineer, AI Inference, vLLM, Kubernetes

🕒 May 20

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Red Hat

Red Hat

10,000+ employees

Founded 1993

🏢 Enterprise

💰 Corporate Round on 1999-03

Enterprise • Cloud

Red Hat is a leading provider of enterprise open source software solutions, helping companies worldwide to build and deploy applications across hybrid cloud infrastructures. With a strong focus on developing secure, stable, and innovative technologies, Red Hat offers a broad portfolio including products like Red Hat Enterprise Linux, Red Hat OpenShift, and Red Hat Ansible Automation Platform. These products support IT services on any infrastructure efficiently. Trusted by more than 90% of the U. S. Fortune 500, Red Hat empowers organizations to modernize their IT environments, leveraging open source communities to drive technological advancement.

📋 Description

• Orchestrate Distributed Inference: Deploy and configure LLM-D and vLLM on Kubernetes clusters. • Optimize for Production: Run performance benchmarks and tune vLLM parameters. • Code Side-by-Side: Work directly with customer engineers to write production-quality code. • Solve the "Unsolvable": Debug complex interactions between model architectures and hardware accelerators. • Feedback Loop: Channel field learnings back to product development.

🎯 Requirements

• 8+ Years of Engineering Experience • Customer Fluency • Bias for Action • Deep Kubernetes Expertise • AI Inference Proficiency • Systems Programming: Proficiency in Python and Go • Infrastructure as Code: Experience with Helm, Terraform, or similar tools • Cloud & GPU Hardware Fluency • Experience contributing to open-source AI infrastructure projects is a plus • Knowledge of Envoy Proxy or Inference Gateway (IGW) is a plus

🏖️ Benefits

• Comprehensive medical, dental, and vision coverage • Flexible Spending Account - healthcare and dependent care • Health Savings Account - high deductible medical plan • Retirement 401(k) with employer match • Paid time off and holidays • Paid parental leave plans for all new parents • Leave benefits including disability, paid family medical leave, and paid military leave • Additional benefits including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, and employee assistance program

Apply Now

Similar Jobs

🕒 May 19

Apogee Therapeutics

51 - 200

🧬 Biotechnology

💊 Pharmaceuticals

Director of AI Enablement at Apogee Therapeutics shaping and executing AI strategy across the enterprise. Leading the adoption and governance of advanced AI capabilities in a dynamic and clinical-focused environment.

🇺🇸 United States – Remote

💵 $225k - $250k / year

💰 $149M Series B on 2022-12

⏰ Full Time

🔴 Lead

🤖 Artificial Intelligence

🕒 May 19

GN Group

5001 - 10000

📡 Telecommunications

🤖 Artificial Intelligence

AI Alliance Manager responsible for developing strategic AI use cases and managing technology partnerships. Collaborating with internal teams to enhance product offerings and commercial strategies.

🇺🇸 United States – Remote

💵 $120k - $160k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

🤖 Artificial Intelligence

🕒 May 19

Personified

11 - 50

🔒 Cybersecurity

📋 Compliance

Director of AI at Personified leading AI strategy and development for a managed IT and cybersecurity service provider. Shaping internal capabilities and external offerings through innovative use of AI.

🇺🇸 United States – Remote

💵 $130k - $155k / year

⏰ Full Time

🔴 Lead

🤖 Artificial Intelligence

🕒 May 19

Salesforce

10,000+ employees

☁️ SaaS

🤝 B2B

🤖 Artificial Intelligence

Technical Builder for Salesforce's AI solutions, focused on building implementations and reusable assets for customers. Collaborating with partner teams to drive AI-driven solutions and enhance customer confidence.

🇺🇸 United States – Remote

💵 $150.1k - $227k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

🤖 Artificial Intelligence

🦅 H1B Visa Sponsor

info

🕒 May 19

ONE

201 - 500

💳 Fintech

Forward Deployed Engineer developing automation workflows for OnePay’s financial services platform. Collaborating with cross-functional teams to enhance operations using AI-driven technical solutions.

🇺🇸 United States – Remote

💵 $170k - $210k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

🤖 Artificial Intelligence

🦅 H1B Visa Sponsor

info