Product Manager – AI Inference, Model Serving

🕒 10 days ago

🤠 Texas – Remote

⏰ Full Time

🟠 Senior

🔴 Expert

✅ Product Manager

🦅 H1B Visa Sponsor

🗣️🇺🇸🇬🇧 English required

Mirantis

501 - 1000 employees

🏢 Enterprise

☁️ SaaS

Cloud Computing • Enterprise • SaaS

Mirantis is a company specializing in container management and cloud infrastructure solutions. It offers a range of products, including Mirantis Kubernetes Engine (MKE), Mirantis OpenStack for Kubernetes (MOSK), and Mirantis Container Cloud (MCC), which provide enterprise-grade Kubernetes and container management platforms. Mirantis also develops tools for secure software supply chains, such as Mirantis Container Runtime (MCR) and Mirantis Secure Registry (MSR). As an advocate of open source technologies, Mirantis supports various projects and provides resources such as Lens Desktop, a popular Kubernetes IDE, as well as technical support for companies adopting cloud-native technologies. Its solutions serve sectors such as the public sector, financial services, and the broader technology services and SaaS industries.

Description

• Own product strategy, roadmap, and lifecycle for inference and model serving, including serverless inference, dedicated endpoints, autoscaling, routing, KV cache management, and the related observability
• Lead deep technical discovery with NeoClouds, sovereign clouds, and enterprise platform teams, and translate findings into prioritized requirements and architecture direction
• Partner with engineering on system design trade-offs across runtime integration, GPU scheduling, network, storage, and serving topology, including disaggregated serving and multi-model serving
• Define positioning grounded in measurable outcomes: latency distributions, throughput per GPU, utilization, tail reliability, and cost per token
• Drive go-to-market execution: pricing and packaging, reference architectures, sizing guides, PoC playbooks, and direct engagement with customers, analysts, and ecosystem partners

🎯 Requirements

• 7+ years in product management, technical product management, or a senior technical role owning AI/ML and inference product(s)
• Strong understanding of production AI inference, including model serving, serverless execution, dedicated endpoints, autoscaling, routing, workload placement, observability, and reliability
• Proven capability to reason about performance trade-offs across GPU, network, storage, orchestration, and runtime layers, and to translate low-level technical capability into business value such as TTFT, throughput per GPU, and TCO
• Working knowledge of modern inference runtimes (vLLM, SGLang, TensorRT-LLM, Dynamo, Triton) and the optimization patterns that matter in production: continuous batching, KV cache management, cold starts, prefill versus decode, disaggregated serving, and multi-model serving
• Credibility with engineering leaders and infrastructure operators, including comfort in production architecture reviews and technical commercial conversations with platform engineering buyers

🏖️ Benefits

• Work with an established Silicon Valley leader in the cloud infrastructure industry.
• Work with exceptionally passionate, talented, and engaging colleagues, helping Fortune 500 and Global 2000 customers implement next-generation cloud technologies.
• Be a part of cutting-edge, open-source innovation.
• Thrive in the high-energy environment of a young company where openness, collaboration, risk-taking, and continuous growth are valued.
• Professional development and training.
• Attend conferences and working groups.
• Customized workstation (macOS, Windows).
• A competitive compensation package with a strong benefits plan and stock options.
