Senior Engineer 2 – Inference Data Plane

🕒 2 months ago

☕ Washington – Remote

💵 $167,200 - $209,000 / year

⏰ Full-time

🟠 Senior

🧑‍💻 Full-Stack Developer

🦅 H1B Visa Sponsor

🗣️🇺🇸🇬🇧 English required

DigitalOcean

1001 - 5000 employees

Founded in 2011

SaaS • Cloud Computing

DigitalOcean is a cloud infrastructure provider offering a suite of products and services that let developers build, deploy, and scale applications. Its platform provides comprehensive tutorials, reference documentation, and supporting materials to help users manage their resources effectively through its API and CLI tools. With features such as Droplets (virtual machines), managed databases, Kubernetes, and a marketplace for third-party applications, DigitalOcean emphasizes simplicity and performance. It serves both individual developers and large organizations looking for cloud solutions that are easy to implement and manage.

Description

• Act as a technical leader on the team, driving the end-to-end design, development, and delivery of critical data plane components hosting large generative AI models.
• Architect and refine system design proposals for our high-scale, multi-tenant AI inference cloud ecosystem, ensuring they meet rigorous availability and resiliency standards.
• Implement and optimize distributed inference hosting using techniques like tensor/data parallelism, KV cache optimizations, and smart routing.
• Work cross-functionally with Product Managers, customer-facing teams, and other engineering teams to align technical roadmaps with customer needs.
• Coach and mentor junior engineers, fostering a culture of technical excellence and continuous improvement.
• Maintain and operate critical, high-scale services, utilizing observability tools and defining SLOs to ensure superior platform health.

🎯 Requirements

• Strong experience with microservices, messaging systems, databases, and infrastructure as code.
• Hands-on experience hosting large language or multimodal models using inference engines like vLLM, SGLang, or Modular.
• Familiarity with distributed inference serving frameworks such as llm-d, NVIDIA Dynamo, or Ray Serve.
• Understanding of GPU-level optimization and experience with interconnect technologies like NVLink, XGMI, or RoCE.
• Knowledge of common LLM architectures and optimization techniques (e.g., continuous batching, quantization).
• Expert-level proficiency in Go or Python and familiarity with gRPC.
• Proven experience shipping customer-facing software products and running critical services in a high-scale environment similar to DigitalOcean.
• Experience integrating and building with open-source software.

🏖️ Benefits

• Employee Assistance Program
• Local Employee Meetups
• Flexible time off policy
• Reimbursement for relevant conferences, training, and education
• Access to LinkedIn Learning's 10,000+ courses

