Principal Performance Engineer – Lead

🕒 il y a 2 mois

🍂 Massachusetts – Distant

info

💵 $169 300 - $304 700 / an

⏰ Temps Plein

🟠 Senior

👷🏻‍♀️ Ingénieur

🦅 Parrain de Visa H1B

info

🗣️🇺🇸🇬🇧 Anglais requis

Postuler Maintenant
Trouver des Emplois à Distance Similaires

📊 Vérifiez votre score de CV pour ce poste

Améliorez vos chances d'obtenir un entretien en vérifiant votre score de CV avant de postuler.

Logo of Akamai Technologies

Akamai Technologies

5001 - 10000 employés

🔒 Cybersecurity

💰 Post-IPO Equity en 2001-07

Cloud Computing • Cybersecurity • Content Delivery

Akamai Technologies est une plateforme mondiale en périphérie et une société de services cloud qui alimente, sécurise et accélère les applications en ligne, les médias et les API. Elle propose des services de diffusion de contenu (CDN), de calcul en périphérie, d'infrastructure cloud et une offre complète de sécurité incluant la protection contre les attaques DDoS, la sécurité des applications web et des API, l'atténuation des bots, les services DNS et des solutions zero-trust pour les entreprises. Akamai sert de grandes entreprises dans les secteurs des médias, de la finance, du commerce de détail, du jeu vidéo et du secteur public pour améliorer la performance, la fiabilité et la sécurité à grande échelle.

Description

• Optimize inference performance across the Akamai Inference Cloud • Collaborate closely with hardware performance engineers to deliver end-to-end optimization • Apply and evaluate quantization, distillation, and pruning techniques to optimize model performance while preserving accuracy • Design hardware-aware model placement and scheduling strategies to match models with optimal compute resources • Implement and tune speculative decoding, KV-cache optimization, and batching strategies to improve inference throughput and latency • Build benchmarking and profiling pipelines to measure model-layer performance across architectures, hardware, and serving configurations • Mentor and guide engineers on the team through code reviews, design discussions, and technical problem-solving • Collaborate with hardware performance engineers to identify and resolve end-to-end performance bottlenecks across the inference stack

🎯 Exigences

• 12+ years of relevant experience with a Bachelor's or Master's degree in Computer Science, Machine Learning, or a related field • Possess hands-on experience optimizing LLM inference performance (quantization, speculative decoding, model compression, etc.) • Have a solid understanding of transformer architectures and how design choices impact latency, throughput, and accuracy • Possess experience with inference serving frameworks such as vLLM, TensorRT-LLM, Triton, or similar systems • Be proficient in Python and C++ with experience profiling and optimizing compute-intensive workloads • Have familiarity with hardware-aware optimization, including GPU/accelerator scheduling and memory management trade-offs

🏖️ Avantages

• healthcare • 401K savings plan • company holidays • vacation (in the form of PTO) • sick time • family friendly benefits including parental leave • employee assistance program including a focus on mental and financial wellness

Postuler Maintenant

Emplois Similaires

🕒 il y a 2 mois

The College Board

1001 - 5000

📚 Éducation

🤝 À but non lucratif

Sr. Engineer focused on Platform Threat Intelligence at College Board, translating adversary insights into measurable platform trust improvements through collaborative efforts.

🇺🇸 États-Unis – Télétravail

💵 $153 000 - $166 000 / an

⏰ Temps Plein

🟠 Senior

👷🏻‍♀️ Ingénieur

🗣️🇨🇳 Chinois requis

🗣️🇻🇳 Vietnamien requis

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 2 mois

GAI Consultants, Inc.

501 - 1000

⚡ Énergie

🚗 Transport

🏛️ Gouvernement

Transmission Project Engineer 1 in GAI's Power Delivery team, focusing on high-voltage transmission line engineering and client advisory roles.

🇺🇸 États-Unis – Télétravail

💰 Private equity en 2022-11

⏰ Temps Plein

🟠 Senior

🔴 Expert

👷🏻‍♀️ Ingénieur

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 2 mois

GE HealthCare

10 000+ employés

💊 Pharmaceutique

Senior Clinical Engineer driving clinical development of imaging products and Software as a Medical Device. Collaborating with multidisciplinary teams to ensure regulatory compliance and product success.

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 2 mois

divcon

201 - 500

🤝 B2B

⚡ Énergie

🏢 Entreprise

Lead PLC Engineer designing and developing PLC control systems for HVAC and critical systems. Providing technical guidance and mentoring to engineering teams, ensuring quality and consistency.

🇺🇸 États-Unis – Télétravail

💰 Series unknown en 2024-03

⏰ Temps Plein

🟠 Senior

👷🏻‍♀️ Ingénieur

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 2 mois

Horizon3.ai

51 - 200

Offensive Tooling Engineer designing and developing custom implant frameworks for the NodeZero proactive security platform. Focused on enhancing post-exploitation capabilities in a collaborative remote engineering team.

🇺🇸 États-Unis – Télétravail

💵 $180 000 - $240 000 / an

⏰ Temps Plein

🟠 Senior

👷🏻‍♀️ Ingénieur

🗣️🇺🇸🇬🇧 Anglais requis