Technical Staff Member, Model Efficiency

Trouver des Emplois à Distance Similaires

11 - 50 employés

🤖 Intelligence artificielle

🏢 Entreprise

☁️ SaaS

Artificial Intelligence • Enterprise • SaaS

Cohere est une plateforme d’IA de premier plan qui fournit aux entreprises des modèles de langage avancés et un espace de travail (workspace) intégré conçu pour l’efficacité et la sécurité. Grâce à une famille de modèles génératifs et de retrieval hautes performances, Cohere permet aux organisations de rationaliser leurs workflows, de renforcer la sécurité des données et de révéler des insights dans de nombreux secteurs grâce à des capacités multilingues. Leur priorité accordée à des solutions d’IA sur mesure garantit la protection des données critiques tout en facilitant une intégration fluide aux processus opérationnels existants.

Technical Staff Member, Model Efficiency

🕒 il y a 2 mois

🗽 New York – Distant

⏰ Temps Plein

🔴 Expert

🖥 Ingénieur Logiciel

🦅 Parrain de Visa H1B

🗣️🇺🇸🇬🇧 Anglais requis

Distributed Systems

Python

Rust

Postuler Maintenant

📊 Vérifiez votre score de CV pour ce poste

Améliorez vos chances d'obtenir un entretien en vérifiant votre score de CV avant de postuler.

Cohere

11 - 50 employés

🤖 Intelligence artificielle

🏢 Entreprise

☁️ SaaS

Artificial Intelligence • Enterprise • SaaS

Description

• Work across the inference stack to improve core performance metrics • Dive deep into model execution • Identify bottlenecks and develop innovative optimizations • Collaborate closely with modeling and systems teams • Experiment, measure, and ship improvements that accelerate inference • Build expertise in advanced performance techniques, including GPU/CUDA optimizations, kernel-level improvements, and model execution strategies for MoE and large-scale architectures

🎯 Exigences

• 5+ years of experience writing high-performance, production-quality code • Strong programming skills in C++ or Python (Rust/Go also welcome) • Experience working with large language models and familiarity with the LLM inference ecosystem (e.g., vLLM, SGLang, etc.) • Ability to diagnose and resolve performance bottlenecks across the model execution stack • A strong bias for action — you ship fast, measure impact, and iterate • It’s a big plus if you have experience with GPU programming, CUDA, or low-level systems optimization • Language modeling with transformers (MoE, speculative decoding, KV-cache optimizations) • Scaling performance-critical distributed systems (e.g., computation, search, storage)

🏖️ Avantages

• An open and inclusive culture and work environment • Work closely with a team on the cutting edge of AI research • Weekly lunch stipend, in-office lunches & snacks • Full health and dental benefits, including a separate budget to take care of your mental health • 100% Parental Leave top-up for up to 6 months • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend • 6 weeks of vacation (30 working days!)

Postuler Maintenant

Emplois Similaires

Technical Staff Member – Quantitative Research

🕒 il y a 2 mois

Andreessen Horowitz

201 - 500

💸 Finance

💳 Fintech

🏢 Entreprise

Full-stack scientist pioneering quantitative research efforts at Udio. Building at the intersection of research, engineering, and product with proprietary datasets.

🇺🇸 États-Unis – Télétravail

💵 $250 000 - $350 000 / an

⏰ Temps Plein

🔴 Expert

🖥 Ingénieur Logiciel

🗣️🇺🇸🇬🇧 Anglais requis

Member of Technical Staff, Machine Learning

🕒 il y a 2 mois

Reka AI

1 - 10

🤖 Intelligence artificielle

🏢 Entreprise

☁️ SaaS

Member of Technical Staff (ML) developing and evaluating deep learning models for Reka's AI applications. Collaborating with a global team to translate research into practical solutions.

🇺🇸 États-Unis – Télétravail

💰 €58 000 000 Series A en 2023-06

⏰ Temps Plein

🔴 Expert

🖥 Ingénieur Logiciel

🗣️🇺🇸🇬🇧 Anglais requis

PyTorch

Member of Technical Staff, Asset Data

🕒 il y a 2 mois

Anchorage Digital

201 - 500

💸 Finance

₿ Crypto

☁️ SaaS

Member of Technical Staff building robust streaming data infrastructure for Anchorage Digital's crypto platform. Collaborating with cross-functional teams to optimize and maintain high-quality data outputs.

🇺🇸 États-Unis – Télétravail

💰 €350 000 000 Series D en 2021-12

⏰ Temps Plein

🔴 Expert

🖥 Ingénieur Logiciel

🦅 Parrain de Visa H1B

🗣️🇺🇸🇬🇧 Anglais requis

Cloud

Python

SAP ABAP Developer

🕒 il y a 2 mois

Vytwo Technologies Inc

201 - 500

🤝 B2B

🏢 Entreprise

🎯 Recrutement

SAP ABAP Developer with over 12 years of experience in SAP ECC & S/4 HANA development. Requires strong knowledge in ABAP, REST APIs, and system integration.

🇺🇸 États-Unis – Télétravail

💵 $55 - $60 / heure

⏰ Temps Plein

🟠 Senior

🔴 Expert

🖥 Ingénieur Logiciel

🗣️🇺🇸🇬🇧 Anglais requis

Cloud

SOAP

Director of Engineering

🕒 il y a 3 mois

Intus Care

11 - 50

⚕️ Assurance santé

☁️ SaaS

🤖 Intelligence artificielle

Director of Engineering at Intus Care overseeing engineering teams for SaaS product development. Leading multiple engineering pods to build scalable healthcare technology solutions.

🇺🇸 États-Unis – Télétravail

💵 $170 000 - $190 000 / an

💰 €13 100 000 Venture Round en 2023-01

⏰ Temps Plein

🔴 Expert

🖥 Ingénieur Logiciel

🦅 Parrain de Visa H1B

🗣️🇺🇸🇬🇧 Anglais requis

Distributed Systems