Staff Research Engineer, Model Efficiency

November 8

Apply Now
Logo of Cohere

Cohere

Artificial Intelligence • Enterprise • SaaS

Cohere is a leading enterprise AI platform optimized for generative AI, search and discovery, and advanced retrieval. The company offers AI-powered applications designed to augment and elevate the global workforce, helping businesses thrive in the AI era. Cohere provides solutions such as embedding and reranking models, allowing enterprises to efficiently retrieve information and build powerful applications. The company offers flexible deployment options for enterprise-grade AI, on any cloud or on-premises, and provides extensive developer resources and support. Cohere is committed to scaling intelligence to serve humanity, making intelligence abundant, affordable, and accessible.

📋 Description

• Develop, prototype, and deploy techniques that materially improve how fast and efficiently our models run in production

🎯 Requirements

• Have a PhD in Machine Learning or a related field • Understand LLM architecture, and how to optimize LLM inference given resource constraints • Have significant experience with one or more techniques that enhance model efficiency • Strong software engineering skills • An appetite to work in a fast-paced high-ambiguity start-up environment • Publications at top-tier conferences and venues (ICLR, ACL, NeurIPS) • Passion to mentor others

🏖️ Benefits

• An open and inclusive culture and work environment • Work closely with a team on the cutting edge of AI research • Weekly lunch stipend, in-office lunches & snacks • Full health and dental benefits, including a separate budget to take care of your mental health • 100% Parental Leave top-up for up to 6 months • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend • 6 weeks of vacation (30 working days!)

Apply Now

Similar Jobs

October 23

Seeking a Staff Research Engineer to advance AI technologies for music at Spotify. Collaborate closely with research scientists and improve generative music experiences for fans and artists.

AWS

Azure

Cloud

Google Cloud Platform

PyTorch

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com