
11 - 50 employees
🤖 Artificial Intelligence
🏢 Enterprise
☁️ SaaS
Artificial Intelligence • Enterprise • SaaS
Cohere is a leading enterprise AI platform optimized for generative AI, search and discovery, and advanced retrieval. The company offers AI-powered applications designed to augment and elevate the global workforce, helping businesses thrive in the AI era. Cohere provides solutions such as embedding and reranking models, allowing enterprises to efficiently retrieve information and build powerful applications. The company offers flexible deployment options for enterprise-grade AI, on any cloud or on-premises, and provides extensive developer resources and support. Cohere is committed to scaling intelligence to serve humanity, making intelligence abundant, affordable, and accessible.
🕒 March 18
Improve your chances of getting an interview by checking your resume score before you apply.

11 - 50 employees
🤖 Artificial Intelligence
🏢 Enterprise
☁️ SaaS
Artificial Intelligence • Enterprise • SaaS
Cohere is a leading enterprise AI platform optimized for generative AI, search and discovery, and advanced retrieval. The company offers AI-powered applications designed to augment and elevate the global workforce, helping businesses thrive in the AI era. Cohere provides solutions such as embedding and reranking models, allowing enterprises to efficiently retrieve information and build powerful applications. The company offers flexible deployment options for enterprise-grade AI, on any cloud or on-premises, and provides extensive developer resources and support. Cohere is committed to scaling intelligence to serve humanity, making intelligence abundant, affordable, and accessible.
• Work across the inference stack to improve core performance metrics • Dive deep into model execution • Identify bottlenecks and develop innovative optimizations • Collaborate closely with modeling and systems teams • Experiment, measure, and ship improvements that accelerate inference • Build expertise in advanced performance techniques, including GPU/CUDA optimizations, kernel-level improvements, and model execution strategies for MoE and large-scale architectures
• 5+ years of experience writing high-performance, production-quality code • Strong programming skills in C++ or Python (Rust/Go also welcome) • Experience working with large language models and familiarity with the LLM inference ecosystem (e.g., vLLM, SGLang, etc.) • Ability to diagnose and resolve performance bottlenecks across the model execution stack • A strong bias for action — you ship fast, measure impact, and iterate • It’s a big plus if you have experience with GPU programming, CUDA, or low-level systems optimization • Language modeling with transformers (MoE, speculative decoding, KV-cache optimizations) • Scaling performance-critical distributed systems (e.g., computation, search, storage)
• An open and inclusive culture and work environment • Work closely with a team on the cutting edge of AI research • Weekly lunch stipend, in-office lunches & snacks • Full health and dental benefits, including a separate budget to take care of your mental health • 100% Parental Leave top-up for up to 6 months • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend • 6 weeks of vacation (30 working days!)
Apply Now🕒 March 18
Full-stack scientist pioneering quantitative research efforts at Udio. Building at the intersection of research, engineering, and product with proprietary datasets.
🕒 March 18
Member of Technical Staff (ML) developing and evaluating deep learning models for Reka's AI applications. Collaborating with a global team to translate research into practical solutions.
PyTorch
🕒 March 18
Member of Technical Staff building robust streaming data infrastructure for Anchorage Digital's crypto platform. Collaborating with cross-functional teams to optimize and maintain high-quality data outputs.
🇺🇸 United States – Remote
💰 $350M Series D on 2021-12
⏰ Full Time
🔴 Lead
🖥 Software Engineer
🦅 H1B Visa Sponsor
Cloud
Python
Go
🕒 March 18
SAP ABAP Developer with over 12 years of experience in SAP ECC & S/4 HANA development. Requires strong knowledge in ABAP, REST APIs, and system integration.
Cloud
SOAP
🕒 March 18
Director of Implementation & Professional Services leading customer onboarding and integration for Zus’s healthcare data platform. Focused on maximizing value from customer interactions and technical deliveries.
🇺🇸 United States – Remote
💵 $150k - $200k / year
💰 $40M Series A on 2023-03
⏰ Full Time
🔴 Lead
🖥 Software Engineer
ETL