Senior Research Engineer

October 28


AssemblyAI

Progress in AI is moving at an unprecedented pace. At AssemblyAI, we keep our finger on the pulse of the latest developments and breakthroughs in AI research and use these advances to inform our production-ready AI models. Developers and product teams use our API to access state-of-the-art AI models that transcribe and understand speech, and to build AI-powered features faster.

51 - 200 employees

💰 $30M Series B in July 2022

📋 Description

• Maintain and evolve our JAX training framework, ensuring scalability and efficiency for large-scale distributed training runs
• Optimize production JAX inference systems for speech-to-text models using advanced techniques
• Refactor and modernize model architectures and infrastructure
• Investigate and resolve performance bottlenecks across the stack
• Design and deploy scalable, distributed workloads optimized for TPU and GPU architectures
• Bridge Research and Engineering teams, ensuring seamless knowledge transfer

🎯 Requirements

• Expert-level proficiency with JAX and its ecosystem (Flax, Optax, XLA compilation pipeline)
• Strong experience optimizing inference systems for production, ideally with LLMs or speech models
• Hands-on experience with TPU programming and optimization; GPU/CUDA expertise is also valuable
• Passion for refactoring and improving existing systems
• Familiarity with modern inference optimization techniques
• Domain knowledge in Speech-to-Text is a plus
• Strong Python skills; C++ or Rust experience for kernel-level work is a plus
• Deep understanding of distributed training at scale and ML infrastructure best practices
• Excellent communication skills and a collaborative mindset

🏖️ Benefits

• Committed to creating a space for all employees
• Equal opportunity to succeed
