
Progress in AI is moving at an unprecedented pace. At AssemblyAI, we keep our finger on the pulse of the latest developments and breakthroughs in AI research and use these advances to inform our production-ready AI models. Developers and product teams use our API to access state-of-the-art AI models to transcribe and understand speech, and build AI-powered features faster.
51 - 200 employees
💰 $30M Series B on 2022-07
September 27
🇺🇸 United States – Remote
💵 $240k - $275k / year
⏰ Full Time
🟠 Senior
Research Engineer

• Investigate and mitigate performance bottlenecks in large-scale distributed training and inference systems.
• Develop and implement low-level (operator/kernel) and high-level (system/architecture) optimization strategies.
• Translate research models and prototypes into highly optimized, production-ready inference systems.
• Explore and integrate inference compilers such as TensorRT, ONNX Runtime, AWS Neuron and Inferentia.
• Design, test, and deploy scalable solutions for parallel and distributed workloads on heterogeneous hardware.
• Facilitate knowledge transfer and bidirectional support between Research and Engineering teams, ensuring alignment of priorities and solutions.
• Collaborate closely with Research and Engineering teams to bridge research and production engineering.

• Strong expertise in the Python ecosystem and major ML frameworks (PyTorch, JAX).
• Experience with lower-level programming (C++ or Rust preferred).
• Deep understanding of GPU acceleration (CUDA, profiling, kernel-level optimization); TPU experience is a strong plus.
• Proven ability to accelerate deep learning workloads using compiler frameworks, graph optimizations, and parallelization strategies.
• Deep understanding of modern deep learning systems, including layer-level optimization, large-scale distributed training, streaming, low-latency and asynchronous inference, inference compilers, and advanced parallelization techniques.
• Solid understanding of the deep learning lifecycle: model design, large-scale training, data processing pipelines, and inference deployment.
• Strong debugging, profiling, and optimization skills in large-scale distributed environments.
• Excellent communication and collaboration skills, with the ability to clearly prioritize and articulate impact-driven technical solutions.
• Experience with inference compilers such as TensorRT, ONNX Runtime, AWS Neuron, Inferentia, or similar technologies.

• Fully remote team
• Competitive salary range: $240,000 - $275,000
• Compensation, benefits, and other reward opportunities
• Commitment to pay equity
• Inclusive, equal opportunity workplace
September 13
Research Engineer transitioning oracle research into production for Chainlink Labs, collaborating with cryptography, mechanism-design, and distributed-systems experts.
August 13
Join Roboflow to push state-of-the-art computer vision. Build scalable models for a global developer base; contribute to open source.
🇺🇸 United States – Remote
💵 $200k - $275k / year
⏰ Full Time
🟡 Mid-level
🟠 Senior
Research Engineer
August 9
Yotta Labs seeks a Research Engineer to optimize AI workloads on a decentralized framework. Join us in pioneering decentralized AI infrastructure and workload orchestration.
July 16
Join Pareto.AI as a Research Engineer, working on cutting-edge AI research applications remotely.
🇺🇸 United States – Remote
⏰ Full Time
🟡 Mid-level
🟠 Senior
Research Engineer
🦅 H1B Visa Sponsor
May 22
Join Helm.ai to enhance AI for autonomous driving and robotics through unsupervised learning.
🇺🇸 United States – Remote
💵 $150k - $250k / year
⏰ Full Time
🟡 Mid-level
🟠 Senior
Research Engineer
🦅 H1B Visa Sponsor