Post a Job Affiliates

Search Remote Jobs

AssemblyAI

Website LinkedIn All Job Openings

Progress in AI is moving at an unprecedented pace. At AssemblyAI, we keep our pulse on the latest developments and breakthroughs in AI research and use these advances to inform our production-ready AI models. Developers and product teams use our API to access state-of-the-art AI models to transcribe and understand speech, and build AI-powered features faster.

51 - 200 employees

💰 $30M Series B on 2022-07

Senior Research Engineer

September 27

🇺🇸 United States – Remote

💵 $240k - $275k / year

⏰ Full Time

🟠 Senior

📚 Research Engineer

AWS

Python

PyTorch

Rust

Apply Now

AssemblyAI

Website LinkedIn All Job Openings

51 - 200 employees

💰 $30M Series B on 2022-07

📋 Description

• Investigate and mitigate performance bottlenecks in large-scale distributed training and inference systems. • Develop and implement low-level (operator/kernel) and high-level (system/architecture) optimization strategies. • Translate research models and prototypes into highly optimized, production-ready inference systems. • Explore and integrate inference compilers such as TensorRT, ONNX Runtime, AWS Neuron and Inferentia. • Design, test, and deploy scalable solutions for parallel and distributed workloads on heterogeneous hardware. • Facilitate knowledge transfer and bidirectional support between Research and Engineering teams, ensuring alignment of priorities and solutions. • Collaborate closely with Research and Engineering teams to bridge research and production engineering.

🎯 Requirements

• Strong expertise in the Python ecosystem and major ML frameworks (PyTorch, JAX). • Experience with lower-level programming (C++ or Rust preferred). • Deep understanding of GPU acceleration (CUDA, profiling, kernel-level optimization); TPU experience is a strong plus. • Proven ability to accelerate deep learning workloads using compiler frameworks, graph optimizations, and parallelization strategies. • Deep understanding of modern deep learning systems, including layer-level optimization, large-scale distributed training, streaming, low-latency and asynchronous inference, inference compilers, and advanced parallelization techniques. • Solid understanding of the deep learning lifecycle: model design, large-scale training, data processing pipelines, and inference deployment. • Strong debugging, profiling, and optimization skills in large-scale distributed environments. • Excellent communication and collaboration skills, with the ability to clearly prioritize and articulate impact-driven technical solutions. • Experience with inference compilers such as TensorRT, ONNX Runtime, AWS Neuron, Inferentia, or similar technologies.

🏖️ Benefits

• Fully remote team • Competitive salary range: $240,000 - $275,000 • Compensation, benefit, and other reward opportunities • Commitment to pay equity • Inclusive, equal opportunity workplace

Apply Now

Similar Jobs

Research Engineer

September 13

Chainlink Labs

201 - 500

💸 Finance

💳 Fintech

🌐 Web 3

Website LinkedIn All Job Openings

Research Engineer transitioning oracle research into production for Chainlink Labs, collaborating with cryptography, mechanism-design, and distributed-systems experts.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

📚 Research Engineer

Cloud

Distributed Systems

Linux

Oracle

Python

Rust

Solidity

Swift

TypeScript

Unix

Web3

Apply

View Job

Machine Learning Research Engineer

August 13

Roboflow

11 - 50

🤖 Artificial Intelligence

Website LinkedIn All Job Openings

Join Roboflow to push state-of-the-art computer vision.\nBuild scalable models for a global developer base; contribute to open source.

🇺🇸 United States – Remote

💵 $200k - $275k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

📚 Research Engineer

Open Source

PyTorch

Apply

View Job

Research Engineer - Decentralized AI Systems

August 9

Yotta Labs

1 - 10

🤖 Artificial Intelligence

☁️ SaaS

Website LinkedIn All Job Openings

Yotta Labs seeks a Research Engineer to optimize AI workloads on a decentralized framework. Join us in pioneering decentralized AI infrastructure and workload orchestration.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

📚 Research Engineer

Cloud

Distributed Systems

Python

PyTorch

Ray

Apply

View Job