Research Engineer – Reinforcement Learning

Job not on LinkedIn

🕒 March 27

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Prime Intellect

Prime Intellect

1 - 10 employees

🤖 Artificial Intelligence

☁️ SaaS

Artificial Intelligence • SaaS • Cloud Computing

Prime Intellect is a company focused on democratizing AI development by providing scalable and decentralized computing resources for training models. Their platform allows users to find and share global compute resources, enabling the training of state-of-the-art models through distributed clusters. They promote the collective ownership of AI innovations, including language and scientific models. Prime Intellect also offers a range of GPU options to facilitate affordable and efficient model training. They aim to advance decentralized training research and open-source AI development on a global scale.

📋 Description

• Lead and participate in novel research to build a massive scale synthetic data generation pipeline and orchestration solution • Optimize the performance, cost, and resource utilization of AI inference workloads by leveraging the most recent advances for compute & memory optimization techniques. • Contribute to the development of our open-source libraries and frameworks for synthetic data generation and distributed RL frameworks. • Publish research in top-tier AI conferences such as ICML & NeurIPS. • Distill highly technical project outcomes in layman approachable technical blogs to our customers and developers. • Stay up-to-date with the latest advancements in AI/ML infrastructure and tools, synthetic data gen research and proactively identify opportunities to enhance our platform's capabilities and user experience.

🎯 Requirements

• Strong background in AI/ML engineering, with extensive experience in designing and implementing end-to-end pipelines for the inference or training of large-scale AI models. • Deep expertise in distributed inference techniques and frameworks (e.g. vllm, sglang) for optimizing the performance and scalability of AI workloads. • Solid understanding of MLOps best practices, including model versioning, experiment tracking, and continuous integration/deployment (CI/CD) pipelines. • Passion for advancing the state-of-the-art in reasoning and democratizing access to AI capabilities for researchers, developers, and businesses worldwide. • If you're not familiar with these, but feel like that you can contribute to our mission and you're a high-energy person, get familiar with these resources (here, here and here) and please reach out!

🏖️ Benefits

• Competitive compensation, including equity incentives, aligning your success with the growth and impact of Prime Intellect. • Flexible work arrangements, with the option to work remotely or in-person at our offices in San Francisco. • Visa sponsorship and relocation assistance for international candidates. • Quarterly team off-sites, hackathons, conferences and learning opportunities. • Opportunity to work with a talented, hard-working and mission-driven team, united by a shared passion for leveraging technology to accelerate science and AI.

Apply Now

Similar Jobs

🕒 March 27

Mercury Insurance

5001 - 10000

💸 Finance

👥 B2C

Senior Configuration Management Database Engineer at Mercury Insurance ensuring CMDB data accuracy and leading automated discovery capabilities across IT environments.

🕒 March 25

murmuration

11 - 50

🌍 Social Impact

🤝 Non-profit

📚 Education

Senior Research Engineer driving complex projects across research and data science enablement systems. Collaborating to build systems that enhance civic participation in America.

🕒 March 19

Hungryroot

51 - 200

🛍️ eCommerce

🧘 Wellness

👥 B2C

Senior Operations Research Engineer at Hungryroot developing algorithms for grocery personalization, recipe selection, and supply chain forecasting. Collaborating with data scientists on machine learning projects.

🕒 March 14

Deepgram

51 - 200

🤖 Artificial Intelligence

☁️ SaaS

🔌 API

Machine Learning Engineer at Deepgram prototyping novel modeling ideas to build scalable speech technologies. Collaborate with researchers to develop world-class speech recognition and synthesis systems.

🕒 February 28

Rwazi

11 - 50

🤖 Artificial Intelligence

☁️ SaaS

🤝 B2B

Research Engineer building technical infrastructure for decision research. Developing prototypes and evaluation tooling in a remote flexible environment.

🇺🇸 United States – Remote

🔥 Funding within the last year

💰 $12M Series A - Rwazi on 2025-07

⏰ Full Time

🟡 Mid-level

🟠 Senior

📚 Research Engineer