Research Intern – Applied Reinforcement Learning

🔥 0 minutes ago

🏄 California, Washington – Remote

💵 $35 - $45 / hour

👨‍🎓 Internship

⚪️ Entry-level

🦅 H1B Visa Sponsor

Thermo Fisher Scientific

10,000+ employees

⚕️ Healthcare Insurance

🧬 Biotechnology

💊 Pharmaceuticals

Thermo Fisher Scientific is a leading global supplier of scientific instrumentation, reagents and consumables, and software services. They support the life sciences, healthcare, and analytical chemistry sectors by providing robust solutions for laboratory research and production processes. Their innovative products and services encompass a range of applications, including diagnostics, lab workflow automation, and drug discovery.

📋 Description

• Design and evaluate reinforcement learning (RL) systems for agentic AI workflows
• Develop RL environments, reward models, and post-training pipelines for LLM-based agents
• Create end-to-end RL pipelines for agentic systems (simulation → training → evaluation)
• Align LLM-based agents using RLHF, DPO, PPO, and emerging methods
• Design reward functions, verifiers, and evaluation frameworks
• Build simulation environments (digital twins) for enterprise workflows
• Ensure scalable training and inference for RL-based systems
• Document experiments, ablations, and findings for research and productionization

🎯 Requirements

• PhD candidate in CS, ML, or related field with research in reinforcement learning or agentic AI
• Strong Python and PyTorch skills with GPU-based training experience
• Solid understanding of RL fundamentals (MDPs, policy gradients, value methods)
• Experience with LLMs and post-training techniques (RLHF, DPO, PPO, etc.)
• Strong experimentation practices (ablation, reproducibility, clear reporting)
• Experience with RL environments (Gymnasium, RLlib, Stable Baselines) (preferred)
• Research in offline RL, model-based RL, or hierarchical RL (preferred)
• Publications at top ML conferences (NeurIPS, ICML, ICLR, ACL) (preferred)
• Experience with simulation, synthetic data, or multi-agent systems (preferred)
• Distributed training and large-scale experimentation (preferred)

🏖️ Benefits

• Competitive stipend
• Mentorship from researchers and engineers
• Access to modern GPU infrastructure
• Opportunities to publish and present research
