
10,000+ employees
⚕️ Healthcare Insurance
🧬 Biotechnology
💊 Pharmaceuticals
Thermo Fisher Scientific is a leading global supplier of scientific instrumentation, reagents and consumables, and software services. They support the life sciences, healthcare, and analytical chemistry sectors by providing robust solutions for laboratory research and production processes. Their innovative products and services encompass a range of applications, including diagnostics, lab workflow automation, and drug discovery.

• Design and evaluate reinforcement learning (RL) systems for agentic AI workflows
• Develop RL environments, reward models, and post-training pipelines for LLM-based agents
• Create end-to-end RL pipelines for agentic systems (simulation → training → evaluation)
• Align LLM-based agents using RLHF, DPO, PPO, and emerging methods
• Design reward functions, verifiers, and evaluation frameworks
• Build simulation environments (digital twins) for enterprise workflows
• Ensure scalable training and inference for RL-based systems
• Document experiments, ablations, and findings for research and productionization
• PhD candidate in CS, ML, or related field with research in reinforcement learning or agentic AI
• Strong Python and PyTorch skills with GPU-based training experience
• Solid understanding of RL fundamentals (MDPs, policy gradients, value methods)
• Experience with LLMs and post-training techniques (RLHF, DPO, PPO, etc.)
• Strong experimentation practices (ablation, reproducibility, clear reporting)
• Experience with RL environments (Gymnasium, RLlib, Stable Baselines) (preferred)
• Research in offline RL, model-based RL, or hierarchical RL (preferred)
• Publications at top ML conferences (NeurIPS, ICML, ICLR, ACL) (preferred)
• Experience with simulation, synthetic data, or multi-agent systems (preferred)
• Distributed training and large-scale experimentation (preferred)
• Competitive stipend
• Mentorship from researchers and engineers
• Access to modern GPU infrastructure
• Opportunities to publish and present research
🕒 6 days ago
Digital Video Newsroom Intern producing video content for ESPN digital channels. Collaborating with teams and delivering high-quality video highlights in a fast-paced environment.
🇺🇸 United States – Remote
💵 $22 / hour
💰 Post-IPO Debt on 2020-04
👨‍🎓 Internship
⚪️ Entry-level
🦅 H1B Visa Sponsor
🕒 6 days ago
Digital Video Newsroom Intern creating engaging video content for ESPN platforms. Collaborating with teams and managing timelines in a fast-paced sports environment.
🇺🇸 United States – Remote
💵 $22 / hour
💰 Post-IPO Debt on 2020-04
👨‍🎓 Internship
⚪️ Entry-level
🦅 H1B Visa Sponsor
🕒 June 25
Digital Advertising Intern assisting with trafficking ads, writing ad copy, and analyzing performance data. Opportunity to work remotely or in Washington, D.C. with a focus on progressive causes.
🕒 June 25
Summer Internship Program with Courier Newsroom providing hands-on experience in journalism. Work on digital media initiatives engaging local communities and enhancing outreach to Gen Z.
🕒 June 25
Intern providing administrative support to the Grantmaking team at Cultural Survival. Assisting with communications, database updates, files organization, and multimedia content creation.
🗣️🇪🇸 Spanish Required