Staff Research Scientist – Reinforcement Learning

10,000+ employees

🏥 Healthcare

💼 Consulting

📦 Logistics

Thermo Fisher Scientific is a leading global supplier of scientific instrumentation, reagents and consumables, and software services. They support the life sciences, healthcare, and analytical chemistry sectors by providing robust solutions for laboratory research and production processes. Their innovative products and services encompass a range of applications, including diagnostics, lab workflow automation, and drug discovery.

🕒 July 3

🏄 California – Remote

💵 $200k - $250k / year

⏰ Full Time

🔴 Lead

🧬 Research Scientist

🦅 H1B Visa Sponsor

📋 Description

• Design simulation environments and digital twins for enterprise workflows
• Post-train LLM agents using RLHF, DPO, GRPO, PPO, and emerging methods
• Build pipelines that convert human-labeled traces and verifiable signals into training data
• Architect multi-turn, tool-using agents with closed learning loops
• Design reward functions and verifiers that resist reward hacking and reflect real task outcomes
• Set the technical bar across the team: architecture, code review, engineering standards
• Mentor researchers and engineers; drive technical direction through influence
• Translate research into production; contribute to publications

🎯 Requirements

• 7+ years in ML/AI research or engineering; 3+ years at senior/staff level
• MS or PhD in Computer Science, Machine Learning, or related field (or equivalent)
• 5+ years hands-on RL (environment design, reward engineering, policy optimization), with at least one production deployment

LLM Post-Training

• 3+ years fine-tuning LLMs with hands-on RL post-training (RLHF, DPO, GRPO, PPO)
• Expert-level implementation of RLHF pipelines, reward modeling (Bradley-Terry), DPO, and KTO
• Strong Python and software engineering skills; comfortable building production pipelines, not just notebooks
• Deep expertise in MDPs, policy gradient methods (PPO, SAC), and temporal difference learning
• Working knowledge of modern post-training and rollout-serving libraries (TRL, veRL, OpenRLHF, SkyRL)

🏖️ Benefits

• Health insurance
• 401(k) matching
• Flexible work hours
• Paid time off
• Remote work options
