Research Scientist, LLM Evaluation – Post-Training

Job not on LinkedIn

🕒 2 days ago

🏄 California, Washington – Remote

info

💵 $150k - $300k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧬 Research Scientist

🦅 H1B Visa Sponsor

info
Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Thermo Fisher Scientific

Thermo Fisher Scientific

10,000+ employees

⚕️ Healthcare Insurance

🧬 Biotechnology

💊 Pharmaceuticals

Healthcare Insurance • Biotechnology • Pharmaceuticals

Thermo Fisher Scientific is a leading global supplier of scientific instrumentation, reagents and consumables, and software services. They support the life sciences, healthcare, and analytical chemistry sectors by providing robust solutions for laboratory research and production processes. Their innovative products and services encompass a range of applications, including diagnostics, lab workflow automation, and drug discovery.

📋 Description

• Define and execute a rigorous research agenda focused on LLM evaluation and post-training, with emphasis on evaluation-driven model improvement • Design experiments to study how evaluation methodologies impact fine-tuning and post-training outcomes • Develop and validate comprehensive evaluation frameworks for LLM and multimodal systems • Lead research on frontier evaluation domains including long-context, cross-modal, and dynamic multi-turn evaluations • Analyze model behavior and failure patterns; generate actionable recommendations for model improvement • Partner with Language Data Scientists to integrate human-in-the-loop and synthetic data/evaluation strategies

🎯 Requirements

• MS or PhD in Computer Science, Machine Learning, Statistics, Applied Mathematics, AI, or a related quantitative field (PhD strongly preferred) • 5+ years of relevant experience in applied ML research or research science, with substantial work in LLMs or foundation models (graduate research counts) • Demonstrated experience with LLM evaluation, benchmarking, alignment, post-training, or model quality research • Strong foundation in experimental design, statistical analysis, and scientific reasoning for ML systems • Strong Python coding skills for research experimentation, data processing, evaluation pipelines, statistical analysis, and visualization • Hands-on experience with modern ML frameworks (PyTorch, Hugging Face, JAX/TensorFlow)

🏖️ Benefits

• Remote work options • Professional development opportunities

Apply Now

Similar Jobs

🕒 2 days ago

Thermo Fisher Scientific

10,000+ employees

🧬 Biotechnology

💊 Pharmaceuticals

🔬 Science

Research Scientist coordinating pregnancy safety studies for biopharmaceutical clients. Focus on generating real-world evidence throughout product lifecycle from planning to post-marketing management.

🕒 3 days ago

ŌURA

201 - 500

🧘 Wellness

Research Scientist on Health Science team translating real-world physiological data into innovative solutions for digital health. Collaborating with cross-functional teams to drive scientific insights into scalable products.

🕒 3 days ago

Wonderlic

51 - 200

👥 HR Tech

🏢 Enterprise

Senior Research Scientist merging I-O psychology and machine learning at Wonderlic. Leading jobs engine development for AI-driven job analysis and insights.

🕒 4 days ago

SandboxAQ

51 - 200

🤖 Artificial Intelligence

🔒 Cybersecurity

💊 Pharmaceuticals

Research Scientist developing ML and physics-based models for drug and materials discovery. Contributing to the next-generation structure prediction and binding affinity models for a pioneering AI solutions company.

🕒 5 days ago

American Institutes for Research

1001 - 5000

📚 Education

⚕️ Healthcare Insurance

🌍 Social Impact

Serve as a Senior Researcher at AIR leading evaluation and learning initiatives for place-based partnerships. Collaborate with stakeholders to develop frameworks and tools for community and individual level wellbeing.