Principal Applied Scientist, Agentic AI

🕒 March 24

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Zillow

Zillow

5001 - 10000 employees

Founded 2006

🏠 Real Estate

🛍️ eCommerce

👥 B2C

💰 $4.1M Post-IPO Equity on 2012-12

Real Estate • eCommerce • B2C

Zillow is a leading real estate and property rental marketplace that provides comprehensive information on homes, apartments, and properties for sale or rent. It offers users tools to search for properties, calculate mortgage rates, and connect with real estate agents. The platform also features innovative algorithms that provide Zestimates, which are estimated market values of homes. Zillow is a go-to resource for individuals looking to buy, sell, or rent properties, as well as for agents and brokers who want to reach a wider audience.

📋 Description

• Lead the technical direction and strategy for RL post‑training of production models, partnering with other scientists, engineers, and product leaders to align models with customer and business needs. • Design and implement post‑training pipelines that combine techniques such as supervised fine‑tuning on curated demonstrations, preference modeling and pairwise ranking, and RL‑based alignment approaches like RLHF, RLAIF, or DPO for multi‑objective optimization. • Develop reward models and objective formulations that balance constraints such as helpfulness, safety, fairness, compliance, and customer satisfaction, and iterate on them using human and AI feedback at scale through online and batch adaptation loops with strong guardrails. • Translate conversational logs, behavioral signals, and structured attributes into training, reward, and evaluation signals for post‑training and reinforcement learning, turning heterogeneous data into actionable supervision. • Partner with model and platform teams to improve the efficiency and robustness of training and evaluation, including off‑policy evaluation, replay strategies, controlled rollouts, and metrics and evaluation frameworks such as win‑rates versus baselines, safety and quality metrics, and expert‑review programs. • Mentor applied scientists and engineers, raising the technical bar in RL, post‑training, and evaluation, and contributing to the broader AI roadmap at Zillow through thought leadership and guidance. • When appropriate, represent Zillow’s work externally through talks, publications, or open‑source contributions.

🎯 Requirements

• You have a PhD or equivalent experience in Computer Science, Electrical Engineering, Statistics, or a related field, with emphasis in areas such as reinforcement learning, bandits, large language models, or applied machine learning. • You have strong, current expertise in post‑training techniques (such as supervised fine‑tuning, DPO, RLHF/RLAIF, preference modeling, and multi‑objective optimization), in evaluation and monitoring of aligned models (including win‑rate experiments, human and AI feedback loops, long‑horizon evaluation, and safety or guardrail metrics), and in modern transformer‑based models and tooling such as LLMs, multimodal models, vector search, and orchestration frameworks. • You have experience working with cross‑functional partners (for example, engineering, product, design, operations, legal, and compliance) in domains where safety, trust, or regulation matter, such as marketplaces, finance, healthcare, or other high‑stakes verticals. • You demonstrate technical leadership and mentorship, helping senior engineers and scientists grow, creating clarity amid ambiguity, and driving alignment across teams, and you communicate complex technical ideas clearly to both expert and non‑expert audiences in writing and verbally.

🏖️ Benefits

• In addition to a competitive base salary this position is also eligible for equity awards based on factors such as experience, performance and location.

Apply Now

Similar Jobs

🕒 March 20

Lime

501 - 1000

🚗 Transport

🛍️ eCommerce

☁️ SaaS

Staff Applied Scientist at Lime leading technical strategy for vehicle perception. Tackling complex Computer Vision and Machine Learning challenges in micromobility applications with a global impact.

🕒 February 7

MapLight Therapeutics, Inc.

11 - 50

🧬 Biotechnology

⚕️ Healthcare Insurance

💊 Pharmaceuticals

Principal Scientist leading analytical development for drug substances in a biotech company. Collaborates with cross-functional teams and ensures regulatory compliance in drug development processes.

🕒 February 4

Cambium Learning Group

501 - 1000

📚 Education

🤖 Artificial Intelligence

Principal Scientist driving innovation in machine learning for educational technology. Supporting automated scoring and developing new applications in natural language processing.

🕒 December 19, 2025

Canva

1001 - 5000

☁️ SaaS

📱 Media

📚 Education

Staff Research Scientist shaping the future of AI at Canva by leading foundational model development and executing high-impact research initiatives. Collaborating with teams to translate research into scalable impact.

🕒 October 23, 2025

Spotify

5001 - 10000

📱 Media

👥 B2C

🛍️ eCommerce

Staff Research Scientist developing generative music technologies for artist-first experiences at Spotify. Pioneering research and collaborating with cross-functional teams for innovative solutions.