Staff Product Manager, AI Evaluations

October 13

Apply Now
Logo of Dropbox

Dropbox

Cloud Storage • Enterprise • Productivity

Dropbox is a cloud-based service that provides tools for storing, sharing, and accessing files across devices. It offers features such as document sharing, video review, automatic backups, and AI-driven scheduling. Dropbox also provides solutions for different sectors like teams, sales, marketing, and education, and industries including construction, media, technology, and manufacturing. With a focus on security, Dropbox ensures files are encrypted and protected against tampering. It offers integrations with various productivity tools and is trusted by major companies for efficient file management and collaboration.

1001 - 5000 employees

Founded 2007

🏢 Enterprise

⚡ Productivity

📋 Description

• Define and drive the roadmap for Dropbox’s AI Evaluation Framework, covering both quantitative metrics and human-in-the-loop systems. • Partner with ML, data, and research teams to build evaluation pipelines that measure model accuracy, helpfulness, bias, latency, and hallucination rates. • Translate business and product goals into measurable AI performance objectives. • Develop taxonomies and methodologies for evaluating large language model (LLM) behaviors in production. • Work closely with PMs across Dash to integrate eval signals into product development loops. • Partner with infra and AI platform teams to operationalize eval tooling and dashboards. • Ensure evaluations are ethical, transparent, and aligned with Dropbox’s privacy and trust standards. • Communicate findings and drive accountability across teams for improving AI quality.

🎯 Requirements

• 10+ years of product management experience, ideally in AI/ML systems, data platforms, or experimentation. • Proven experience defining and shipping metrics-driven AI products or frameworks. • Strong understanding of model evaluation concepts such as precision/recall, win-rate testing, and user feedback loops. • Excellent analytical, problem-solving, and communication skills. • Strong cross-functional collaboration across ML, data, design, and research teams. • Comfort balancing experimentation speed with rigor and reproducibility.

Apply Now

Similar Jobs

October 10

Basis Technologies

501 - 1000

☁️ SaaS

🤖 Artificial Intelligence

Director of Product Management overseeing a team for Basis Technologies' DSP platform. Driving product strategy while fostering team growth and collaboration in the ad tech industry.

🇨🇦 Canada – Remote

💵 $150k - $257k / year

💰 $25M Private Equity Round on 2021-04

⏰ Full Time

🔴 Lead

✅ Product Manager

October 9

Tailscale

51 - 200

☁️ SaaS

🔐 Security

📡 Telecommunications

Product Manager focusing on product-led growth at Tailscale. Responsible for onboarding and activation experience for all user types.

🇨🇦 Canada – Remote

💵 $175.5k - $296.1k / year

💰 $100M Series B on 2022-05

⏰ Full Time

🟠 Senior

🔴 Lead

✅ Product Manager

October 6

Super.com

201 - 500

🛍️ eCommerce

Staff Product Manager at Super.com driving growth and engagement through the Homepage and Super+ membership. Leading cross-functional teams to enhance customer experiences and retention.

🇨🇦 Canada – Remote

💵 $143k - $216k / year

💰 $25M Debt Financing on 2023-04

⏰ Full Time

🔴 Lead

✅ Product Manager

September 30

Kraken Digital Asset Exchange

1001 - 5000

₿ Crypto

💸 Finance

💳 Fintech

Lead product strategy and roadmap for Kraken Pro trading web and mobile platforms. Own end-to-end initiatives to deliver UX, analytics, and trading features.

July 20

Elastic

1001 - 5000

🏢 Enterprise

Join Elastic to craft a world-class developer experience with OpenTelemetry instrumentation. Drive innovation in observability software development.

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com