Product Manager – Agent Evaluation Platform

Job not on LinkedIn

November 9

Apply Now
Logo of Hyperskill

Hyperskill

Education • Artificial Intelligence • SaaS

Hyperskill is reimagining education for the AI era, building intelligent, AI-driven learning systems and productivity tools that adapt to individual learners. It offers project-based courses, career paths, bootcamps, and team training focused on programming (Python, Java, Kotlin, SQL, Go, C++), AI engineering, and product-minded development, plus AI-native tools like Enlighter, Rolloo, InMind Lab, and StoryFlow to help professionals and organizations upskill and deploy practical AI solutions.

11 - 50 employees

📚 Education

🤖 Artificial Intelligence

☁️ SaaS

📋 Description

• As a Product Manager, you'll figure out how to turn agent evaluation research into a real product. • Understanding how companies currently test their AI systems, what they're missing, and how our platform can fill that gap. • Work with our engineering team to build evaluation pipelines that can assess everything from simple chatbots to complex multi-step agent workflows. • Find early customers — companies building AI agents who are frustrated with current testing methods. • Build Product Strategy from Scratch — Define how we position and sell this technology: embeddable SaaS for AI platforms, standalone evaluation service, developer tool integration, or something else. • Execute Go-to-Market — Find first customers among companies building AI agents, build sales processes, and determine pricing models for an emerging evaluation category. • Own Technical Product Vision — Work with engineering on evaluation pipelines, understand testing approaches, and translate assessment methodologies into clear business value.

🎯 Requirements

• Experience with AI-based products or features — you've built AI products or features and faced challenges evaluating their real-world quality and business impact. • You deeply understand AI agents — you've actually built AI agents, used MCPs, understand tool calls, and their security implications. • You understand evaluation methodologies — you've created evals, built testing datasets, and you distinguish between evaluation approaches. • You think in outcomes — you distinguish between 'agent completed the task' and 'agent solved the user's problem effectively'. • You thrive in ambiguity — you'll need to build a lot, figure things out on the go, experiment constantly, and handle multiple different tasks across various areas simultaneously.

🏖️ Benefits

• Contractor agreement with a US-registered legal entity. • 100% remote — work from anywhere in the world • Competitive salary in USD + equity in the product you're working on — we focus on market rates, ready to hear your expectations and prepare an offer matching your expertise • Resources — budget for tools, learning, and whatever you need to succeed • Fast-moving environment — we ship fast, learn fast, and iterate based on real customer feedback

Apply Now

Similar Jobs

October 28

GitLab

1001 - 5000

🤖 Artificial Intelligence

🏢 Enterprise

☁️ SaaS

Senior Product Manager for Secret Detection at GitLab working on security product development. Driving initiatives like Secret Push Protection to enhance secure software production.

🌏 Anywhere in the World

💵 $127.7k - $273.6k / year

💰 Secondary Market on 2020-11

⏰ Full Time

🟠 Senior

✅ Product Manager

October 20

GR8 Tech

501 - 1000

🎮 Gaming

☁️ SaaS

Middle Product Manager driving payment integrations on a B2B iGaming platform. Collaborating with teams to enhance product roadmaps, market analysis, and customer feedback implementation.

🌏 Anywhere in the World

⏰ Full Time

🟢 Junior

🟡 Mid-level

✅ Product Manager

October 16

GR8 Tech

501 - 1000

🎮 Gaming

☁️ SaaS

Product Lead driving payment integrations for GR8 Tech's B2B iGaming platform. Overseeing product strategy, collaboration, and innovation to ensure success.

🌏 Anywhere in the World

⏰ Full Time

🟠 Senior

✅ Product Manager

September 25

Alpaca

201 - 500

🔌 API

💳 Fintech

₿ Crypto

Lead roadmap for Alpaca's Broker, Trading, and Market Data APIs. Drive API design, documentation, and adoption for developer-focused brokerage platform.

🌏 Anywhere in the World

⏰ Full Time

🟠 Senior

✅ Product Manager

August 24

Transak

51 - 200

💳 Fintech

🌐 Web 3

₿ Crypto

Lead card payment orchestration and smart routing at Transak, a Web3 payments infrastructure provider.

🌏 Anywhere in the World

⏰ Full Time

🟡 Mid-level

🟠 Senior

✅ Product Manager

Developed by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com