Product Data Scientist – AI Evaluation, Quality

Job not on LinkedIn

🔥 0 minutes ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Finom

Finom

501 - 1000 employees

Founded 2019

💳 Fintech

💸 Finance

🤝 B2B

Fintech • Finance • B2B

Finom is a fintech company offering comprehensive financial services for SMEs, freelancers, and companies. The platform provides tools for invoicing, expense management, and easy online account opening with added benefits like cashback and international IBAN accounts. Finom ensures data security through partnerships with banks like SolarisBank and Treezor, making it a fast and reliable service for business financial management.

📋 Description

• Own and extend our offline eval suite across products — datasets (capability + regression), judges, metrics • Build and maintain online quality dashboards: resolution rate, CSAT, thumbs up/down, LLM-as-judge signals, error rate, latency • Close the production feedback loop: mine failure patterns from real traffic → turn them into regression cases → propose fixes to Product and domain experts • Harden methodology: judge stability, non-determinism handling • Translate numbers into decisions – weekly syncs, clear trade-offs, no dashboards for their own sake

🎯 Requirements

• Python and SQL – you can build an analysis end-to-end • Solid foundation in statistics – sampling, hypothesis testing, variance, understanding what a noisy metric is • Analytical mindset – you start from the business question, not from the tool • 3+ years in analyst / data scientist roles, at least one in a product context • Hands-on experience evaluating LLM applications (RAG, agents, tool use, judges) • Experience building LLM agents – side projects, toy builds, personal experiments all count • AI-assisted coding is our default authoring environment • Curious and fluent with AI coding, or genuinely excited to become fluent fast • Care about what you ship and how clearly you think

🏖️ Benefits

• Make a genuine impact on the product • Join our upward trajectory and grow with us • Work in the EU • Enjoy the flexibility of traveling and working remotely or in a hybrid model across Europe • Become a stock options holder • Unlock your inner entrepreneur through our Stock Options Program • Receive unwavering support and care • Constant support and care to ensure your experience is successful and fulfilling • Immerse yourself in our exclusive Work & Swim Program • Spend one month in a comfortable corporate apartment in enchanting Cyprus • Ideal opportunity for work-life balance while enjoying Mediterranean views • Equal Opportunity Employer valuing diversity

Apply Now

Similar Jobs

🕒 3 days ago

OpenX

201 - 500

Data Scientist III managing data science projects from conception to deployment at OpenX. Collaborating with cross-functional teams, mentoring juniors, and solving complex marketplace problems.

🇵🇱 Poland – Remote

💵 zł19.6k - zł21.9k / month

💰 Secondary Market on 2015-05

⏰ Full Time

🟡 Mid-level

🟠 Senior

📊 Data Scientist

🕒 June 24

SD Solutions

201 - 500

🤝 B2B

🎯 Recruiter

🏢 Enterprise

Data Scientist leading design and implementation of ML/DL algorithms for AI-driven healthcare startup. Collaborating with product and engineering teams to deliver analytical interfaces.

🕒 June 20

InPost Group

10,000+ employees

🛍️ eCommerce

🚗 Transport

Data Scientist leveraging analytics and machine learning to improve e-commerce operations at InPost. Collaborating with cross-functional teams to drive strategic decision-making and operational excellence.

🗣️🇵🇱 Polish Required

🕒 June 19

Neurons Lab

51 - 200

Data Science Lead managing the foundational data-science build for a UK credit and lending company. Leading a small data team while building risk validation models and insights.

🕒 June 17

Globaldev Group

201 - 500

🤝 B2B

☁️ SaaS

🏢 Enterprise

Senior Data Scientist analyzing complex data and developing machine learning models for a fast-growing mobile data platform. Collaborating with multiple teams to enhance data-driven decision-making.