Solutions Engineer – Media

Job not on LinkedIn

🕒 March 31

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Grupo Protege

Grupo Protege

10,000+ employees

Founded 1971

🤖 Artificial Intelligence

🤝 B2B

☁️ SaaS

Artificial Intelligence • B2B • SaaS

Grupo Protege is an AI training data platform that connects AI developers with high-quality, ethically sourced training data. It serves both AI developers by providing a vast and rich collection of data for model training and data holders by enabling them to monetize their data while maintaining governance and control. The platform aims to streamline the data procurement process significantly, making it easier for developers to access the data they need efficiently.

📋 Description

• Own data quality and curate media datasets • Partner with Sales and Solutions to translate customer requirements into curation strategies • Work with imperfect partner data, including mismatched metadata, schema differences, and incomplete labeling • Normalize and standardize datasets for reliable downstream use • Query and analyze Protege’s media catalog using SQL, internal APIs, and metadata tools to identify relevant content • Build validation checks and workflows to ensure dataset integrity before delivery • Identify, debug, and resolve data quality issues across file structures, metadata, and content alignment • Use AI tools and transcoded embeddings to surface and refine clip-level content • Turn messy, real-world data into structured datasets that meet customer and model requirements • Run iterative sample reviews with customers, incorporate feedback, refine selections, and ensure final packages meet spec • Build deep expertise in Protege’s media catalog structure, metadata, and growth patterns • Track content coverage, diversity, and modality mix, and identify gaps relative to customer demand • Partner with Product and Partnerships to share catalog insights that inform sourcing priorities • Work cross-functionally to ensure content packaging meets technical, ethical, and licensing requirements • Develop methods, scripts, and internal tools that improve curation efficiency and scale • Help shape Protege’s delivery platform, including how internal users and customers search, sample, and export data • Work closely with embedding-based systems to iterate between algorithmic selection and human review • Define best practices for embedding queries, relevance evaluation, and content diversity • Maintain a high bar for operational excellence and quality assurance throughout the process

🎯 Requirements

• 4-7 years of experience in data science, media analytics, technical curation, or similarly hands-on data roles. • Strong SQL proficiency and comfort querying large, messy datasets to generate insight and action. • Experience working with media metadata, embeddings, or unstructured content. • Ability to translate nuanced customer or model requirements into concrete dataset specifications. • High standard for data quality, operational rigor, and usability of delivered outputs. • Clear communicator who can move between technical depth and customer-friendly clarity. • Thrive in ambiguous, fast-moving environments and treats teammates with kindness.

🏖️ Benefits

• Health insurance • Professional development opportunities • Flexible work arrangements • Remote work options

Apply Now

Similar Jobs

🕒 March 31

Germain UX

51 - 200

☁️ SaaS

🛍️ eCommerce

Customer Solution Engineer responsible for delivering monitoring and analytics solutions for Germain UX clients. Communicating with clients and configuring cloud or on-premise setups to meet their needs.

Apache

AWS

Cloud

Kubernetes

Python

SQL

Unix

🕒 March 27

Senior Solutions Engineer at Nexer specializing in Microsoft AI technologies. Leading cloud solution architecture and integration for substantial business transformation projects.

🗣️🇧🇷🇵🇹 Portuguese Required

Azure

JavaScript

Node.js

Python

🕒 March 24

Docusign

5001 - 10000

🛍️ eCommerce

💸 Finance

☁️ SaaS

Solutions Architect collaborating with field sales teams to deliver integrated solutions leveraging Docusign's Agreement Cloud platform. Engaging with clients, understanding their business challenges, and providing tailored solutions.

ERP

SOAP

🕒 March 19

Oscilar

51 - 200

💳 Fintech

🏦 Banking

📋 Compliance

Solutions Architect designing and implementing intelligent risk decisioning systems at Oscilar. Collaborating with teams to drive customer success and product evolution.

Amazon Redshift

AWS

ElasticSearch

Postgres

Python

Redis

🕒 March 10

Logiks TI

201 - 500

☁️ SaaS

Business Analyst at Logiks responsible for implementing BI solutions in public sector projects. Growing client's business through innovative data solutions across Brazil.

🗣️🇧🇷🇵🇹 Portuguese Required