Data Scientist – AI Data, LLM Specialist

🕒 Novembro 7, 2025

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟡 Pleno

🟠 Sênior

📊 Cientista de Dados

🗣️🇺🇸🇬🇧 Inglês obrigatório

Candidatar-se
Encontrar Vagas Remotas Similares

📊 Verifique sua pontuação de currículo para esta vaga

Melhore suas chances de conseguir uma entrevista verificando sua pontuação de currículo antes de se candidatar.

Logo of Eclipse Labs

Eclipse Labs

11 - 50 funcionários

Fundada em 2022

💳 Fintech

🎮 Jogos

💰 $9.000.000 Seed Round em 2022-09

Fintech • Blockchain • Gaming

A Eclipse é uma startup de blockchain L2 sediada em São Francisco, fundada por Neel Somani em 2022. Com lançamento previsto para o segundo trimestre de 2024, a Eclipse Mainnet será a primeira SVM Layer 2 da Ethereum. Com uma arquitetura que combina as melhores partes da pilha modular, os desenvolvedores obtêm todos os benefícios de uma capacidade de processamento dedicada, sem as desvantagens. A Eclipse possibilita a construção de aplicações que podem coordenar grandes grupos de pessoas e escalar conforme as necessidades de qualquer desenvolvedor.

Descrição

• Develop Data Labeling Strategies: Design and document a formal data annotation strategy, including clear, scalable, and efficient guidelines for labeling our data. Define and enforce quality metrics, including inter-annotator agreement. • Optimize for LLM Consumption: Research, define, and prototype the optimal data formats, structures, and pre-processing steps required for fine-tuning and training LLMs on our datasets. • Data Quality Analysis: Establish automated processes and metrics to analyze the quality of both raw and labeled data, providing feedback to improve our data collection and labeling workflows. • Collaborate with Engineering: Work closely with the engineering team to guide the implementation of data processing pipelines and ensure the data infrastructure meets the needs of ML applications.

🎯 Requisitos

• Proven experience as a Data Scientist or Machine Learning Engineer with a focus on data quality and preparation. • Strong understanding of data labeling methodologies and hands-on experience with data annotation platforms and workflows. • Demonstrated experience preparing datasets for training and fine-tuning Large Language Models (LLMs), including knowledge of techniques like tokenization, embeddings, and NER. • Proficiency in Python and common data science libraries (e.g., Pandas, NumPy, Scikit-learn, spaCy, Hugging Face). • Experience using APIs/SDKs to automate data annotation and active learning loops. • Excellent communication skills, with an ability to create clear documentation for technical and non-technical audiences.

🏖️ Benefícios

• Opportunity. We believe blockchains should be fast AND highly usable. You’ll do high-impact work to enhance Ethereum’s scalability, shaping the future of crypto • Flexibility. We collaborate synchronously and asynchronously, across weekly all-hands meetings, Slack messaging, and quarterly in-person meetups • Team. Our founding team has experience launching and scaling blue-chip projects such as dYdX, Uniswap, and zkSync. We’re backed by leading funds and leaders including Polychain, Tribe, Placeholder, DBA, Mustafa Al-Bassam, Tarun Chitra, Meltem Demirors, and others • Culture. As an early member of our team, you’ll have a unique opportunity to help shape our culture. We value intellectual honesty, bias towards action, and believe every member plays a key role in achieving our ambitious goals • Compensation. You’ll receive a competitive salary + equity + benefits package

Candidatar-se

Vagas Similares

🕒 Novembro 4, 2025

First Quality

1001 - 5000

⚕️ Seguro de Saúde

🛒 Varejo

⚡ Produtividade

Data Scientist with a focus on leveraging AI/ML for business improvement at First Quality. Collaborating with teams to design, implement, and assess performance of AI/ML tools.

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Novembro 1, 2025

Senior Data Scientist building innovative, data-driven recruiting services at SmartRecruiters. Collaborating with R&D and product teams to enhance data quality and support recruitment processes.

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟠 Sênior

📊 Cientista de Dados

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Novembro 1, 2025

Senior Manager Analytics & Data Science leading a team of data scientists and analysts. Collaborating with product and tech teams to drive data-informed decision making in a complex environment.

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟠 Sênior

📊 Cientista de Dados

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Outubro 29, 2025

Codvo.ai

51 - 200

🔒 Cibersegurança

☁️ SaaS

Data Scientist developing data-driven solutions in connected vehicle analytics at Codvo. Expertise in predictive modeling, machine learning, and data visualization required.

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟡 Pleno

🟠 Sênior

📊 Cientista de Dados

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Outubro 29, 2025

Select Minds LLC

51 - 200

☁️ SaaS

🏢 Corporativo

🤝 B2B

Sr Data Scientist developing LLMs, NLP models, and GenAI solutions. Collaborating with cross-functional teams to ensure model reliability and scalability.

🇺🇸 Estados Unidos – Remoto (EUA)

💵 $150.000 - $210.000 / ano

⏰ Tempo Integral

🟠 Sênior

📊 Cientista de Dados

🦅 Patrocina Visto H1B

info

🗣️🇺🇸🇬🇧 Inglês obrigatório