Data Scientist – AI Data, LLM Specialist

🕒 November 7, 2025

Eclipse Labs

11 - 50 employees

Founded 2022

💳 Fintech

🎮 Gaming

💰 $9M seed round (September 2022)

Fintech • Blockchain • Gaming

Eclipse Labs is a pioneering software company building Ethereum's first layer-2 solution utilizing the Solana Virtual Machine (SVM). This innovative approach aims to combine the speed of Solana with the security of Ethereum, creating a powerful platform for decentralized applications. Eclipse Labs focuses on enhancing the user experience and scalability for developers, facilitating seamless integration of applications across both Ethereum and Solana ecosystems.

📋 Description

• Develop Data Labeling Strategies: Design and document a formal data annotation strategy, including clear, scalable, and efficient guidelines for labeling our data. Define and enforce quality metrics, including inter-annotator agreement.
• Optimize for LLM Consumption: Research, define, and prototype the optimal data formats, structures, and pre-processing steps required for fine-tuning and training LLMs on our datasets.
• Data Quality Analysis: Establish automated processes and metrics to analyze the quality of both raw and labeled data, providing feedback to improve our data collection and labeling workflows.
• Collaborate with Engineering: Work closely with the engineering team to guide the implementation of data processing pipelines and ensure the data infrastructure meets the needs of ML applications.

🎯 Requirements

• Proven experience as a Data Scientist or Machine Learning Engineer with a focus on data quality and preparation.
• Strong understanding of data labeling methodologies and hands-on experience with data annotation platforms and workflows.
• Demonstrated experience preparing datasets for training and fine-tuning Large Language Models (LLMs), including knowledge of techniques like tokenization, embeddings, and NER.
• Proficiency in Python and common data science libraries (e.g., Pandas, NumPy, Scikit-learn, spaCy, Hugging Face).
• Experience using APIs/SDKs to automate data annotation and active learning loops.
• Excellent communication skills, with an ability to create clear documentation for technical and non-technical audiences.

🏖️ Benefits

• Opportunity. We believe blockchains should be fast AND highly usable. You’ll do high-impact work to enhance Ethereum’s scalability, shaping the future of crypto.
• Flexibility. We collaborate synchronously and asynchronously through weekly all-hands meetings, Slack messaging, and quarterly in-person meetups.
• Team. Our founding team has experience launching and scaling blue-chip projects such as dYdX, Uniswap, and zkSync. We’re backed by leading funds and leaders including Polychain, Tribe, Placeholder, DBA, Mustafa Al-Bassam, Tarun Chitra, Meltem Demirors, and others.
• Culture. As an early member of our team, you’ll have a unique opportunity to help shape our culture. We value intellectual honesty and a bias towards action, and we believe every member plays a key role in achieving our ambitious goals.
• Compensation. You’ll receive a competitive salary + equity + benefits package.
