Member of Engineering – Pre-training, Data Acquisition

🕒 il y a 23 jours

🇺🇸 États-Unis – Télétravail

⏰ Temps Plein

🟡 Intermédiaire

🟠 Senior

🖥 Ingénieur Logiciel

🗣️🇺🇸🇬🇧 Anglais requis

Postuler Maintenant
Trouver des Emplois à Distance Similaires

📊 Vérifiez votre score de CV pour ce poste

Améliorez vos chances d'obtenir un entretien en vérifiant votre score de CV avant de postuler.

Logo of poolside

poolside

51 - 200 employés

Fondée en 2023

🤖 Intelligence artificielle

🏢 Entreprise

Artificial Intelligence • Enterprise

Poolside est un accélérateur spécifiquement conçu pour les fondateurs et builders Web3. Il apporte un soutien aux projets en finance décentralisée (DeFi), gaming, gouvernance, infrastructure et NFT. Fort d’un écosystème de 20 000 membres, incluant des mentors, des investisseurs et des builders Web3, Poolside a co-lancé et accompagné plus de 110 projets. L’accélérateur offre un accès privilégié au mentorat et à l’expertise technique pour aider les projets Web3 à scaler et à réussir leurs lancements. Poolside collabore également avec des entreprises et des protocoles de premier plan pour stimuler la croissance et l’innovation dans l’écosystème Web3.

Description

• Design, build, and operate a large-scale web crawler responsible for acquiring all openly accessible data on the internet • Develop specialized deep crawlers targeting high-value sources to improve recall and coverage • In collaboration with data researchers, own a long-term road map for data acquisition • Build observability, monitoring, and debugging tooling to ensure reliability and transparency across crawl infrastructure • Collaborate with pre-training, post-training, and evaluations teams to align data acquisition priorities with model training needs • Build high-throughput ingestion pipelines for rapidly onboarding partner data and evaluating it for quality

🎯 Exigences

• Strong distributed systems background with proven experience building and operating large-scale infrastructure — data pipelines, web crawlers, or similar • Proficiency in Python, and comfortable optimizing performance and debugging complex systems under production conditions • Hands-on experience with web crawling or large-scale data extraction: understanding of HTTP protocols, distributed job queues, and data parsing at scale • Familiarity with cloud platforms (AWS) and container orchestration (Kubernetes, Docker) for deploying and managing high-throughput workloads • Awareness of the non-technical dimensions of internet-scale crawling: data privacy, robots.txt adherence, and responsible crawl practices • Nice to have: • Prior experience pre-training LLMs • Experience in building trillion-scale SOTA pre-training datasets • Experience translating research to production at scale

🏖️ Avantages

• Fully remote work & flexible hours • 37 days/year of vacation & holidays • 16 weeks of flexible, full-pay parental leave • Health insurance allowance for you & dependents • Company-provided equipment • Well-being, always-be-learning & home office allowances • Frequent team get togethers • Diverse & inclusive people-first culture

Postuler Maintenant

Emplois Similaires

🕒 il y a 23 jours

decircle

1 - 10

Protocol Engineer at an AI/ML organization developing decentralized computing platforms. Designing and managing blockchain protocols and smart contracts to revolutionize AI training.

🇺🇸 États-Unis – Télétravail

⏰ Temps Plein

🟢 Junior

🟡 Intermédiaire

🖥 Ingénieur Logiciel

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 23 jours

decircle

1 - 10

Mobile Engineer crafting high-performance React Native trading experiences at PVP. Collaborating closely with design and product teams to achieve exceptional UI/UX standards.

🇺🇸 États-Unis – Télétravail

💵 $150 000 - $220 000 / an

⏰ Temps Plein

🟠 Senior

🖥 Ingénieur Logiciel

🗣️🇺🇸🇬🇧 Anglais requis

Kotlin

React

React Native

Swift

TypeScript

🕒 il y a 24 jours

The Arena Club

51 - 200

🧘 Bien-être

⚽ Sports

👥 B2C

Senior Mobile Engineer developing and enhancing Arena Club’s mobile app using React Native. Collaborating with teams to deliver seamless user experiences and robust app features.

🇺🇸 États-Unis – Télétravail

⏰ Temps Plein

🟠 Senior

🖥 Ingénieur Logiciel

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 24 jours

General Dynamics Information Technology

10 000+ employés

🔒 Cybersecurity

🤖 Intelligence artificielle

Application Developer at GDIT transforming technology into opportunity for CNIC’s enterprise applications. Supporting .NET, BI, and SharePoint solutions while resolving technical issues and maintaining documentation.

🇺🇸 États-Unis – Télétravail

💵 $129 813 - $155 250 / an

⏰ Temps Plein

🟠 Senior

🔴 Expert

🖥 Ingénieur Logiciel

🦅 Parrain de Visa H1B

info

🗣️🇺🇸🇬🇧 Anglais requis

🕒 il y a 25 jours

Courseware Developer/Programmer designing and maintaining eLearning solutions for DoD and Federal training environments. Collaborating with instructional designers and SMEs to enhance learner engagement through innovative technologies.

🇺🇸 États-Unis – Télétravail

⏰ Temps Plein

🟡 Intermédiaire

🟠 Senior

🖥 Ingénieur Logiciel

🗣️🇺🇸🇬🇧 Anglais requis