Member of Engineering – Pre-training, Data Acquisition

🕒 Maio 18

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟡 Pleno

🟠 Sênior

🖥 Engenheiro de Software

🗣️🇺🇸🇬🇧 Inglês obrigatório

Candidatar-se
Encontrar Vagas Remotas Similares

📊 Verifique sua pontuação de currículo para esta vaga

Melhore suas chances de conseguir uma entrevista verificando sua pontuação de currículo antes de se candidatar.

Logo of poolside

poolside

51 - 200 funcionários

Fundada em 2023

🤖 Inteligência Artificial

🏢 Corporativo

Artificial Intelligence • Enterprise

Poolside é uma aceleradora projetada especificamente para fundadores e builders de Web3. Ela oferece suporte a projetos de finanças descentralizadas (DeFi), games, governança, infraestrutura e NFTs. Com um ecossistema robusto de 20. 000 membros — incluindo mentores, investidores e builders de Web3 — a Poolside co-lançou e apoiou mais de 110 projetos. A aceleradora proporciona acesso diferenciado a mentoria e expertise técnica para ajudar projetos Web3 a escalar e alcançar lançamentos bem-sucedidos. A Poolside também se engaja com empresas e protocolos líderes para impulsionar o crescimento e a inovação no espaço Web3.

Descrição

• Design, build, and operate a large-scale web crawler responsible for acquiring all openly accessible data on the internet • Develop specialized deep crawlers targeting high-value sources to improve recall and coverage • In collaboration with data researchers, own a long-term road map for data acquisition • Build observability, monitoring, and debugging tooling to ensure reliability and transparency across crawl infrastructure • Collaborate with pre-training, post-training, and evaluations teams to align data acquisition priorities with model training needs • Build high-throughput ingestion pipelines for rapidly onboarding partner data and evaluating it for quality

🎯 Requisitos

• Strong distributed systems background with proven experience building and operating large-scale infrastructure — data pipelines, web crawlers, or similar • Proficiency in Python, and comfortable optimizing performance and debugging complex systems under production conditions • Hands-on experience with web crawling or large-scale data extraction: understanding of HTTP protocols, distributed job queues, and data parsing at scale • Familiarity with cloud platforms (AWS) and container orchestration (Kubernetes, Docker) for deploying and managing high-throughput workloads • Awareness of the non-technical dimensions of internet-scale crawling: data privacy, robots.txt adherence, and responsible crawl practices • Nice to have: • Prior experience pre-training LLMs • Experience in building trillion-scale SOTA pre-training datasets • Experience translating research to production at scale

🏖️ Benefícios

• Fully remote work & flexible hours • 37 days/year of vacation & holidays • 16 weeks of flexible, full-pay parental leave • Health insurance allowance for you & dependents • Company-provided equipment • Well-being, always-be-learning & home office allowances • Frequent team get togethers • Diverse & inclusive people-first culture

Candidatar-se

Vagas Similares

🕒 Maio 18

decircle

1 - 10

Protocol Engineer at an AI/ML organization developing decentralized computing platforms. Designing and managing blockchain protocols and smart contracts to revolutionize AI training.

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟢 Júnior

🟡 Pleno

🖥 Engenheiro de Software

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Maio 18

decircle

1 - 10

Mobile Engineer crafting high-performance React Native trading experiences at PVP. Collaborating closely with design and product teams to achieve exceptional UI/UX standards.

🇺🇸 Estados Unidos – Remoto (EUA)

💵 $150.000 - $220.000 / ano

⏰ Tempo Integral

🟠 Sênior

🖥 Engenheiro de Software

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Maio 18

The Arena Club

51 - 200

🧘 Bem-estar

⚽ Esportes

👥 B2C

Senior Mobile Engineer developing and enhancing Arena Club’s mobile app using React Native. Collaborating with teams to deliver seamless user experiences and robust app features.

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟠 Sênior

🖥 Engenheiro de Software

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Maio 17

General Dynamics Information Technology

10.000+ funcionários

🔒 Cibersegurança

🤖 Inteligência Artificial

Application Developer at GDIT transforming technology into opportunity for CNIC’s enterprise applications. Supporting .NET, BI, and SharePoint solutions while resolving technical issues and maintaining documentation.

🇺🇸 Estados Unidos – Remoto (EUA)

💵 $129.813 - $155.250 / ano

⏰ Tempo Integral

🟠 Sênior

🔴 Especialista

🖥 Engenheiro de Software

🦅 Patrocina Visto H1B

info

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Maio 17

Courseware Developer/Programmer designing and maintaining eLearning solutions for DoD and Federal training environments. Collaborating with instructional designers and SMEs to enhance learner engagement through innovative technologies.

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟡 Pleno

🟠 Sênior

🖥 Engenheiro de Software

🗣️🇺🇸🇬🇧 Inglês obrigatório