Software Engineer, Data

December 16, 2023

Apply Now
Imbue logo

Imbue

We build AI systems that can reason.

11 - 50

Description

We believe that high quality data is the most important part of creating high performance machine learning systems, regardless of whether they are simple classifiers or state of the art reasoning agents. Unlike many other organizations, we view this work, and this role, as one of the most important at the company. In this role, you will work on the most important part of our system--the software infrastructure for collecting, preprocessing, generating, analyzing, and distilling the wide variety of data sources that go into both our primary pretraining data corpus, as well as the datasets for all of the other ancillary and secondary models and system. You will make a meaningful, measurable impact on the performance of our systems, and experience the joy of spending time to make high quality software that makes high quality data. Example projects • Incorporate new sources of high quality text data into our existing data pipelines • Develop models for accurately classifying and extracting meaningful text from raw html • Create a high quality OCR pipeline for pulling pretraining text from images and scans • Collect a ludicrous amount of multimodal data(ex: transcripts for thousands of years of video) • Design unique data generation pipelines that leverage existing data(ex: convert code from one language to another) • Integrate multiple annotation service providers into a sensible interface for researchers

Requirements

• Detail oriented. Data mistakes are easy to make and hard to catch. • Passionate about data. You should be happy to look at and deeply engage with the raw data. • An excellent software engineer. We care about engineering best practices. • Familiar with python.

Benefits

• Work on the most important part of our system • Work at a place that deeply cares about data quality • Work directly on creating software with human-like intelligence • Very generous compensation • Flexible working hours • Work remotely • Time and budget for learning and self improvement

Apply Now

Similar Jobs

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com
Jobs by Title
Remote Account Executive jobsRemote Accounting, Payroll & Financial Planning jobsRemote Administration jobsRemote Android Engineer jobsRemote Backend Engineer jobsRemote Business Operations & Strategy jobsRemote Chief of Staff jobsRemote Compliance jobsRemote Content Marketing jobsRemote Content Writer jobsRemote Copywriter jobsRemote Customer Success jobsRemote Customer Support jobsRemote Data Analyst jobsRemote Data Engineer jobsRemote Data Scientist jobsRemote DevOps jobsRemote Ecommerce jobsRemote Engineering Manager jobsRemote Executive Assistant jobsRemote Full-stack Engineer jobsRemote Frontend Engineer jobsRemote Game Engineer jobsRemote Graphics Designer jobsRemote Growth Marketing jobsRemote Hardware Engineer jobsRemote Human Resources jobsRemote iOS Engineer jobsRemote Infrastructure Engineer jobsRemote IT Support jobsRemote Legal jobsRemote Machine Learning Engineer jobsRemote Marketing jobsRemote Operations jobsRemote Performance Marketing jobsRemote Product Analyst jobsRemote Product Designer jobsRemote Product Manager jobsRemote Project & Program Management jobsRemote Product Marketing jobsRemote QA Engineer jobsRemote SDET jobsRemote Recruitment jobsRemote Risk jobsRemote Sales jobsRemote Scrum Master + Agile Coach jobsRemote Security Engineer jobsRemote SEO Marketing jobsRemote Social Media & Community jobsRemote Software Engineer jobsRemote Solutions Engineer jobsRemote Support Engineer jobsRemote Technical Writer jobsRemote Technical Product Manager jobsRemote User Researcher jobs