Software Engineer, Data Infrastructure – Research

🕒 September 22, 2025

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of OpenAI

OpenAI

WebsiteLinkedIn

201 - 500 employees

Founded 2015

🤖 Artificial Intelligence

☁️ SaaS

🏢 Enterprise

Artificial Intelligence • SaaS • Enterprise

OpenAI is a leading research organization and company dedicated to creating advanced artificial intelligence technology, with a strong emphasis on safety and ethical considerations. OpenAI's mission is to ensure that artificial general intelligence (AGI) benefits all of humanity. The company develops AI products like ChatGPT, which can assist users with tasks ranging from everyday requests to complex enterprise solutions. OpenAI also provides an API platform that integrates its AI models into various applications. The company is focused on innovation in AI and improving data analysis capabilities, while emphasizing safety and ethical governance of their systems.

📋 Description

• Design and maintain standardized dataset APIs, including for multimodal data that cannot fit in memory • Build proactive testing and scale validation pipelines for dataset loading at GPU scale • Integrate datasets into training and inference pipelines, collaborating with multimodal researchers and infra teams • Document and maintain dataset interfaces for discoverability and consistent adoption • Establish safeguards and validation systems to ensure reproducibility of standardized datasets • Debug and resolve performance bottlenecks in distributed dataset loading (e.g., stragglers) • Provide visualization and inspection tools to surface errors, bugs, or bottlenecks • Work on LLM training and inference infrastructure to support massive-scale GPU/accelerator fleets

🎯 Requirements

• Strong engineering fundamentals with experience in distributed systems, data pipelines, or infrastructure • Experience building APIs, modular code, and scalable abstractions with attention to UX • Comfortable debugging bottlenecks across large fleets of machines • Collaborative, humble, and able to own foundational ML infrastructure • Bonus: background in data math, probability, or distributed data theory • Bonus: experience with GPU-scale distributed systems or dataset scaling for real-time data

🏖️ Benefits

• Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit) • 401(k) retirement plan with employer match • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks) • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees • 13+ paid company holidays and multiple paid coordinated company office closures, plus paid sick or safe time • Mental health and wellness support • Employer-paid basic life and disability coverage • Annual learning and development stipend • Daily meals in our offices, and meal delivery credits as eligible • Relocation support for eligible employees • Additional taxable fringe benefits, such as charitable donation matching and wellness stipends

Apply Now

Similar Jobs

🕒 September 21, 2025

Insight Value

1 - 10

📱 Media

WebsiteLinkedIn

Fullstack engineer building AI/AR products for field workforce at Navigate AI. Own product features end-to-end and collaborate with founders.

🏢🏡 San Francisco – Hybrid

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-stack Engineer

Python

React

TypeScript

🕒 September 13, 2025

Doppel

1 - 10

🤖 Artificial Intelligence

🔐 Security

🔒 Cybersecurity

WebsiteLinkedIn

Fullstack engineer building Doppel's Simulation product to create phishing simulations and voice deepfakes. Develop scalable AI systems to identify and neutralize digital threats.

🏢🏡 San Francisco – Hybrid

💵 $135k - $300k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-stack Engineer

Cyber Security

🕒 September 13, 2025

Doppel

1 - 10

🤖 Artificial Intelligence

🔐 Security

🔒 Cybersecurity

WebsiteLinkedIn

Build AI-native infrastructure to detect and neutralize social engineering threats at a cybersecurity startup. Design scalable systems monitoring billions of entities and leveraging AI agents.

🏢🏡 San Francisco – Hybrid

💵 $135k - $300k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

🧑‍💻 Full-stack Engineer

Cloud

Cyber Security

GraphQL

Postgres

Python

React

TypeScript

🕒 September 4, 2025

Eventual

1 - 10

WebsiteLinkedIn

Develop query planning, execution engine, scheduler, and storage integrations for Daft. Work on distributed systems to scale multimodal AI data processing.

Apache

AWS

BigQuery

Cloud

Hadoop

Linux

Postgres

Ray

Rust

Spark

🕒 September 4, 2025

Eventual

1 - 10

WebsiteLinkedIn

Ship full-stack product features and APIs for Eventual's multimodal data platform. Collaborate cross-functionally and work in SF office four days weekly.

AWS

Cloud