Data Processing Principal Engineer

🕒 October 22, 2025

DataPelago

51 - 200 employees

Founded 2022

🤖 Artificial Intelligence

🏢 Enterprise

🤝 B2B

DataPelago provides a universal data processing engine (DataPelago Nucleus) and the DataPelago Accelerator for Spark, both built to dramatically accelerate GenAI and analytics workloads. Its platform leverages heterogeneous accelerated computing (GPUs, FPGAs, and CPUs), integrates with open-source frameworks like Spark, Trino, and Gluten/Substrait, and aims to deliver orders-of-magnitude performance gains and significant cost reductions with zero changes to applications and no vendor lock-in. DataPelago targets enterprise data and AI practitioners, offering deployment guides, benchmarks, and case studies that demonstrate large savings and faster processing for large-scale structured, semi-structured, and unstructured datasets.

📋 Description

• Lead the architecture, design, and implementation of a next-generation data processing engine built to exploit accelerated computing.
• Work with cross-functional teams to advance the architecture at the core of DataPelago’s parallel and distributed execution engine based on accelerated computing.
• Lead the execution engine team in the design, implementation, and rollout of an enterprise-grade, highly reliable acceleration engine for data processing.
• Design, implement, test, and maintain major components of the execution engine.
• Analyze and identify areas for differentiation and improvement in the execution engine.
• Collaborate with other teams and team members to drive code reviews, design reviews, performance and reliability reviews, and the development process, and to drive continuous improvement in all of these.

🎯 Requirements

• B.S. in EE/CS or equivalent with 15+ years of experience, or M.S. with 10+ years of experience.
• 10+ years of experience developing core components of an enterprise-grade database or analytics execution engine serving large-scale data processing workloads. Experience developing for platforms such as Apache Spark, Gluten, Velox, or DataFusion preferred.
• Experience developing high-performance parallel implementations of data processing operators and functions, such as joins, aggregations, and sorts.
• Experience leading 10+ teams in designing, developing, and releasing high-performance data processing engines for large production deployments.
• Strong programming ability in C, C++, and Rust.
• Strong development experience on Linux platforms.
