Founding Data Engineer – Core Data Platform

🕒 March 26

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Embedding VC

Embedding VC

1 - 10 employees

Founded 2023

🤖 Artificial Intelligence

💸 Finance

Artificial Intelligence • Finance

Embedding VC is an early-stage venture capital firm based in Menlo Park, CA that backs engineering, product, and research-focused founders, with a particular emphasis on AI-driven startups. The firm maintains a curated portfolio of developer and data infrastructure companies and communicates via developer-forward channels (CLI-style website, newsletters, and LinkedIn). Embedding VC focuses on seed and early-stage investments in technology companies leveraging machine learning and AI.

📋 Description

• Own the entire data foundation of a fast-scaling AI company — from raw data to executive metrics. • Build from 0 → 1 — define the architecture that powers product, finance, and company-wide decision making. • High visibility and impact — your work directly informs leadership, product direction, and company strategy. • Help define how data supports AI systems, agents, and long-term intelligence. • Design and build core data pipelines (e.g., product events, payments, internal systems → BigQuery) • Define and maintain the data warehouse architecture, including schema design, data modeling, and table structure • Establish and own the single source of truth (SOT) for product and business metrics • Build and maintain core data models (user, subscription, revenue, engagement, etc.) • Ensure data consistency across systems (product analytics, billing, internal tools) • Lead data reconciliation efforts (e.g., Stripe vs internal systems vs reporting) • Implement data quality checks, validation, and monitoring systems • Build reliable reporting layers used by leadership and finance (not ad hoc dashboards) • Establish data standards and contracts (event naming, schema governance, tracking consistency) • Partner with engineering to improve instrumentation and data correctness at source • Support downstream teams (analytics, DS) by providing clean, well-documented datasets • Continuously improve data reliability, performance, and cost efficiency

🎯 Requirements

• 5+ years of experience in data engineering or analytics engineering • Proven experience building data platforms or warehouses from 0 → 1 • Strong SQL and Python — you write clean, production-quality data code • Deep expertise in data modeling, ETL/ELT design, and warehouse architecture • Experience with modern data stack: BigQuery / Snowflake / Redshift • dbt or similar transformation tools • Workflow orchestration tools (Airflow / Prefect or similar) • Experience working with financial and product data (e.g., payments, subscriptions, usage data) • Strong understanding of data reliability, testing, and validation • Ability to translate business definitions into durable, consistent data models • High ownership — you can define and drive architecture decisions independently • Comfortable operating in ambiguous, fast-moving environments

🏖️ Benefits

• Competitive base salary and bonus program • Equity — meaningful ownership in what you build • High autonomy, high growth environment

Apply Now

Similar Jobs

🕒 March 26

D.A. Davidson Companies

1001 - 5000

💸 Finance

Senior Data Engineer at D.A. Davidson optimizing data integrations and migrating legacy structures. Involves cloud architecture and coordination with product teams.

Airflow

Azure

Cloud

ETL

MS SQL Server

Python

RDBMS

SQL

SSIS

🕒 March 26

Pivotal Solutions

51 - 200

👥 HR Tech

🎯 Recruiter

Data Architect at Pivotal Solutions, Inc. architecting and designing robust ETL processes using Cloud tools to client specifications.

Airflow

Amazon Redshift

AWS

Azure

Cloud

ETL

Google Cloud Platform

Java

MS SQL Server

Python

Scala

Spark

SQL

🕒 March 25

Keyrus

1001 - 5000

🤝 B2B

Data Engineer managing AWS data architecture and development of data pipelines. Collaborating with business and tech teams to ensure data reliability and performance.

🗣️🇧🇷🇵🇹 Portuguese Required

AWS

Python

SQL

🕒 March 25

CloudZero

11 - 50

☁️ SaaS

🤖 Artificial Intelligence

💸 Finance

Distinguished Architect leading the design of a next-generation data platform at CloudZero. Driving technical decisions and architectural innovations for scalable cloud cost management solutions.

🇺🇸 United States – Remote

💵 $275k - $330k / year

💰 Series A on 2021-03

⏰ Full Time

🟠 Senior

🔴 Lead

🚰 Data Engineer

Apache

Kafka

Spark

🕒 March 24

Automotive Software Engineer developing data solutions for autonomous vehicles, analyzing sensor data and supporting ML training. Collaborating within a remote team for advanced mobility solutions.

ETL

PySpark

Python

Spark

SQL