Founding Data Engineer – Core Data Platform

🕒 March 26

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Embedding VC

Embedding VC

1 - 10 employees

Founded 2023

🤖 Artificial Intelligence

💸 Finance

Artificial Intelligence • Finance

Embedding VC is an early-stage venture capital firm based in Menlo Park, CA that backs engineering, product, and research-focused founders, with a particular emphasis on AI-driven startups. The firm maintains a curated portfolio of developer and data infrastructure companies and communicates via developer-forward channels (CLI-style website, newsletters, and LinkedIn). Embedding VC focuses on seed and early-stage investments in technology companies leveraging machine learning and AI.

📋 Description

• Own the entire data foundation of a fast-scaling AI company — from raw data to executive metrics. • Build from 0 → 1 — define the architecture that powers product, finance, and company-wide decision making. • High visibility and impact — your work directly informs leadership, product direction, and company strategy. • Help define how data supports AI systems, agents, and long-term intelligence. • Design and build core data pipelines (e.g., product events, payments, internal systems → BigQuery) • Define and maintain the data warehouse architecture, including schema design, data modeling, and table structure • Establish and own the single source of truth (SOT) for product and business metrics • Build and maintain core data models (user, subscription, revenue, engagement, etc.) • Ensure data consistency across systems (product analytics, billing, internal tools) • Lead data reconciliation efforts (e.g., Stripe vs internal systems vs reporting) • Implement data quality checks, validation, and monitoring systems • Build reliable reporting layers used by leadership and finance (not ad hoc dashboards) • Establish data standards and contracts (event naming, schema governance, tracking consistency) • Partner with engineering to improve instrumentation and data correctness at source • Support downstream teams (analytics, DS) by providing clean, well-documented datasets • Continuously improve data reliability, performance, and cost efficiency

🎯 Requirements

• 5+ years of experience in data engineering or analytics engineering • Proven experience building data platforms or warehouses from 0 → 1 • Strong SQL and Python — you write clean, production-quality data code • Deep expertise in data modeling, ETL/ELT design, and warehouse architecture • Experience with modern data stack: BigQuery / Snowflake / Redshift • dbt or similar transformation tools • Workflow orchestration tools (Airflow / Prefect or similar) • Experience working with financial and product data (e.g., payments, subscriptions, usage data) • Strong understanding of data reliability, testing, and validation • Ability to translate business definitions into durable, consistent data models • High ownership — you can define and drive architecture decisions independently • Comfortable operating in ambiguous, fast-moving environments

🏖️ Benefits

• Competitive base salary and bonus program • Equity — meaningful ownership in what you build • High autonomy, high growth environment

Apply Now

Similar Jobs

🕒 March 26

D.A. Davidson Companies

1001 - 5000

💸 Finance

Senior Data Engineer at D.A. Davidson optimizing data integrations and migrating legacy structures. Involves cloud architecture and coordination with product teams.

🇺🇸 United States – Remote

💵 $120k - $126k / year

⏰ Full Time

🟠 Senior

🚰 Data Engineer

🕒 March 26

Pivotal Solutions

51 - 200

👥 HR Tech

🎯 Recruiter

Data Architect at Pivotal Solutions, Inc. architecting and designing robust ETL processes using Cloud tools to client specifications.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

🚰 Data Engineer

🕒 March 25

CloudZero

11 - 50

☁️ SaaS

🤖 Artificial Intelligence

💸 Finance

Distinguished Architect leading the design of a next-generation data platform at CloudZero. Driving technical decisions and architectural innovations for scalable cloud cost management solutions.

🇺🇸 United States – Remote

💵 $275k - $330k / year

💰 Series A on 2021-03

⏰ Full Time

🟠 Senior

🔴 Lead

🚰 Data Engineer

🕒 March 24

Automotive Software Engineer developing data solutions for autonomous vehicles, analyzing sensor data and supporting ML training. Collaborating within a remote team for advanced mobility solutions.

🕒 March 24

PFF

201 - 500

⚽ Sports

📱 Media

☁️ SaaS

Senior Data Engineer leading the architectural redesign of PostgreSQL databases at a sports analytics company. Driving high-performance data delivery and maintaining scalable data pipelines.

🇺🇸 United States – Remote

💵 $120k - $160k / year

💰 $1M Seed Round - Pro Football Doc on 2020-10

⏰ Full Time

🟠 Senior

🚰 Data Engineer