Staff Data Engineer

11 - 50 employees

Founded 2016

🎯 Recruiter

🤝 B2B

Recruitment • B2B

Zigsaw is on a mission to help people find the job of their choice, making job-search and talent discovery simpler, faster, and more effective for job-seekers. They specialize in recruitment, talent acquisition, and operate a job board to facilitate hiring for clients.

Staff Data Engineer

Job not on LinkedIn

🕒 April 20

🏄 California – Remote

💵 $177.2k - $364.8k / year

⏰ Full Time

🔴 Lead

🚰 Data Engineer

AWS

Cloud

Scala

Spark

Apply Now

Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Zigsaw

11 - 50 employees

Founded 2016

🎯 Recruiter

🤝 B2B

Recruitment • B2B

📋 Description

• Design and maintain a scalable identity resolution platform • Build pipelines and services to ingest, normalize, link, and version identity data across multiple sources • Ensure deterministic and probabilistic matching logic that is transparent, auditable, and measurable • Partner with product and analytics teams to expose identity data through reliable, well-documented APIs and datasets • Build and operate batch and streaming pipelines using modern data stack tools • Create clear documentation, standards, and runbooks for identity and governance systems • Own data governance foundations including data lineage, quality checks, schema enforcement, and access controls • Implement privacy-by-design principles (PII handling, consent enforcement, retention policies) • Collaborate with legal, privacy, and security teams to operationalize regulatory requirements (e.g., GDPR, CCPA) • Establish monitoring and alerting for data quality, freshness, and integrity

🎯 Requirements

• Production data engineering experience • Bachelor’s degree in computer science, related field or equivalent experience • Proficiency in Spark and Scala, with proven experience building data infrastructure in Spark using Scala • Experience in delivering significant technical initiatives and building reliable, large scale services • Experience in delivering APIs backed by relationship-heavy datasets • Experience implementing data governance practices, including data quality, metadata management, and access controls • Strong understanding of privacy-by-design principles and handling of sensitive or regulated data • Familiarity with data lakes, cloud warehouses, and storage formats • Strong proficiency in AWS services • Excellent written and verbal communication skills • Successful design and implementation of scalable and efficient data infrastructure • High attention to detail in implementation of automated data quality checks • Effective collaboration with cross-functional teams • Demonstrated ability to use AI to improve speed and quality in your day-to-day workflow for relevant outputs • Strong track record of critical evaluation and verification of AI-assisted work (e.g., testing, source-checking, data validation, peer review) • High integrity and ownership: you protect sensitive data, avoid over-reliance on AI, and remain accountable for final decisions and deliverables

🏖️ Benefits

• Health insurance • Equity opportunities • Flexible work arrangements • Professional development

Apply Now

Similar Jobs

Data Engineer – Databricks, BigQuery, Snowflake

🕒 April 16

TENCYS

11 - 50

💼 Consulting

📦 Logistics

🏥 Healthcare

Data Engineer designing and optimizing data solutions at Uni Tencys Systems. Collaborating with teams for machine learning and analytics initiatives while ensuring data quality.

🇺🇸 United States – Remote

💰 $35k Pre seed on 2024-11

⏰ Full Time

🟠 Senior

🔴 Lead

🚰 Data Engineer

Airflow

Apache

AWS

Azure

Cloud

ETL

PySpark

Python

SQL

Terraform

Unity

Data Architect – Strong Azure Services, Finance Experience (GL, AR, AP)

🕒 April 10

CloudScouts

11 - 50

💼 Consulting

📣 Marketing

💸 Finance

Data Architect specializing in financial systems (GL, AR, AP) and using Azure. Fully remote position requiring 12+ years of relevant experience.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

🔴 Lead

🚰 Data Engineer

Amazon Redshift

Azure

BigQuery

Cloud

ERP

ETL

Informatica

Kafka

Oracle

Python

Spark

SQL

Tableau

Staff Software Engineer – Data Platform

🕒 April 7

Twilio

5001 - 10000

🔌 API

🤝 B2B

Staff Software Engineer responsible for architecting scalable data solutions on Twilio's Data Platform. Collaborating with teams for effective implementation and mentoring engineers.

🇺🇸 United States – Remote

💵 $171.1k - $213.9k / year

⏰ Full Time

🔴 Lead

🚰 Data Engineer

🦅 H1B Visa Sponsor

AWS

Distributed Systems

Hadoop

Java

Kafka

Python

Scala

Spark

Principal Data Engineer

🕒 April 4

Prescryptive Health, Inc.

201 - 500

🏥 Healthcare

💼 Consulting

📦 Logistics

Principal Data Engineer playing a key role in developing data infrastructure and analytical platform at Prescryptive. Responsible for architecture, design, implementation, and mentorship across teams.

🇺🇸 United States – Remote

💵 $177k - $215k / year

⏰ Full Time

🔴 Lead

🚰 Data Engineer

Airflow

Azure

Cloud

ETL

Python

SQL

Vault

Principal Data Engineer

🕒 April 1

Waymark

11 - 50

📣 Marketing

🤖 Artificial Intelligence