Senior Data Engineer


🕒 2 days ago

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

🚰 Data Engineer



Ceresti Health

11 - 50 employees

Founded 2013

⚕️ Healthcare Insurance

🤝 B2B

🔥 Funding within the last year

💰 $11.7M Venture Round - Ceresti Health on 2025-08


Ceresti Health is a tech-enabled care company that partners with health plans and accountable care organizations (ACOs) to support family caregivers of people living with dementia. The company combines claims-data driven risk stratification, caregiver education, a tablet-based platform, and weekly personal care navigators to improve care plan adherence, close social determinants of health gaps, and proactively detect changes in patients' conditions. Ceresti’s caregiver-led program has independently validated results showing substantial reductions in avoidable hospitalizations and per-patient medical costs, and it offers no-risk, outcomes-based contracts for payers.

📋 Description

• Design and own Ceresti’s end-to-end data architecture: a landing zone with secure cloud object storage for raw partner files and API payloads, validated ingestion pipelines into our transactional Postgres, and a curated analytics layer that decouples reporting and AI workloads from production
• Build ingestion pipelines for the data we receive today, including partner data files (CSV/JSON/XML/HL7/X12 as applicable) and REST/SFTP API integrations, with schema validation, quarantine of bad records, and full lineage from raw bytes to curated row
• Stand up and operate the curated layer (data warehouse / lakehouse-lite) so analytics and ML models can consume data without slowing down the transactional system
• Choose, integrate, and operate the smallest set of tools needed, including object storage, an orchestrator (Dagster, Prefect, Airflow, etc.), dbt or similar for transformations, and a single validation library (Great Expectations / Pandera / Soda)
• Design and enforce data governance for a HIPAA-regulated environment: PHI/PII classification, encryption in transit and at rest, role-based access, audit logging, retention and minimum-necessary policies, and de-identification where appropriate
• Partner with backend, ML, product, and clinical stakeholders to define data contracts with our health plan and ACO partners and hold the line on data quality
• Build and maintain reliable feature data for ML models, including embeddings (e.g., pgvector) and curated feature tables for risk stratification, engagement, and outcomes work
• Instrument the data platform for observability, including pipeline SLAs, data freshness, schema drift, and quality metrics, and act on what the data tells you
• Participate fully in our Agile process: backlog grooming, sprint planning, demos, and retrospectives
• Mentor engineers across the team on SQL, schema design, and the craft of building data systems that are boring in the best possible way

🎯 Requirements

• BS/BA degree or higher in Computer Science, Engineering, or a related technical field
• 8+ years of professional data engineering experience, with a track record of shipping production data systems end-to-end
• Mastery of PostgreSQL: schema design, indexing, query tuning, partitioning, logical replication, JSONB, extensions (pg_partman, pg_cron, pgvector, etc.), and operating Postgres at scale
• Strong experience designing and operating data pipelines, including file-based ingestion (SFTP / object storage drops) and API-based ingestion (REST, webhooks)
• Hands-on experience with one or more cloud platforms (AWS preferred) and their data primitives: object storage (S3) and managed Postgres
• Experience designing data warehouses and/or data lakes, and the judgment to know which one a given problem actually needs
• Strong experience with dbt (or an equivalent SQL-based transformation framework) and modern data modeling patterns (Kimball dimensional, Data Vault, One Big Table), with an opinion about when each is right
• Experience with at least one orchestration framework (Dagster, Prefect, or Airflow) and a clear point of view on which to use when
• Strong Python skills for ingestion, validation, and tooling
• Experience with data validation and data-quality frameworks (Great Expectations, Pandera, Soda, or equivalent)
• Experience with change-data-capture from Postgres (logical replication or equivalent)
• Data governance experience in a HIPAA-regulated environment or, at minimum, demonstrated instincts for protecting PHI and PII (encryption, least privilege, audit, de-identification, BAA-aware vendor selection); HITRUST or SOC 2 experience is a strong plus
• Comfortable with infrastructure-as-code and CI/CD for data systems
• Experience supporting ML workloads: building feature tables, managing training data, serving features at inference time; familiarity with embeddings, vector search (pgvector or equivalent), and LLM integration patterns (RAG, prompt-grounded analytics) is a plus
• Excellent written and verbal communication skills: you can explain a tricky schema decision to a business stakeholder and a data contract to a partner with equal clarity
• Demonstrated experience working in Agile/Scrum teams

🏖️ Benefits

• Competitive salary and benefits package
• Opportunities for professional growth and development
• Collaborative and dynamic work environment
• Flexible work arrangements and remote work options
• Access to cutting-edge technologies and tools

