Senior Data Engineer

🕒 May 21

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of 9amHealth

9amHealth

11 - 50 employees

💰 $16M Series A on 2022-04

9amHealth is complete cardiometabolic care—a first-of-its-kind, whole-body approach to preventing and treating diabetes, obesity, high cholesterol, and hypertension. We partner with businesses looking to provide effective health benefits to their members while reducing overall healthcare costs.

📋 Description

• Design, build, and maintain scalable data pipelines and ETL/ELT workflows in Python using AWS Glue, Apache Spark, Airflow, or equivalent orchestration tools. • Write production-grade Python code for DWH logic, reporting jobs, data transformations, and internal tooling, following software engineering best practices (testing, code review, CI/CD). • Develop and optimize analytical data models (dimensional, OBT, or hybrid) that serve self-service BI and advanced analytics use cases. • Build and maintain dashboards, explores, and semantic layers in Looker and/or Tableau; serve as the analytics infrastructure owner ensuring data quality and governance. • Contribute backend application code in Java (Spring Boot) or Python to support data-intensive features, API integrations, and internal services. • Champion modern AI coding practices across the data team, leveraging tools like GitHub Copilot, Claude, Cursor, or similar AI-assisted development environments to accelerate delivery and code quality. • Author and maintain comprehensive SQL assets (stored procedures, views, complex queries) across Redshift, Aurora/MySQL, and Athena. • Operate and optimize AWS data infrastructure including Glue, S3, Redshift, CloudFormation, CloudWatch, IAM, and Athena. • Collaborate closely with clinical operations, product, finance, and engineering teams to translate business questions into reliable, well-documented data products. • Implement data quality frameworks, monitoring, alerting, and incident response processes for the data platform. • Contribute to the architecture and data strategy for AI/ML features, including data prep, feature engineering, and model monitoring. • Mentor the existing data analyst and data engineer; help establish team standards, code review practices, and documentation norms.

🎯 Requirements

• 10+ years of professional experience in data engineering, analytics engineering, or a hybrid data/backend software engineering role. • Strong software engineering background: this role requires someone who can write, test, debug, and ship production code, not just query data. • Expert-level Python: deep experience building production data pipelines, ETL logic, and reporting systems in Python. • Expert-level SQL: window functions, CTEs, recursive queries, query optimization, and performance tuning at scale. • Hands-on experience with AWS data services, specifically Glue, S3, Redshift, Athena, CloudFormation, CloudWatch, and IAM. • Experience with MySQL/Aurora in a production environment. • Hands-on experience building and operating data pipelines with AWS Glue, Spark, dbt, Airflow, or comparable frameworks. • Deep experience with at least one modern BI platform. Looker (LookML) strongly preferred; Tableau also valued. Should include semantic modeling, dashboard design, and self-service enablement. • Solid understanding of data modeling techniques: star/snowflake schemas, slowly changing dimensions, event-based models. • Familiarity with AI-assisted coding tools (GitHub Copilot, Claude Code, Cursor, Cody) and a demonstrated interest in integrating AI into engineering workflows. • Proficiency in Java (Spring Boot, Maven/Gradle) with experience shipping backend services or data-intensive applications to production. • AWS certifications (e.g., Solutions Architect, Data Analytics Specialty, or Database Specialty). • Experience in health tech, digital health, or regulated industries (HIPAA familiarity is a plus). • Experience with CI/CD for data assets (dbt CI, Great Expectations, or similar). • Background in building or contributing to AI/ML features: feature stores, training pipelines, model serving, or RAG architectures. • Comfort with infrastructure-as-code (Terraform, CloudFormation) and containerized deployments (Docker, ECS/EKS). • Prior experience in a startup or high-growth environment where you owned outcomes end to end. • Track record of improving developer experience and productivity through tooling, automation, or process improvements.

🏖️ Benefits

• Health insurance • Dental insurance • Vision insurance • Flexible PTO • Work from home options • Professional development budget • Support continuing education

Apply Now

Similar Jobs

🕒 May 21

Axiom

1001 - 5000

☁️ SaaS

🤝 B2B

Data Architect responsible for design and governance of enterprise data model at Axiom. Collaborating with technical teams to align data strategy with AI initiatives.

Python

SQL

🕒 May 21

HackerOne

201 - 500

🔐 Security

🔒 Cybersecurity

Manager of Applied Data Engineering at HackerOne leading the development of scalable data solutions. Driving continuous innovation and fostering collaboration across the company in a remote role.

Airflow

AWS

Python

SQL

Tableau

🕒 May 21

Tekmetric

51 - 200

Senior Data Engineer building and maintaining data infrastructure for Tekmetric's auto repair platform. Collaborating with teams to ensure accurate and reliable data for business decisions.

Airflow

Apache

ETL

Java

Python

Scala

Spark

SQL

Tableau

🕒 May 21

Blue Orange Digital

51 - 200

🤖 Artificial Intelligence

🤝 B2B

🏢 Enterprise

Senior Data Engineer developing enterprise data warehouse solutions for diverse industries. Collaborating with clients and teams on advanced data engineering tasks and cloud integration.

🇺🇸 United States – Remote

💵 $126k - $147k / year

💰 $700k Corporate round on 2022-05

⏰ Full Time

🟠 Senior

🚰 Data Engineer

Apache

Cloud

ETL

Java

Linux

Oracle

Shell Scripting

SQL

🕒 May 21

Guidehouse

10,000+ employees

Data Engineer designing and operating data pipelines on the Advana platform for the Department of Defense. Leveraging Databricks and Palantir Foundry to enable financial and operational insights.

AWS

Azure

Cloud

Oracle

Postgres

PySpark

Python

Spark

SQL