Data Engineer II

🕒 May 22

InStride Health

51 - 200 employees

⚕️ Healthcare Insurance

🧘 Wellness

💰 $26M Venture Round on 2022-10

InStride Health provides specialty outpatient care for pediatric anxiety and OCD, serving children, teens, and young adults aged 7 to 22. The company offers a research-based clinical model developed at a Harvard-affiliated medical school. Each patient gets a dedicated care team and a personalized treatment plan, including individual therapy, family therapy, exposure coaching, and medication management. The aim is to help young people face their fears and build resilience while supporting their families. The service is covered by most major insurance plans and is available in several states, including CT, MA, ME, NH, NJ, NY, OH, PA, RI, and VA.

📋 Description

• Design, develop, and maintain robust, scalable ETL/ELT data pipelines using Python, SQL, and data processing frameworks including dbt, Matillion, and AWS services.
• Implement data quality checks, monitoring, and alerting across all data pipelines to ensure data integrity and reliability.
• Optimize existing pipelines for performance, cost-efficiency, and error handling.
• Contribute to the design and maintenance of InStride’s data warehouse and data lake solutions, including schema design, data modeling, and indexing strategies in Amazon Redshift.
• Ensure data security, HIPAA compliance, and proper handling of protected health information (PHI) within all data infrastructure.
• Work closely with data analysts, data scientists, and business intelligence engineers to understand their data requirements and deliver reliable, high-quality data access.
• Troubleshoot and resolve complex production data issues with urgency and root-cause rigor.
• Develop and maintain clear documentation for data models, pipelines, and data sources to enable self-service analytics.
• Participate in code reviews and contribute to technical discussions, bringing a constructive and detail-oriented perspective.
• Stay current on emerging data technologies and tools, bringing relevant insights to the team.

🎯 Requirements

• 3+ years of experience designing, developing, and deploying data pipelines and data warehouse solutions in production environments.
• Strong proficiency in SQL and Python for data engineering and transformation work.
• Hands-on experience with cloud data warehouses (Amazon Redshift preferred; Snowflake or BigQuery also valued) and familiarity with ETL/ELT tools such as dbt, Matillion, or similar.
• Working knowledge of AWS services relevant to data engineering (e.g., S3, Glue, Lambda, Redshift).
• Demonstrated understanding of HIPAA compliance requirements and experience working with sensitive or regulated data.
• Ability to design and build data systems that are scalable, observable, and built to handle growth in data volume and complexity.
• Strong understanding of data integration patterns, including APIs, webhooks, and batch ingestion techniques.
• Experience giving and receiving structured feedback through pull request reviews and technical discussions.
• Strong communication skills, with the ability to translate complex technical concepts for both technical and non-technical stakeholders.
• Comfortable operating in a fast-paced startup environment, balancing competing priorities and making sound tradeoffs.
• Experience working with healthcare data is a plus.

🏖️ Benefits

• Generous benefits package (401k with match)
• Flexible PTO
• Paid holidays
• Paid service days
• 4 week paid sabbatical
• 12 week paid parental leave
• Health benefits starting on your first day
• And more
