Senior Data Engineer, Data and Applied AI

Vaga não está no LinkedIn

🕒 Abril 10

🇺🇸 Estados Unidos – Remoto (EUA)

💵 $158.000 - $168.000 / ano

⏰ Tempo Integral

🟠 Sênior

🚰 Engenheiro de Dados

🦅 Patrocina Visto H1B

info

🗣️🇺🇸🇬🇧 Inglês obrigatório

Candidatar-se
Encontrar Vagas Remotas Similares

📊 Verifique sua pontuação de currículo para esta vaga

Melhore suas chances de conseguir uma entrevista verificando sua pontuação de currículo antes de se candidatar.

Logo of Plume

Plume

51 - 200 funcionários

Fundada em 2020

⚕️ Seguro de Saúde

🧘 Bem-estar

Healthcare Insurance • Wellness

A Plume é uma clínica virtual de atendimento afirmativo de gênero, exclusiva para indivíduos trans e não conformes. A empresa oferece terapia hormonal abrangente e serviços de bem-estar afirmativos de gênero, acessíveis por meio de um modelo de telessaúde que elimina as barreiras tradicionais ao atendimento. A Plume foca em fornecer cuidados de saúde centrados em trans, suporte e engajamento comunitário, oferecendo serviços como orientação de transição, suporte à ansiedade e depressão, tratamento de acne, cessação de tabagismo e muito mais. Com serviços disponíveis em 47 estados, a Plume está comprometida em apoiar a comunidade trans em sua jornada de gênero com atendimento conveniente e afirmativo.

Descrição

• Building and maintaining production-grade data pipelines in cloud data warehouses such as Google BigQuery or equivalent, following architectural standards set by the Director of Data and AI. • Designing and developing dbt models across bronze, silver, and gold layers, including a focus on quality and governance via automated tests, documentation, and incremental load strategies. • Creating and optimizing Airflow DAGs for data workflow orchestration, including scheduling, dependency management, error handling, and alerting. • Implement dimensional data models and data mart structures — guided by the team's modeling standards — that support clinical BI and ML feature consumption. • Crafting easy-to-understand visualizations and dashboards that align with commonly used business analytic standards in Looker or equivalent BI tools in close collaboration with product analytics, finance, operations, growth, and clinical stakeholders. • Integrating healthcare data from sources such as EHRs, Stripe, 3rd-party APIs, and application database feeds, normalizing incoming data into the unified data platform. • Applying HIPAA-compliant data handling practices, including PHI/PII masking, tokenization, audit logging, and role-based access controls across all pipeline and AI system work. • Architecting and implementing RAG pipelines — including document ingestion, chunking, embedding generation, and retrieval — using frameworks such as LangChain or LangGraph • Supporting MLOps workflows, including model training pipeline maintenance, deployment support, performance monitoring, and retraining triggers. • Code reviewing PRs from teammates, providing constructive technical feedback to peers, and upholding the team's engineering standards. • Collaborating closely with product managers to understand requirements and deliver reliable data and AI products. • Monitoring and triaging assigned pipeline and data quality failures, escalating architectural issues as appropriate. • Documenting pipeline designs, data models, and technical decisions in alignment with the team's governance and lineage tracking standards. • Evaluating new tools and frameworks, providing hands-on prototyping and technical assessments.

🎯 Requisitos

• 5+ years of hands-on experience in data engineering, analytics engineering, or a closely related role. • 2+ years of experience working within the healthcare industry, including working knowledge of healthcare data standards, clinical workflows, regulated data environments, and domain-specific data visualizations. • Working knowledge of HIPAA — including PHI/PII classification, data masking, audit logging, and access control requirements. • Proven production experience with at least one major cloud data warehouse: BigQuery, Snowflake, or Redshift — including advanced SQL and query optimization. • Strong hands-on experience with dbt (Core or Cloud), including incremental models, tests, documentation, and multi-environment workflows. • Deep experience with Apache Airflow for workflow orchestration, including DAG design, scheduling, monitoring, and failure handling. • Demonstrated knowledge of dimensional data modeling — star/snowflake schemas, SCD Types 1/2, fact and dimension table design. • Hands-on experience delivering dashboards and reports in at least one enterprise BI tool: Looker, Power BI, Tableau, Qlik, etc. • Proficiency in Python for data pipeline development, API integrations, and automation (Pandas, PySpark, or similar). • Practical exposure to RAG pipeline development and LLM integration using LangChain, LangGraph, or LlamaIndex • Hands-on exposure to MLOps concepts — model deployment, monitoring, and retraining workflows • Knowledge of CI/CD tooling for data and AI workloads (GitHub Actions, dbt Cloud CI) • Strong understanding of data quality and governance principles: lineage, access controls, data contracts, and automated testing and experience with data governance tools such as OpenMetadata • Excellent written and verbal communication skills with the ability to collaborate effectively across engineering, analytics, and clinical teams • Ability to work independently on assigned workstreams while keeping the Director and team informed of progress, blockers, and risks

🏖️ Benefícios

• Ground-Floor Equity (Series B) • Free Medical, Dental, and Vision on the first of the month after you start full-time work • Unlimited PTO • 11 paid holidays and company shut-down for a week in December • 401(k) • Free Plume and BetterHelp Subscriptions

Candidatar-se

Vagas Similares

🕒 Abril 10

Canoe Intelligence

51 - 200

💳 Fintech

☁️ SaaS

🤖 Inteligência Artificial

Data Engineer at Canoe Intelligence responsible for designing scalable data systems for alternative investment data processes. Collaborating with AI/ML Engineers and developing data architectures for new products.

🇺🇸 Estados Unidos – Remoto (EUA)

💵 $110.000 - $140.000 / ano

💰 $36.000.000 Series C - Canoe em 2024-07

⏰ Tempo Integral

🟡 Pleno

🟠 Sênior

🚰 Engenheiro de Dados

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Abril 10

CloudScouts

11 - 50

🤝 B2B

🏢 Corporativo

💸 Finanças

Data Architect specializing in financial systems (GL, AR, AP) and using Azure. Fully remote position requiring 12+ years of relevant experience.

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟠 Sênior

🔴 Especialista

🚰 Engenheiro de Dados

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Abril 10

Ziply Fiber

1001 - 5000

📡 Telecomunicações

👥 B2C

Senior Data Engineer responsible for designing and maintaining data pipelines at Ziply Fiber. Working with structured and unstructured data to support business intelligence and analytics operations.

🇺🇸 Estados Unidos – Remoto (EUA)

💵 $114.668 - $154.216 / ano

💰 Corporate Round em 2022-11

⏰ Tempo Integral

🟠 Sênior

🚰 Engenheiro de Dados

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Abril 9

Aledade, Inc.

501 - 1000

⚕️ Seguro de Saúde

🏢 Corporativo

Senior Technical Product Manager defining AI Data Platform roadmap for Aledade. Collaborating with AI engineers and clinical leadership to support value-based care workflows.

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟠 Sênior

🚰 Engenheiro de Dados

🗣️🇺🇸🇬🇧 Inglês obrigatório

🕒 Abril 9

Concept Plus, LLC

51 - 200

🏛️ Governo

Information/Data Architect providing technical expertise and support for data architecture requirements at Concept Plus. Developing data strategies and maintaining information structures for effective governance.

🇺🇸 Estados Unidos – Remoto (EUA)

⏰ Tempo Integral

🟠 Sênior

🔴 Especialista

🚰 Engenheiro de Dados

🗣️🇺🇸🇬🇧 Inglês obrigatório