Lead Data Architect

🔥 1 minute ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Henry Schein

Henry Schein

10,000+ employees

Founded 1932

⚕️ Healthcare Insurance

💊 Pharmaceuticals

🤝 B2B

Healthcare Insurance • Pharmaceuticals • B2B

Henry Schein is a global company dedicated to servicing a range of medical and dental providers. It offers a comprehensive portfolio of products and solutions, including practice management software, large equipment, and technology services. Founded in 1932, Henry Schein has established itself as a leader in healthcare products and services, with a commitment to improving health and increasing access to care through strategic growth and innovation across its global operations.

📋 Description

• Define and implement a scalable, enterprise-wide data architecture aligned with business and technology goals • Develop a data strategy roadmap, ensuring long-term sustainability, scalability, and efficiency • Partner with executive leadership, product teams, and engineering to ensure data initiatives drive business value • Establish enterprise data governance, security, and compliance frameworks leveraging tools like Collibra or Alation • Oversee the design and evolution of data lakes, data warehouses, and cloud-based analytics platforms using Databricks, Snowflake, BigQuery, or Redshift • Lead the adoption of modern data architecture patterns, including event-driven architectures, real-time data streaming (Kafka, Pulsar), and AI-driven analytics • Provide guidance on database optimization, indexing, partitioning, and storage strategies for tools like PostgreSQL, MySQL, and NoSQL solutions like MongoDB or Cassandra • Evaluate emerging technologies, making recommendations for tools and platforms that enhance data capabilities • Direct ETL/ELT strategies, ensuring seamless data flow across systems with Python, Apache Airflow, dbt, or Informatica • Architect cloud-based solutions (AWS, Azure, or GCP) using services such as AWS Glue, Azure Synapse, and Google Cloud Dataflow to support analytics, AI, and operational use cases • Ensure API-first design for data integration using GraphQL, RESTful APIs, or event-driven architectures (Kafka, AWS Kinesis, Pub/Sub) • Define and oversee data quality, lineage, and cataloging efforts using Great Expectations, Monte Carlo, or DataHub • Develop policies for data privacy, access control, and encryption, ensuring compliance with GDPR, CCPA, HIPAA, or other relevant regulations • Implement enterprise-wide metadata management and data lineage tracking using Collibra, Alation, or Data Catalog solutions • Drive best practices for data security and compliance audits, leveraging IAM tools and cloud security solutions • Lead a team of data architects, engineers, and analysts, mentoring them on best practices • Act as a liaison between business and technical teams, translating business needs into scalable data solutions • Champion a culture of innovation, ensuring the data team is adopting cutting-edge methodologies • Conduct data architecture reviews, ensuring alignment with organizational standards

🎯 Requirements

• 10+ years of experience in data architecture, data engineering, or related fields • Bachelor’s degree (Master’s preferred) in Computer Science, Applied Mathematics, Statistics, Machine Learning, or a closely related field (or foreign equivalent) • Proven track record in designing large-scale, enterprise data architectures • Expertise in SQL, NoSQL, and distributed database technologies such as Snowflake, Databricks, BigQuery, Redshift, PostgreSQL, MongoDB, and Cassandra • Strong experience with cloud-based data platforms (AWS, Azure, GCP) and services like AWS Glue, Azure Data Factory, and Google Dataflow • Deep understanding of data modeling, ETL/ELT processes, and data pipeline optimization using dbt, Apache Airflow, Informatica, or Talend • Experience with real-time streaming technologies (Kafka, Spark Streaming, Apache Flink, AWS Kinesis) • Strong knowledge of data security, governance, and compliance frameworks • Excellent verbal and written communication skills and ability to resolve disputes effectively and efficiently • Outstanding presentation and public speaking skills • Mastery independent decision making, analysis and problem-solving skills • Ability to quickly understand and assess complex projects, systems and ecosystems and identify relevant relationships and connections between them • Mastery planning and organizational skills and techniques • Communicate effectively with senior management and key stakeholders • Ability to influence, build relationships, understand organizational complexities, manage conflict and navigate politics • Familiarity with the healthcare data domain with previous experience working with healthcare datasets is a plus • Strong Python programming skills, with expertise in data manipulation and pipeline development using Pandas, PySpark, NumPy, and SQLAlchemy • Experience with AI/ML-driven analytics architectures and MLOps frameworks like MLflow or SageMaker • Hands-on experience with Infrastructure as Code (Terraform, CloudFormation) • Familiarity with Graph databases and knowledge graphs (Neo4j, Amazon Neptune) • Certifications in cloud data services (AWS Certified Data Analytics, Google Professional Data Engineer, Databricks Certified Data Engineer)

🏖️ Benefits

• Medical, Dental and Vision Coverage • 401K Plan with Company Match • PTO • Paid Parental Leave • Income Protection • Work Life Assistance Program • Flexible Spending Accounts • Educational Benefits • Worldwide Scholarship Program • Volunteer Opportunities

Apply Now

Similar Jobs

🔥 20 hours ago

BCT Partners

51 - 200

Data Engineer leading data management solutions for Head Start and Early Head Start programs. Overseeing design, development, and implementation of data engineering projects to enhance organizational performance.

🇺🇸 United States – Remote

💵 $120k - $140k / year

💰 Non Equity Assistance on 1998-12

⏰ Full Time

🟠 Senior

🚰 Data Engineer

AWS

Azure

Cloud

Docker

ETL

Oracle

Postgres

Python

SQL

🕒 Yesterday

AAA

5001 - 10000

🚗 Transport

👥 B2C

Enterprise Data Architect designing and delivering secure, scalable data and AI platforms for CSAA Insurance Group. Collaborating with teams to evolve capabilities and support business needs.

AWS

Cloud

🕒 Yesterday

Highmark Health

10,000+ employees

⚕️ Healthcare Insurance

🤝 Non-profit

🌍 Social Impact

HEDIS Data Engineer supporting data management and quality controls for Highmark Health. Collaborating with IT and engineering solutions for the Analytic Data Warehouse.

SQL

🕒 Yesterday

Senior Data Engineer developing data pipeline applications for CCC's cloud-based insurance platform. Involves mentoring teams and working with various data technologies.

Airflow

Amazon Redshift

Apache

AWS

Cloud

ETL

Hadoop

HBase

HDFS

Kafka

PySpark

Python

Spark

SQL

Subversion

Terraform

Unix

Yarn

🕒 Yesterday

Ferguson

10,000+ employees

🤝 B2B

🛍️ eCommerce

🛒 Retail

Senior Data Engineer designing and developing complex Power BI semantic models at Ferguson. Collaborating with analytics teams while ensuring high-quality delivery of data and reporting solutions.

Python

SQL