Data Engineer, Databricks

ICF

5001 - 10000 employees

Founded 1969

☁️ SaaS

⚡ Energy

💰 $30M Grant on 2021-03

Consulting • SaaS • Energy

ICF is a global consulting and technology services company that helps government and commercial clients tackle complex challenges. The firm provides expertise in areas such as federal IT modernization, energy and utilities, public health, climate resilience, disaster management, and transportation. ICF leverages data and analytics, artificial intelligence, and cybersecurity to deliver innovative solutions. With a strong commitment to corporate citizenship and sustainability, ICF supports social programs and community development, aiming for a sustainable, low-emissions future. The company operates worldwide, with a significant presence in Europe through its offices in Belgium, Spain, and the UK.

📋 Description

• Enable secure, scalable, and efficient data exchange between the federal client and external data sharing partners using Databricks Delta Sharing.
• Support the design and development of data pipelines and ETL routines in an Azure Cloud environment for many source system types, including RDBMS, API, and unstructured data, using CDC, incremental, and batch loading techniques.
• Conduct data profiling, transformation, and quality assurance on structured, semi-structured, and unstructured data.
• Identify underlying issues and translate them into technical requirements.
• Assist in building and optimizing data lakes, feature stores, and data warehouse structures to support analytics and machine learning.
• Prepare, structure, and validate data for data science and MLOps workflows, ensuring it meets the quality and format requirements for modeling.
• Help monitor and maintain the flow of data across BI dashboards, analytics environments, and machine learning pipelines.
• Engage directly with clients and stakeholders to understand data needs and translate them into scalable solutions.
• Collaborate with UX designers, business analysts, developers, and end users to define data and reporting requirements.
• Work with external data partners to determine their data product needs, and work within the Databricks platform to enable rapid prototyping and extensible use cases.
• Meet with government employees at executive levels, platform stakeholders, and vendor partners.
• Work within Agile teams to support iterative development, backlog grooming, and sprint-based delivery.
• Provide mentorship to junior resources.

🎯 Requirements

• Bachelor’s degree.
• 5+ years in data engineering, data security practices, data platforms, and analytics.
• U.S. citizenship required due to federal contract requirements.
• Ability to obtain and maintain a federal public trust clearance or equivalent client-required background investigation.
• Candidate must reside in the U.S. and be authorized to work in the U.S.; all work must be performed in the U.S.
• Candidate must have lived in the U.S. for three (3) full years out of the last five (5) years.
• 3+ years of Databricks platform expertise at SME-level proficiency, including Databricks, Delta Lake, and Delta Sharing.
• Deep experience with distributed computing using Apache Spark.
• Knowledge of Spark runtime internals and optimization.
• Ability to design and deploy performant end-to-end data architectures.
• 4+ years of ETL pipeline development, building robust, scalable data pipelines.
• Databricks certifications (Professional or specialty certifications).
• Hands-on experience with Azure services such as Synapse, Data Factory, or Databricks.
• Familiarity with data visualization tools such as Tableau, Power BI, or similar.
• Machine learning and analytical skills, including:
  • MLOps: working knowledge of ML deployment and operations.
  • Data science methodologies: statistical analysis, modeling, and interpretation.
  • Big data technologies: experience beyond Spark with distributed systems.
• Experience with deployment pipelines, including Git-based version control, CI/CD pipelines, and DevOps practices using Terraform for IaC.

🏖️ Benefits

• Reasonable Accommodations are available, including, but not limited to, for disabled veterans, individuals with disabilities, and individuals with sincerely held religious beliefs, in all phases of the application and employment process.
