Senior Data Engineer – Databricks SME

51 - 200 employees

Founded 1986

🎯 Recruiter

🤝 B2B

Recruitment • B2B • Staffing

A. C. Coy Company is a specialized recruitment agency based in Pittsburgh, PA, that focuses on connecting professionals with job opportunities in IT, engineering, finance, and sales. Since its inception in 1986, the company has been committed to acting in the best interests of clients and candidates, offering contract, contract-to-hire, and permanent placement services. A. C. Coy aims to make successful career designs possible for job seekers while providing top talent to businesses across the Tri-State area and beyond.

🕒 May 8

🌲 North Carolina – Remote

⏳ Contract/Temporary

🟠 Senior

🚰 Data Engineer

Apache

Azure

Cloud

ETL

Hadoop

Kafka

Oracle

Perl

Python

Ruby

Spark

SQL

Tableau

📋 Description

• Design, develop, and maintain scalable data ingestion pipelines to onboard structured, semi-structured, and unstructured data from batch and streaming sources (e.g., APIs, databases, flat files, message queues) into the Azure/Databricks environment.
• Implement de-duplication strategies across large-scale datasets using deterministic and probabilistic matching techniques to ensure data integrity and reduce redundancy within the Data Lake.
• Develop and enforce data tagging frameworks to classify, label, and annotate datasets with appropriate metadata (e.g., sensitivity, source, domain, lineage) to support data governance, discoverability, and compliance requirements.
• Assist with operationalizing deployments and support of Cloud services for ETL operations. This includes standardizing and automating processes and workflows, creating documentation and knowledge articles, and assisting Operations staff who have limited Cloud experience.
• Deliver written and oral presentations to high-level CIO management on the status of current efforts.
• Possess skills and experience related to business management, systems engineering, operations research, and management engineering.
• Typically specialize in a particular technology or business application.
• Keep abreast of technological developments and industry trends.
• Assist with deployment, configuration, and management of the Azure Cloud environment.
• Assist with migration of existing ETL jobs into the Azure/Databricks cloud environment.
• Share optimizations and efficiencies with the larger team and management.
• Automate solutions to repetitive problems and tasks.

🎯 Requirements

• A degree from an accredited College/University in the applicable field of services is required.
• 13+ years of overall IT experience.
• 5+ years of demonstrated experience designing and implementing data ingestion pipelines using tools such as Azure Data Factory, Apache Kafka, Apache NiFi, Spark Structured Streaming, or equivalent technologies.
• 5+ years of experience applying de-duplication techniques at scale, including record linkage, fuzzy matching, and entity resolution across structured and unstructured datasets.
• 5+ years of hands-on experience with data tagging and metadata management, including the use of tagging schemas, data catalogs (e.g., Azure Purview, Apache Atlas), and automated classification tools to support data governance and lineage tracking.
• 5+ years of demonstrated experience working with unstructured data.
• 2+ years of experience using Databricks or other Spark-based platforms.
• Fluency in at least one scripting language (Python, Perl, Ruby, or equivalent).
• Experience with one or more of the following products and technologies is a plus: SAS, C++, Hadoop, SQL Database/Coding, Teradata, Oracle, Amazon S3, Apache Spark, Machine Learning, Natural Language Processing, and visualization tools such as Tableau, Strategy, and QLIK.
• Integration of Git in continuous deployment and experience with DevOps monitoring tools is a plus.
• Familiarity with Cloud Operations support in Azure is a plus.
• Excellent communication skills.
• Must be able to obtain a Position of Public Trust Clearance.
• Must be a US Citizen or have US Permanent Residence status (Green Card).
• Must have resided in the US for the last 5 years and not have traveled outside the US for a combined total of 6 months or more in the last 5 years.

🏖️ Benefits

• Health insurance
• Retirement plans
• Paid time off
