Data Engineer, Databricks

Job not on LinkedIn

🕒 March 20

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Horizon Industries, Limited

Horizon Industries, Limited

201 - 500 employees

Founded 1996

🤝 B2B

☁️ SaaS

🔒 Cybersecurity

B2B • SaaS • Cybersecurity

Horizon Industries, Limited is an IT services and management consulting corporation that delivers a full life-cycle of support through business strategy analysis, business system development and deployment, and operational support. The company focuses on digital transformation capabilities by leveraging expertise in Robotic Process Automation (RPA) and Artificial Intelligence (AI), offering tailored solutions to meet the unique needs of its clients. Horizon Industries is committed to quality, using established management principles like CMMI and ISO, and provides services including IT support, program management, and cybersecurity solutions across the U. S. and internationally.

📋 Description

• Build end-to-end implementation of multiple ETL/ELT pipelines, demonstrating efficient data transformation and ingest patterns to move raw data from data producers to an enterprise data ecosystem, with a focus on performance and reliability • Assess and understand the ETL jobs, workflows, BI tools, and reports • Address technical inquiries concerning customization, integration, enterprise architecture and general feature / functionality of data products • Experience in crafting database / data warehouse solutions in the cloud (Preferably AWS, Azure, Alternatively GCP) • Key must have skill sets – Python, SQL, Databricks, AWS Data Services • Experience with message queuing, stream processing, and highly scalable ‘big data’ data stores • Experience manipulating, processing, and extracting value from large, disconnected datasets • Experience manipulating structured and unstructured data for analysis • Experience with data modeling tools and processes • Experience aggregating and transforming data from multiple datasets to create data products • Support an Agile software development lifecycle

🎯 Requirements

• Ability to hold a position of public trust with the US government. • S. in Computer Science or equivalent • 4+ years' experience in data engineering and big data. At least 2 years' of professional services experience interacting directly with clients. • Big data tools: Hadoop, Spark, Kafka, etc. • Relational SQL and NoSQL databases and experience working with relational databases • AWS cloud services: EC2, S3, RDS, Glue, Step Functions, Lamda, EMR, DynamoDB, DocumentDB, Redshift, Aurora, Athena • Data Platforms: Databricks • Data streaming systems: Batch,Kafka, Storm, Spark-Streaming, etc. • Languages: Python, R, Scala, Go • Ability to inspect existing data pipelines, discern their purpose and functionality, and re-implement them efficiently in Databricks. • Extensive knowledge of data warehousing concepts and hands-on experience deploying pipelines using Databricks **a must • Data modeling and database design skills and knowledge of version control • Excellent verbal and written communication skills • Experience architecting scalable and fault-tolerant data solutions across Azure, AWS, and Databricks • Databricks Data Engineer Professional certification a plus. • Preference for candidates with Databricks Professional certifications

🏖️ Benefits

• A comprehensive benefits package including healthcare (medical, dental, vision and disability) • a 401k program where you are 100% vested from day one with an employer match after 90 days. • an Educational Assistance program. • a Student Loan Repayment Program • Gym Reimbursement Program. • Paid Time off • Dynamics, passionate, multi-disciplinary team of creative minds to work with, and many more.

Apply Now

Similar Jobs

🕒 March 20

SHI International Corp.

5001 - 10000

🤝 B2B

🔧 Hardware

☁️ SaaS

Senior Data Engineer involved in data engineering and ETL workflows for SHI. Collaborating with stakeholders and establishing data practices within the organization.

AWS

Azure

Cloud

ETL

Google Cloud Platform

Python

SQL

🕒 March 20

Veeva Systems

1001 - 5000

☁️ SaaS

⚕️ Healthcare Insurance

💊 Pharmaceuticals

Senior Data Engineer leading design and implementation of Lakehouse environment at Veeva Systems. Supporting life sciences companies by providing scalable solutions to enhance data utilization and analytics.

AWS

Cloud

ETL

Kafka

Kubernetes

Spark

🕒 March 20

Fuze Health

1001 - 5000

☁️ SaaS

🤝 B2B

💊 Pharmaceuticals

Senior Manager leading data engineering at Fuze Health. Overseeing the data platform strategy and team development for healthcare solutions.

AWS

Azure

Cloud

Google Cloud Platform

🕒 March 20

CRB

1001 - 5000

🤝 B2B

☁️ SaaS

Senior Data Engineer delivering on data and business intelligence initiatives for the life sciences and food industries. Collaborating on large-scale projects focused on data architecture and analytics.

Azure

ERP

ETL

Python

Spark

SQL

SSIS

🕒 March 20

CRB

1001 - 5000

🤝 B2B

☁️ SaaS

Senior Data Engineer working on data initiatives for CRB. Responsible for ETL processes, data modeling, and mentorship.

Azure

ERP

ETL

Python

Spark

SQL

SSIS