Data Engineer

March 24

Centific

Artificial Intelligence • eCommerce • Enterprise

Centific provides Zero distance innovation™ solutions for Generative AI (GenAI) creators and industries. The company focuses on rapid, safe, and scalable AI deployment, offering high-quality data and models to drive business impact through accelerated deployments. With expertise in AI and data science, Centific is positioned as a leader in data annotation and labeling for AI/ML, recognized by the Everest Group’s 2024 assessments. Its AI platforms support enterprise-ready applications, particularly improving search, personalization, fraud detection, and customer engagement in eCommerce environments. Centific serves a wide array of industries with foundational and frontier AI data solutions, fostering partnerships with leading model providers to advance sustainable GenAI models and applications.

📋 Description

• Develop and optimize PySpark-based ETL pipelines for processing large-scale data.
• Develop, test, and maintain robust Python applications.
• Collaborate with data engineers, analysts, and other developers to design and implement data solutions.
• Work with Azure Data Lake, Azure Databricks, and Azure Data Factory to build scalable data solutions.
• Implement data classification, policy enforcement, and metadata extraction within Purview.
• Collaborate with data engineers and business teams to ensure smooth data flow across systems.
• Troubleshoot performance bottlenecks in Spark jobs and improve data pipeline efficiency.
• Ensure data security, compliance, and quality following best practices.

🎯 Requirements

• 3+ years of experience in Python and PySpark for big data processing.
• Proven experience as a Python Developer or similar role.
• Strong experience with Azure Data Services (Data Lake, Data Factory).
• Excellent problem-solving skills and ability to work in an agile environment.
• Excellent communication and teamwork abilities.
