Senior AI Data Engineer

Job not on LinkedIn

🔥 0 minutes ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of TechBiz Global

TechBiz Global

51 - 200 employees

🎯 Recruiter

Recruitment • Software Development • Consulting

TechBiz Global is a leading recruitment and software development company that specializes in connecting businesses with top-tier talent across 28+ countries. They serve clients from startups to major enterprises, providing expertise in hiring key roles in industries such as FinTech. In addition to talent acquisition, TechBiz Global offers comprehensive software development services to turn clients' visions into reality, powering digital transformations with expert engineers. The company also provides strategic CTO consulting services and flexible outstaffing and outsourcing solutions, helping businesses manage growth and optimize technology processes. Recognized as a top IT recruitment and consulting firm, TechBiz Global is dedicated to delivering personalized and innovative solutions to its clients, ensuring they have the tools necessary for success.

📋 Description

• Design, build, and scale robust ETL/ELT pipelines optimized for AI workloads, including RAG, fine-tuning, and batch inference. • Transform unstructured data sources such as PDFs, logs, and transcripts into structured and vectorized formats suitable for LLM consumption. • Maintain and automate the data-to-model lifecycle, ensuring AI knowledge bases remain synchronized with changing business data. • Develop and maintain real-time feature pipelines that support low-latency AI and machine learning applications. • Integrate data platforms with Kafka and other event-driven systems to enable real-time processing and AI-driven responses. • Manage and optimize Feature Stores to ensure consistency between model training and production environments. • Implement automated data quality controls and validation processes to ensure the reliability and accuracy of AI training and inference data. • Establish and maintain data lineage frameworks to provide traceability, auditability, and regulatory compliance across data workflows. • Enforce data security, privacy, and governance standards, including PII protection and compliance with industry regulations. • Manage data movement and synchronization across on-premises systems, cloud platforms, and data warehouses. • Optimize data storage and retrieval strategies for Vector Databases to support high-performance RAG and AI search workloads. • Collaborate with Data Scientists, ML Engineers, Software Engineers, and business stakeholders to deliver scalable AI data solutions.

🎯 Requirements

• 10+ years of experience in Data Engineering or Backend Engineering with a strong focus on data platforms and pipelines. • 2+ years of hands-on experience supporting AI/ML data pipelines, including data preparation for machine learning and generative AI applications. • Expert-level proficiency in Python and SQL; experience with Java or Scala is an advantage. • Strong experience building and maintaining real-time data streaming solutions using Apache Kafka, Flink, or Spark Streaming. • Hands-on experience with modern data orchestration and transformation tools such as Airflow, dbt, and Prefect. • Experience working with Vector Databases and Feature Stores to support AI and machine learning workloads. • Strong knowledge of cloud-based data services on AWS, Azure, or GCP, including services such as Glue, Kinesis, Data Factory, or Dataflow. • Experience deploying and managing data workloads in Kubernetes (K8s) environments. • Proven experience handling sensitive data within regulated industries such as Fintech, Healthcare, or other compliance-driven environments. • Strong understanding of data quality, governance, security, and privacy best practices. • Bachelor's degree in Computer Science, Software Engineering, Information Systems, or a related technical field. Equivalent practical experience will also be considered. • Excellent problem-solving skills and the ability to collaborate effectively with cross-functional engineering, data, and AI teams.

🏖️ Benefits

• Health insurance • Professional development opportunities

Apply Now

Similar Jobs

🕒 June 3

Palta

501 - 1000

🤖 Artificial Intelligence

👥 B2C

🧘 Wellness

Senior Data Engineer at Simple Life building and maintaining innovative data ingestion pipelines for AI-powered health coaching app. Driving improvements in data access and processing efficiencies.

Airflow

AWS

Cloud

Distributed Systems

Python

SQL

Terraform

🕒 April 23

Fundraise Up

51 - 200

🤲 Charity

💳 Fintech

☁️ SaaS

Senior Data Engineer responsible for designing and optimizing data pipelines at Fundraise Up. Collaborating with analytics and engineering teams to ensure data governance and quality.

🗣️🇷🇺 Russian Required

Airflow

Docker

ETL

JavaScript

Kafka

MongoDB

Node.js

Python

TypeScript

Promoted

ennabl

11 - 50

💳 Fintech

☁️ SaaS

Data Analyst role analyzing insurance-related text data and improving ETL pipelines for better data quality. Collaborating across teams to meet business needs at a data-driven company in the insurance industry.

ETL

SQL