Senior Data Engineer – Agentic AI Engineering

🔥 16 hours ago

Boomi

1001 - 5000 employees

Founded 2000

☁️ SaaS

🔌 API

🏢 Enterprise

Boomi is a leading provider of intelligent integration and automation solutions, empowering organizations to streamline business processes and accelerate digital transformation. With a strong focus on integration platform as a service (iPaaS), Boomi enables the secure deployment and management of APIs, integration of applications and data, and automation of workflows using AI-powered solutions. Trusted by over 20,000 customers, Boomi's platform supports a wide array of industries including manufacturing, healthcare, financial services, and more. Their extensive library of pre-built connectors, user-friendly interface, and robust customer support foster seamless integration and connectivity across cloud and on-premise environments. Recognized for its innovation and excellence, Boomi transforms complex integration challenges into manageable solutions, contributing significantly to efficiency and productivity in business operations.

📋 Description

• Architect and build scalable, secure, and observable data infrastructure to power LLM-based agents, multi-agent systems, and tool-using AI workflows.
• Design and operate robust batch and real-time data pipelines supporting embeddings, RAG systems, and agent memory frameworks.
• Develop and manage vector database solutions to enable low-latency retrieval and contextual intelligence for AI applications.
• Build data frameworks for training, evaluation, benchmarking, and continuous improvement of agentic AI systems.
• Implement strong data governance, quality controls, lineage tracking, and PII/security compliance across AI data platforms.
• Collaborate with AI/ML, platform, and DevOps teams to productionize experimental AI prototypes into enterprise-grade solutions.
• Optimize data systems for performance, scalability, reliability, and cost efficiency across cloud environments (AWS, Azure, or GCP).

🎯 Requirements

• 5+ years of experience building and operating large-scale data platforms as a Data Engineer.
• Strong programming expertise in Python and SQL for developing scalable and efficient data solutions.
• Hands-on experience designing batch and real-time data pipelines, including streaming systems like Kafka or Kinesis.
• Experience with modern data platforms and cloud environments (AWS, Azure, or GCP), including tools like Snowflake.
• Strong understanding of LLM/AI data workflows, including embeddings, RAG pipelines, evaluation datasets, and vector databases (Pinecone, Milvus).
• Experience with DataOps/MLOps tools such as Airflow, dbt, Lavender, and MLflow for orchestration and lifecycle management.
• Strong knowledge of data quality, governance, and security, including PII handling, access controls, lineage, and ensuring data reliability.

🏖️ Benefits

• An overview of our benefits can be found here.
