Cloud Data Architect, AI Experience

1001 - 5000 employees

💼 Consulting

🏥 Healthcare

🛡️ Insurance

💰 Private Equity Round on 2021-10

Consulting • Healthcare • Insurance

3Pillar Global is a modern application strategy, design, and engineering firm that specializes in delivering strategic software development initiatives for various industries. They offer a range of services, including application technology strategy, digital product engineering, data and analytics, and artificial intelligence development. 3Pillar Global focuses on helping organizations transform their bold ideas into breakthrough solutions by leveraging cutting-edge technologies such as generative and multimodal AI. They work with partners and clients across multiple sectors, including healthcare, financial services, insurance, media, and information services, to solve complex technology challenges and deliver high-performing results.

Cloud Data Architect, AI Experience

Job not on LinkedIn

🕒 June 12

🇮🇳 India – Remote

⏰ Full Time

🟠 Senior

🔴 Lead

🚰 Data Engineer

AWS

Azure

Cloud

Cyber Security

ETL

Kafka

Neo4j

PySpark

Spark

Apply Now

Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

3Pillar Global

1001 - 5000 employees

💼 Consulting

🏥 Healthcare

🛡️ Insurance

💰 Private Equity Round on 2021-10

Consulting • Healthcare • Insurance

📋 Description

• Architect and own the enterprise AI data platform — the unified, governed layer that ingests, transforms, stores, and serves all data consumed by AI systems across the organisation. • Design multi-domain data models (lakehouse, data mesh, event-driven) that are structured from day one to serve AI workloads: clean lineage, versioned schemas, well-documented contracts, and low-latency serving APIs. • Own the full data stack: real-time streaming (Kafka, Spark Structured Streaming), batch processing (Databricks, PySpark, Delta Lake), cloud storage and compute (AWS, Azure), and data quality /metadata management. • Ensure this platform is the single, authoritative data source for all downstream consumers — conversational AI, dashboard assistants, autonomous agents, ML models, and reporting — eliminating data silos and conflicting truths. • Drive modernisation of legacy pipelines (on-prem ETL, batch DWH) to cloud-native, AI-ready architectures with measurable improvements in cost, latency, and delivery velocity. • Design the semantic layer that sits above raw data — business-aligned ontologies, entity relationships, domain taxonomies, and knowledge graphs — so AI systems understand context, not just tokens. • Build and maintain knowledge graphs (Neo4j or equivalent) that capture relationships between business entities, policies, KPIs, hierarchies, and domain rules — enabling structured reasoning alongside unstructured retrieval. • Define and govern a feature store and semantic data contracts that serve both classical ML models and LLM-based applications from a single, well-versioned, trusted source. • Own metadata management, data lineage, and audit trails across the semantic layer — ensuring every AI system can trace its outputs back to source data with full accountability. • Design and enforce a comprehensive data governance model that governs access for both human users and AI agents — with role-based access control (RBAC), attribute-based policies, and agent-specific permission scopes that prevent privilege escalation.

🎯 Requirements

• 15+ years of hands-on data engineering and architecture experience, with 3–5+ years building production AI/ML and LLM-era data infrastructure. • Proven experience designing enterprise-scale AI data platforms that serve multiple AI consumers — not just one application or pipeline. • Deep expertise in lakehouse and data mesh architectures: Databricks, Delta Lake, PySpark, Kafka, Spark Structured Streaming, cloud-native data services (AWS, Azure). • Hands-on experience with vector stores, semantic models, knowledge graphs, and retrieval infrastructure in production environments. • Working knowledge of LLMOps: model serving pipelines, MLflow, CI/CD for AI, automated evaluation, and production monitoring. • Strong background in data governance, security, and compliance in regulated industries (financial services, payments, cybersecurity, healthcare). • Experience defining data access controls for AI agents and automated systems — not just human users.

🏖️ Benefits

• Health insurance • Flexible work hours • Professional development opportunities

Apply Now

Similar Jobs

Data Engineer – Azure

🕒 June 11

Sikich

1001 - 5000

🛡️ Insurance

📦 Logistics

📣 Marketing

Data Engineer at Sikich optimizing data solutions using Microsoft platforms. Responsible for building robust data pipelines and delivering insight-driven analytics.

🇮🇳 India – Remote

💰 Private Equity Round on 2024-05

⏰ Full Time

🟡 Mid-level

🟠 Senior

🚰 Data Engineer

Azure

Kafka

Python

Scala

Spark

SQL

Data Engineer II

🕒 June 11

NPS Prism

201 - 500

💼 Consulting

📣 Marketing

📦 Logistics

Data Engineer II responsible for developing ETL/ELT workflows and managing data lakes for NPS Prism. Collaborating with teams to design data solutions on cloud platforms like Azure and AWS.

🇮🇳 India – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

🚰 Data Engineer

AWS

Azure

Cloud

ETL

PySpark

Python

SQL

Tableau

Data Engineer – L2

🕒 June 6

Forbes Advisor

201 - 500

🛡️ Insurance

💼 Consulting

✈️ Travel

Data Engineer building and maintaining data pipelines for marketing analytics and support across business teams. Contributing to data ingestion and modelling from various platforms with a focus on Meta Ads.

🇮🇳 India – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

🚰 Data Engineer

Airflow

BigQuery

Cloud

ETL

Microservices

Python

SQL

Data Engineer, L3

🕒 June 6

Forbes Advisor

201 - 500

🛡️ Insurance

💼 Consulting

✈️ Travel

Data Engineer (L3) designing scalable data architecture and robust data pipelines for social marketing. Collaborate in a fintech initiative providing insights on personal finance and growth marketing.

🇮🇳 India – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

🚰 Data Engineer

Airflow

BigQuery

Cloud

ETL

Microservices

Python

SQL

D365 Customer Insight Data Engineer

🕒 June 5

DysrupIT

51 - 200

🏢 Enterprise

☁️ SaaS

🔒 Cybersecurity

Data Engineer managing D365 Customer Insights and data pipelines. Triage issues, validate data fitness, and collaborate on monitoring and observations.

🇮🇳 India – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

🚰 Data Engineer

Azure

Python

SQL

.NET