Data Engineering Architect

🕒 March 12

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Juniper Square

Juniper Square

201 - 500 employees

💸 Finance

🏠 Real Estate

☁️ SaaS

💰 $75M Series C on 2019-11

Finance • Real Estate • SaaS

Juniper Square is a company that provides a comprehensive platform and solutions tailored for private investment partnerships. Founded in 2014, the company focuses on enabling seamless connection and communication between General Partners (GPs) and Limited Partners (LPs) throughout the entire investment lifecycle. Juniper Square's technology is purpose-built to support commercial real estate, private equity, and venture capital firms of all sizes. The platform offers services such as fund administration, fundraising, investor management, compliance, and investor reporting, all aimed at enhancing transparency, data governance, and the overall investor experience.

📋 Description

• Define and own the end-to-end data and analytics architecture strategy • Design scalable batch, streaming, and real-time data systems • Establish standards for data modeling, semantic layers, and reporting • Lead architecture reviews and technical decision-making • Drive adoption of modern architectures (lakehouse, data mesh, real-time analytics) • Design and prototype critical data platform components • Write production-quality code for complex or high-impact areas • Review schemas, transformations, dashboards, and analytics models • Troubleshoot performance and reliability issues across pipelines and queries • Optimize workloads for latency, concurrency, and cost • Design and architect a scalable data platform supporting ingestion, transformation, and delivery of both structured and unstructured data across batch and real-time pipelines. • Design a "Data for Agents" strategy, ensuring our data warehouse is structured with the semantic layers and metadata necessary for LLMs to navigate it accurately. • Build AI-ready data infrastructure, including vector stores, embedding pipelines, and retrieval systems that power LLM and agentic workflows. • Develop a RAG-ready data architecture that enables trusted enterprise data retrieval with strong lineage, governance, security, and observability. • Create curated data products and reusable APIs that make high-quality datasets easily consumable by applications, analytics platforms, and AI agents. • Enable self-service data access for engineering, analytics, and business teams through standardized models, semantic layers, and platform capabilities. • Partner with AI, product, and engineering teams to support training datasets, feature stores, and production AI inference pipelines. • Build agentic ETL/ELT pipelines that use AI agents to autonomously discover sources and generate transformations. • Ensure reliability, scalability, and resilience of the platform, including high availability, monitoring, and disaster recovery readiness. • Partner with product, finance, business operations, and leadership teams to define analytics needs • Design scalable data models for reporting and advanced analytics • Ensure analytics solutions are performant, trustworthy, and easy to use • Drive adoption of data-driven culture through reliable insights • Define data governance, lineage, cataloging, and metadata standards • Establish data quality frameworks and validation processes • Ensure privacy, compliance, and secure access to sensitive data • Implement role-based access controls and auditability • Mentor senior engineers, analytics engineers, and data scientists • Partner with product, ML, platform, and business teams • Translate business questions into scalable data solutions • Influence roadmaps using data platform and analytics considerations • Act as the executive technical authority for data and analytics • Define SLAs/SLOs for data availability, freshness, and accuracy • Establish monitoring, alerting, and incident response processes • Optimize cloud costs and query performance • Support capacity planning for data growth • Be an evangelist for pragmatic AI adoption. • Help establish a culture of outcome-driven innovation.

🎯 Requirements

• Advanced degree in Computer Science, Engineering, or related field • 10+ years in data engineering, analytics engineering, or data platform roles • Proven experience architecting large-scale data and analytics systems • Strong hands-on experience with modern data stacks in cloud environments • Deep expertise in data modeling for analytics (dimensional, star/snowflake, Data Vault, etc.) • Advanced SQL skills and proficiency in Python, Scala, or Java • Advanced expertise in dimensional data modeling and semantic layers (e.g., dbt, Cube) to provide "agent-readable" context. • Experience with distributed processing frameworks (Spark, Flink, etc.) • Experience building reporting and BI solutions at scale • Strong understanding of both batch and real-time architectures • Hands-on experience with AWS, Azure, or GCP data services • Experience with BI tools (e.g., Looker, Tableau, Power BI, etc.) • Strong understanding of data governance and security best practices • Ability to operate at both executive and deeply technical levels.

🏖️ Benefits

• Health, dental, and vision care for you and your family • Life insurance • Mental wellness coverage • Fertility and growing family support • Flex Time Off in addition to company paid holidays • Paid family leave, medical leave, and bereavement leave policies • Retirement saving plans • Allowance to customize your work and technology setup at home • Annual professional development stipend

Apply Now

Similar Jobs

🕒 March 11

Trumid

51 - 200

💳 Fintech

💸 Finance

☁️ SaaS

Senior Data Engineer building real-time data platforms for pricing algorithms at Trumid. Collaborating with engineers and data scientists to enhance infrastructure reliability and performance.

Cloud

Distributed Systems

Kafka

Python

SQL

🕒 March 11

C the Signs

51 - 200

⚕️ Healthcare Insurance

🤖 Artificial Intelligence

🧬 Biotechnology

Lead Data Engineer architecting and building a healthcare data platform leveraging AI technology. Collaborating with multiple teams to ensure high-quality data ingestion and compliance in healthcare.

Airflow

Amazon Redshift

AWS

BigQuery

Cloud

ETL

Google Cloud Platform

Python

SQL

🕒 March 11

Orbital AI

1 - 10

🤖 Artificial Intelligence

👥 HR Tech

🧘 Wellness

Senior Data Engineer at Orbital Insight designing real-time data platforms for analytics and machine learning. Focusing on streaming data pipelines and large-scale event data systems.

Apache

AWS

Distributed Systems

Java

Kafka

Python

SQL

🕒 March 9

Conduent

10,000+ employees

🤝 B2B

🛍️ eCommerce

🏛️ Government

Technical Lead Engineer providing architectural oversight for the Medicaid Enterprise Data Warehouse. Leading design, data integration, and compliance structure for optimized Medicaid data solutions.

Azure

Cloud

ETL

Informatica

Oracle

SQL

🕒 March 6

The Planet Group

1001 - 5000

🎯 Recruiter

👥 HR Tech

Data Engineer responsible for data modeling and pipeline development at Planet Equity Group. Collaborating with Azure services and optimizing data management processes.

🇺🇸 United States – Remote

💵 $111.6k / year

💰 Private Equity Round on 2018-01

⏰ Full Time

🟡 Mid-level

🟠 Senior

🚰 Data Engineer

Azure

ETL

Spark

SQL