Senior Data Engineer

201 - 500 employees

Founded 2010

💼 Consulting

📣 Marketing

🔌 API

Consulting • Marketing • API

Leega is a leading technology solutions provider in Latin America, specializing in data analytics and cloud solutions. As the first company in the region certified by Google Cloud for Data Analytics, Leega offers a range of services including application development, machine learning, and risk management analytics. The firm partners with major cloud services such as AWS and Microsoft Azure to help businesses enhance their data management and transition effectively to the cloud, ultimately driving digital transformation and innovation.

Senior Data Engineer

Job not on LinkedIn

🕒 June 10

🇧🇷 Brazil – Remote

⏰ Full Time

🟠 Senior

🚰 Data Engineer

🗣️🇧🇷🇵🇹 Portuguese Required

Airflow

Apache

JavaScript

Kafka

PySpark

Python

SQL

Apply Now

Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Leega

201 - 500 employees

Founded 2010

💼 Consulting

📣 Marketing

🔌 API

Consulting • Marketing • API

📋 Description

• You will architect and evolve the datalake that is the company's data nervous system — the foundation that feeds, in real time, the dynamic pricing engine, ML models, and the group's business intelligence. • This is an ownership role: you define the multi-tenant Lakehouse architecture, from streaming to the semantic layer, and are responsible for its reliability, governance, and cost. • Design and evolve the data lake on Apache Iceberg over S3 — well-defined layers, partitioning and compaction, time-travel and support for DELETE/UPDATE for LGPD (Brazilian data protection law). • Build real-time ingestion (Kafka, Flink, CDC with Debezium) with controlled schema evolution (Schema Registry) and delivery guarantees. • Model the transformation layer in dbt and orchestrate batch and quality flows in Airflow, from crawler to backfill. • Maintain metric definitions in Cube.js — the single source that feeds BI and AI agents and ensures consistency across the company. • Operate federated and low-latency OLAP queries over the lake, with cost and access isolation by tenant and performant queries. • Ensure data testing, lineage and cost efficiency, keeping the platform reliable as it scales.

🎯 Requirements

• Strong command of SQL and query optimization in distributed environments (Minimum 5 years). • Python with solid experience in PySpark or distributed processing. • Orchestration (Airflow), ELT and dbt applied at scale (Minimum 4 years). • Streaming (Kafka, Flink) and Lakehouse architectures with Apache Iceberg (Minimum 3 years). • Strong understanding of data governance, quality, and modeling. • Comfortable with AI-assisted development (e.g., Claude Code). • CDC (Debezium) and low-latency OLAP (ClickHouse, Pinot, Trino/Athena). • Semantic layers (Cube.js, dbt) and Data Mesh architectures. • Governance and catalog tools (OpenMetadata, Lake Formation). • Vector databases (Qdrant) and data pipelines for ML.

🏖️ Benefits

• Remote work • Project duration: 6 months, with possibility of extension or conversion to permanent employment.

Apply Now

Similar Jobs

Senior Software Engineer – Data Platform

🕒 June 9

avra

1 - 10

💼 Consulting

💸 Finance

🤝 B2B

Senior Software Engineer developing data products for Avra’s AI infrastructure in a remote-first environment. Collaborating with cross-functional teams to build and maintain data systems and services.

🇧🇷 Brazil – Remote

⏰ Full Time

🟠 Senior

🚰 Data Engineer

🗣️🇧🇷🇵🇹 Portuguese Required

AWS

Cloud

Distributed Systems

Google Cloud Platform

Python

Rust

Data Engineering Specialist, II

🕒 June 9

Experian

10,000+ employees

💼 Consulting

📣 Marketing

📦 Logistics

Data Engineer II at Experian designing and implementing Data Lake architectures. Collaborating on AI and ML solutions for innovative data-driven insights in various industries.

🇧🇷 Brazil – Remote

⏰ Full Time

🟠 Senior

🔴 Lead

🚰 Data Engineer

🗣️🇧🇷🇵🇹 Portuguese Required

Airflow

Apache

PySpark

Python

Scala

Spark

SQL

Terraform

Data Engineering Specialist

🕒 June 8

Localiza&Co

10,000+ employees

🚘 Automotive

📦 Logistics

✈️ Travel

Data Engineer designing and implementing robust data pipelines at Localiza&Co. Utilizing AWS tools to manage and transform data for business insights, enhancing sustainable mobility.

🇧🇷 Brazil – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

🚰 Data Engineer

🗣️🇧🇷🇵🇹 Portuguese Required

Airflow

Apache

AWS

ETL

Python

Spark

SQL

Data Architect – Data Governance

🕒 June 6

Smarthis

51 - 200

💼 Consulting

📦 Logistics

📣 Marketing

Data Architect at Smarthis, focusing on defining efficient data architectures and supporting engineering teams in implementations. Collaborating on data solutions across cloud environments.

🇧🇷 Brazil – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

🚰 Data Engineer

🗣️🇧🇷🇵🇹 Portuguese Required

AWS

Azure

Cloud

Google Cloud Platform