Senior/Lead Data Engineer – AI-Native Aftermarket Platform

🇨🇴 Colombia – Remote

⏰ Full Time

🟠 Senior

🚰 Data Engineer

Truelogic Software

501 - 1000 employees

Founded 2004

☁️ SaaS

🤝 B2B

🏢 Enterprise

Truelogic Software is a nearshore software development company specializing in agile staff augmentation. It provides custom outsourced software development through a team of highly skilled engineers from Latin America. Truelogic partners with both startups and Fortune 500 companies, aligning with its clients' time zones and delivering high-quality outcomes through collaboration and responsiveness. With a presence in over 25 countries, Truelogic emphasizes remote work for a better quality of life, and its engineers have experience across various industries, having delivered a wide range of successful projects globally.

📋 Description

• Design and build robust, idempotent data pipelines from scratch utilizing a modern data stack.
• Design star and snowflake schemas, writing precise, grain-aware SQL to construct scalable data marts.
• Write production-grade, unit-tested Python code at the module level, adhering to strong engineering disciplines such as type hinting and testing.
• Build and test dbt models across staging, intermediate, and mart layers while managing overall project structure.
• Author and deploy jobs using Databricks Asset Bundles (DAB) following documented architectural patterns.
• Implement rigorous data quality checks at source, intermediate, and destination layers to prevent silent drops of nulls or duplicates.
• Maintain data governance through comprehensive dbt tests and strict documentation-at-merge-time discipline.
• Operate securely within a multi-repository architecture, utilizing service principals and ensuring zero personal credentials in production deployments.
• Run cross-repository exposure checks prior to merging schema-breaking changes.
• Own data pipelines end-to-end, making key technical design decisions and mentoring mid-level engineers through substantive code reviews.
• Define overarching technical direction across core data systems, including modeling standards, branching strategies, observability thresholds, and secret management policies.
• Act as a technical leader to unblock the team and actively participate in hiring panels to scale the engineering organization.

🎯 Requirements

• Expertise in SQL and dimensional modeling methodologies, including medallion architecture, SCDs, and grain management.
• Proven ability to design idempotent pipelines utilizing incremental, checkpoint, and replaceWhere strategies.
• Extensive experience with production-grade Python engineering, including type hints, pytest, and ruff.
• Strong capability to diagnose and resolve failing Spark / PySpark jobs utilizing tools like Spark UI.
• Deep understanding of Delta Lake features such as MERGE, OPTIMIZE, Z-ORDER, and time travel.
• Hands-on expertise with dbt, including models, tests, and exposures.
• Experience authoring and deploying jobs using Databricks Asset Bundles (DAB) and operating within a Unity Catalog environment.
• Commitment to data quality via pre-write asserts, schema checks, and maintaining dbt relationship and uniqueness tests.
• Strong adherence to disciplined Git workflows, conventional commits, and strict documentation practices.
• Experience provisioning and utilizing Service Principals, GitHub environment secrets, and secret management tools like Azure Key Vault or Databricks secret scopes.
• Strong written technical communication skills for PR descriptions and runbooks, with the ability to translate pipeline work into business metrics.
• Proven decision-making abilities to navigate ambiguity and balance trade-offs between cost, latency, and reliability.
• Experience leading technical initiatives, establishing architectural standards, and contributing to interview rubrics is preferred.
• Experience reading or modifying Azure Data Factory (ADF) pipelines and familiarity with Azure Data Lake storage is highly preferred.
• Familiarity with dbt observability tools, such as Elementary, is a plus.
• Awareness of PII detection and masking best practices is preferred.
• Experience with multi-tenant configuration patterns to onboard new tenants with zero code changes is a strong plus.
• Proficiency in reading and editing GitHub Actions workflows for Databricks deployment is preferred.
• Ability to make cost-aware compute decisions, selecting the appropriate cluster shape per workload, is a plus.
• Proficiency in AI-assisted development tools like Claude Code for daily work and code review is preferred.
• Experience writing incident post-mortems and coordinating feature handovers with Data Science teams is a plus.

🏖️ Benefits

• 100% Remote Work: Enjoy the freedom to work from the location that helps you thrive. All it takes is a laptop and a reliable internet connection.
• Highly Competitive USD Pay: Earn excellent, market-leading compensation in USD that goes beyond typical market offerings.
• Paid Time Off: We value your well-being. Our paid time off policies ensure you have the chance to unwind and recharge when needed.
• Work with Autonomy: Enjoy the freedom to manage your time as long as the work gets done. Focus on results, not the clock.
• Work with Top American Companies: Grow your expertise working on innovative, high-impact projects with industry-leading U.S. companies.
