Data and Machine Learning Engineer

Job not on LinkedIn

🔥 0 minutes ago

🇨🇴 Colombia – Remote

⏰ Full Time

🟠 Senior

🔴 Lead

🚰 Data Engineer

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of IDT BY INDET GROUP

IDT BY INDET GROUP

201 - 500 employees

📡 Telecommunications

💸 Finance

🤝 B2B

Telecommunications • Finance • B2B

IDT BY INDET GROUP is a global company that develops communication and payment services, aiming to unite families and grow businesses worldwide. Founded in 1990, IDT has grown to become the third-largest voice carrier with operations in over 20 countries and an annual revenue of $1. 5 billion. The company is known for its variety of communication tools and financial services, including its flagship brand BOSS Revolution, which offers international calling, mobile top-up, and money transfer services. IDT also provides cloud-based VoIP solutions through Net2Phone, as well as sales management systems and POS equipment for small and medium-sized businesses via National Retail Solutions. Additionally, IDT Express offers voice termination, DIDs, and SMS services globally, while Awards2Go provides custom prepaid Visa cards. Based in Newark, New Jersey, IDT is committed to innovation and quality, serving a diverse, multicultural community.

📋 Description

• Design, develop, and maintain scalable data pipelines to support ingestion, transformation, and delivery into centralized feature stores, model-training workflows, and real-time inference services. • Build and optimize workflows for extracting, storing, and retrieving semantic representations of unstructured data to enable advanced search and retrieval patterns. • Architect and implement lightweight analytics and dashboarding solutions that deliver natural language query experience and AI-backed insights. • Define and execute processes for managing prompt engineering techniques, orchestration flows, and model fine-tuning routines to power conversational interfaces. • Oversee vector data stores and develop efficient indexing methodologies to support retrieval-augmented generation (RAG) workflows. • Partner with data stakeholders to gather requirements for language-model initiatives and translate into scalable solutions. • Create and maintain comprehensive documentation for all data processes, workflows and model deployment routines. • Should be willing to stay informed and learn emerging methodologies in data engineering, MLOps and LLM operations.

🎯 Requirements

• 8+ years of experience as a Data Engineer with 2+ years focused on MLOps. • Excellent English communication skills. • Effective oral and written communication skills with BI team and user community. • Demonstrated experience in utilizing python for data engineering tasks, including transformation, advanced data manipulation, and large-scale data processing. • Deep understanding of vector databases and RAG architectures, and how they drive semantic retrieval workflows. • Skilled at integrating open-source LLM frameworks into data engineering workflows for end-to-end model training, customization, and scalable inference. • Experience with cloud platforms like AWS or Azure Machine Learning for managed LLM deployments. • Hands-on experience with big data technologies including Apache Spark, Hadoop, and Kafka for distributed processing and real-time data ingestion. • Experience designing complex data pipelines extracting data from RDBMS, JSON, API and Flat file sources. • Demonstrated skills in SQL and PLSQL programming, with advanced mastery in Business Intelligence and data warehouse methodologies, along with hands-on experience in one or more relational database systems and cloud-based database services such as Snowflake/Redshift. • Understanding of software engineering principles and skills working on Unix/Linux/Windows Operating systems, and experience with Agile methodologies. • Proficiency in version control systems, with experience in managing code repositories, branching, merging, and collaborating within a distributed development environment. • Interest in business operations and comprehensive understanding of how robust BI systems drive corporate profitability by enabling data-driven decision-making and strategic insights.

Apply Now

Similar Jobs

🕒 3 days ago

Goods & Services

201 - 500

🤖 Artificial Intelligence

🏢 Enterprise

🤝 B2B

Senior Data Engineer at Goods & Services, transforming raw data into governed data marts. Leading the development of scalable ETL/ELT pipelines and ensuring data consistency.

🇨🇴 Colombia – Remote

⏰ Full Time

🟠 Senior

🚰 Data Engineer

AWS

Cloud

ETL

PySpark

Python

Spark

SQL

🕒 June 19

Sofka Technologies

1001 - 5000

🤝 B2B

🏢 Enterprise

🤖 Artificial Intelligence

AWS Databricks Platform Administrator managing and optimizing data solutions in a fully remote LATAM environment. Collaborating with teams to ensure operational efficiency and data governance.

🇨🇴 Colombia – Remote

⏰ Full Time

🟠 Senior

🔴 Lead

🚰 Data Engineer

🗣️🇪🇸 Spanish Required

AWS

EC2

Python

Spark

SQL

Terraform

🕒 June 17

BlueCloud

501 - 1000

🤖 Artificial Intelligence

Senior Data Engineer designing and implementing Snowflake data solutions for enterprise organizations. Leading the architecture of scalable data pipelines and enforcing data governance standards.

🇨🇴 Colombia – Remote

⏰ Full Time

🟠 Senior

🚰 Data Engineer

Airflow

AWS

Azure

Cloud

ETL

Google Cloud Platform

Python

SQL

Vault

🕒 June 12

Blend360

501 - 1000

🤖 Artificial Intelligence

🏢 Enterprise

Senior Data Engineer at Blend responsible for designing and optimizing data pipelines. Collaborating with teams to ensure data accuracy and quality for enterprise initiatives.

🇨🇴 Colombia – Remote

💰 $100M Private Equity Round on 2022-08

⏰ Full Time

🟠 Senior

🚰 Data Engineer

ETL

Python

SQL

🕒 June 12

Jalasoft

1001 - 5000

☁️ SaaS

📚 Education

Senior Data Engineer designing and operating cloud data infrastructures for AI initiatives. Building data lakes on AWS and real-time pipelines for RAG systems.

🇨🇴 Colombia – Remote

⏰ Full Time

🟠 Senior

🚰 Data Engineer

Amazon Redshift

AWS

Cloud

Distributed Systems

ElasticSearch

ETL

Java

JavaScript

Node.js

Postgres

Python

.NET