Mid-Level Data Engineer

Job not on LinkedIn

🔥 0 minutes ago

🗣️🇧🇷🇵🇹 Portuguese Required

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Youx Group

Youx Group

201 - 500 employees

🤖 Artificial Intelligence

🛍️ eCommerce

🔬 Science

Artificial Intelligence • eCommerce • Science

Youx Group is a technology and innovation company that creates intelligent solutions to enhance public management and promote sustainable practices. They specialize in the development of intelligent virtual agents and data-driven solutions for environmental regulation and land management across various sectors. With over 30 years of expertise, Youx Group focuses on transforming ideas into effective technological innovations that impact businesses and communities.

📋 Description

• Pipeline Development: Design, build, test and maintain scalable data pipelines (batch and streaming) and ETL/ELT processes. • AI Infrastructure: Develop and maintain data pipelines focused on the Machine Learning lifecycle, integrating structured and unstructured data. • Quality and Governance: Ensure data quality, integrity and security by applying governance and data curation practices for use in predictive models and large language models (LLMs). • Performance Optimization: Monitor data flow performance and optimize complex queries to reduce costs and processing time. • Collaboration with AI Teams: Work closely with Data Scientists and Machine Learning Engineers to understand requirements and enable large-scale data consumption.

🎯 Requirements

• 2–4 years of proven experience working as a Data Engineer. • Strong SQL skills (modeling, optimization and processing) and Python (data manipulation with Pandas, PySpark, etc.). • Hands-on experience with cloud platforms (AWS, GCP or Azure) and Data Warehouse services (BigQuery, Redshift or Snowflake). • Practical experience structuring unstructured data (text, PDFs, images) and integrating with vector databases (such as Pinecone, Milvus, Chroma, pgvector or Weaviate) to support semantic search and RAG (Retrieval-Augmented Generation) systems. • Experience with workflow orchestrators (preferably Apache Airflow). • Familiarity with relational and NoSQL databases. • Experience with APIs and integrating diverse systems. • Familiarity with natural language processing (NLP) concepts and embeddings. • Assertive Communication: Ability to interact with business and technical teams and explain technological limitations and possibilities clearly to non-technical stakeholders. • Critical Thinking and Business Awareness: Focus on addressing root causes of structural problems and prioritizing tasks that deliver the greatest value and cost efficiency to the company. • Proactivity/Autonomy and Ownership: Take ownership of pipelines, anticipate failures, actively propose improvements and document architectural decisions. • Collaborative Spirit: Empathy for data consumers’ needs and a willingness to share knowledge with the team. • Adaptability: Resilience to handle scope changes, new data sources or technology evolution without losing focus on delivery.

🏖️ Benefits

• Care for your health: Medical plan, Dental plan, Telemedicine and Life Insurance. • Customizable multi-benefit program (Flash). • Rest is essential: Paid time off. • Celebrate your day: Day off on your birthday! • We offer Gympass to support a healthy routine. • Autonomy and flexibility. • Workplace exercise and Quality of Life initiatives. • Training and development program, Academia X. • Start your self-awareness journey: Profiler and behavioral mapping.

Apply Now

Similar Jobs

🕒 4 days ago

Domo Inovação

51 - 200

🏦 Banking

🏢 Enterprise

Data Engineer supporting the definition of IT infrastructure and data requirements for a major Brazilian bank. Collaborates with various teams to implement complex data architectures.

🗣️🇧🇷🇵🇹 Portuguese Required

ETL

Kafka

NoSQL

Numpy

Pandas

PySpark

Python

Spark

SQL

🕒 June 18

FCamara Consulting & Training

1001 - 5000

🛍️ eCommerce

🤖 Artificial Intelligence

Engenheiro de Dados Sênior na FCamara atuando em projetos de Data & AI. Desenvolvimento de pipelines de dados em ambiente cloud utilizando tecnologias de ponta.

🗣️🇧🇷🇵🇹 Portuguese Required

Airflow

Apache

BigQuery

Cloud

Docker

Google Cloud Platform

Python

🕒 June 18

Compass

10,000+ employees

🏠 Real Estate

📱 Media

Data Architect focusing on AWS and Databricks for comprehensive cloud migrations and data architecture evolution at Compass UOL.

🗣️🇧🇷🇵🇹 Portuguese Required

Airflow

AWS

Azure

Cloud

ETL

NoSQL

Python

SQL

Unity

🕒 June 8

Extractta

201 - 500

Engenheiro(a) de Dados Pleno na Extractta, desenvolvendo soluções de dados para projetos estratégicos e escaláveis. Atuando em engenharia de dados com foco em pipelines, qualidade e governança.

🗣️🇧🇷🇵🇹 Portuguese Required

Airflow

Apache

AWS

Cloud

Kafka

Kubernetes

PySpark

SQL

🕒 June 5

SysMap Solutions

1001 - 5000

Data Architect developing scalable data models for analytics and business transformation at Triggo.ai. Collaborating with modern data architecture and business requirements.

🗣️🇧🇷🇵🇹 Portuguese Required

BigQuery

Cloud

Google Cloud Platform

SQL

Vault