
10,000+ employees
Founded 1987
🤖 Artificial Intelligence
🔒 Cybersecurity
Artificial Intelligence • Cybersecurity • Cloud
Stefanini Brasil is a leading provider of digital transformation solutions, offering services that include artificial intelligence, cybersecurity, cloud enablement, and consulting. With over 35 years of experience, the company focuses on integrating innovative technologies to help organizations enhance their operations and customer experiences across various industries. Its expertise extends to sectors such as healthcare, retail, and industrial goods, enabling businesses to optimize processes and drive value through technology.
🕒 May 19
🗣️🇧🇷🇵🇹 Portuguese Required

• Develop ingestion, transformation, and enrichment pipelines for use in AI
• Work with structured and unstructured data (text, PDFs, HTML, audio, and others)
• Implement chunking, embeddings, and vector indexing processes
• Build and maintain datasets for the corporate knowledge base
• Develop pipelines using Databricks (Spark / PySpark)
• Work with the medallion architecture (bronze, silver, and gold)
• Integrate data with vector databases (Azure AI Search, pgvector, and others)
• Ensure performance, scalability, and reliability of pipelines
• Apply data quality best practices (completeness, consistency, and versioning)
• Implement policies for data refresh, retention, and deletion
• Ensure traceability and auditability of data used by models
• Collaborate with AI/ML teams on data preparation and optimization
• Support information retrieval strategies (RAG)
• Optimize data to improve the relevance and accuracy of model responses

• Bachelor's degree in Information Technology, Engineering, Information Systems, or a related field
• Solid experience in data engineering
• Proficiency in Python and/or PySpark
• Experience with Databricks and Spark (batch and/or streaming)
• Experience building data pipelines (ETL/ELT)
• Data modeling for Data Lake / Lakehouse architectures
• Experience working with unstructured data (documents, text, etc.)
• API integration and consumption
• Ability to work autonomously in building pipelines
• Knowledge of modern data architectures
• Experience processing and preparing data for AI
• Experience working in complex environments with multiple integrations

• Meal allowance or meal voucher
• Discounts on courses, university programs, and language schools
• Stefanini Academy, a platform offering free, up-to-date online courses with certificates
• Mentoring
• Benefits club for consultations and medical exams
• Health insurance
• Dental insurance
• Perks and discount club at leading establishments
• Travel club
• Pet care benefits