Senior Data Engineer

1001 - 5000 employees

Founded 2002

💼 Consulting

🏥 Healthcare

🚘 Automotive

Sigma Software Group is a multinational company, established in 2002, that specializes in providing high-quality software development, graphic design, testing, and support services. The company focuses on delivering solutions across various industries such as automotive, telecommunications, aviation, advertising, gaming, banking, real estate, and healthcare. Sigma Software values professional growth, offers remote work opportunities worldwide, and caters to world-renowned clients like AstraZeneca, Scania, and SAS. The company emphasizes a culture of continuous education, mentorship, and flexible work environments, making it a preferred workplace for IT specialists aiming to work on complex solutions utilizing cutting-edge technologies. Sigma Software is committed to innovative solutions and engineering the future while also contributing to social causes such as charitable work in Ukraine.

🕒 June 18

🇵🇱 Poland – Remote

⏰ Full Time

🟠 Senior

🚰 Data Engineer

Airflow

Apache

AWS

Azure

Cloud

ETL

Google Cloud Platform

Kafka

Microservices

PySpark

Python

Spark

SQL

Terraform

Unity Catalog

Data Vault

📋 Description

• Design and build scalable, cloud-native data platforms from greenfield to production
• Implement near-real-time ingestion pipelines using event-driven patterns
• Define and enforce platform standards, including Data Lake / Lakehouse principles, medallion architecture, and data contracts
• Refactor and optimise existing Spark and PySpark scripts for performance and maintainability
• Introduce best practices for code quality, testing, and CI/CD across data pipelines
• Drive adoption of AI tooling and agentic workflows within the data engineering team
• Ensure data quality, observability, and reliability across all pipelines and platforms
• Develop self-service tooling and microservices to simplify platform usage for other teams

🎯 Requirements

• 5+ years of professional experience in Data Engineering
• Strong Python and SQL development skills for pipeline development and optimisation
• Proficiency in Apache Spark / PySpark, including query optimisation and performance tuning
• Hands-on experience with Databricks (preferred) or Snowflake
• Experience with at least one major cloud provider: Azure (preferred), AWS, or GCP
• Experience with stream processing technologies (Kafka, Spark Structured Streaming)
• Solid understanding of ETL/ELT patterns, data modelling (dimensional, Data Vault), and data warehousing
• Experience with orchestration tools (Apache Airflow, Azure Data Factory, or equivalent)
• Knowledge of Infrastructure as Code (Terraform or equivalent)
• Understanding of production-grade system requirements: reliability, scalability, observability, and performance
• Upper-Intermediate English level

Will be a plus

• Familiarity with RAG pipeline design and LLM integration patterns
• Knowledge of data governance frameworks and tools (Unity Catalog, Apache Atlas, or similar)
• Experience with dbt for data transformation and modelling
• Familiarity with MLflow, Feature Stores, or ML platform integration

🏖️ Benefits

• Employees can work remotely
