Senior Data Engineer

🔥 0 minutes ago

Sigma Software Group

1001 - 5000 employees

Founded 2002

Software Development • Gaming • Telecommunications

Sigma Software Group is a multinational company, established in 2002, that specializes in providing high-quality software development, graphic design, testing, and support services. The company focuses on delivering solutions across various industries such as automotive, telecommunications, aviation, advertising, gaming, banking, real estate, and healthcare. Sigma Software values professional growth, offers remote work opportunities worldwide, and caters to world-renowned clients like AstraZeneca, Scania, and SAS. The company emphasizes a culture of continuous education, mentorship, and flexible work environments, making it a preferred workplace for IT specialists aiming to work on complex solutions utilizing cutting-edge technologies. Sigma Software is committed to innovative solutions and engineering the future while also contributing to social causes such as charitable work in Ukraine.

📋 Description

• Design and build scalable, cloud-native data platforms from greenfield to production
• Implement near-real-time ingestion pipelines using event-driven patterns
• Define and enforce platform standards, including Data Lake / Lakehouse principles, medallion architecture, and data contracts
• Refactor and optimise existing Spark and PySpark scripts for performance and maintainability
• Introduce best practices for code quality, testing, and CI/CD across data pipelines
• Drive adoption of AI tooling and agentic workflows within the data engineering team
• Ensure data quality, observability, and reliability across all pipelines and platforms
• Develop self-service tooling and microservices to simplify platform usage for other teams

🎯 Requirements

• 5+ years of professional experience in Data Engineering
• Strong Python and SQL skills for pipeline development and optimisation
• Proficiency in Apache Spark / PySpark, including query optimisation and performance tuning
• Hands-on experience with Databricks (preferred) or Snowflake
• Experience with at least one major cloud provider: Azure (preferred), AWS, or GCP
• Experience with stream processing technologies (Kafka, Spark Structured Streaming)
• Solid understanding of ETL/ELT patterns, data modelling (dimensional, Data Vault), and data warehousing
• Experience with orchestration tools (Apache Airflow, Azure Data Factory, or equivalent)
• Knowledge of Infrastructure as Code (Terraform or equivalent)
• Understanding of production-grade system requirements: reliability, scalability, observability, and performance
• Upper-Intermediate English level

Will be a plus:

• Familiarity with RAG pipeline design and LLM integration patterns
• Knowledge of data governance frameworks and tools (Unity Catalog, Apache Atlas, or similar)
• Experience with dbt for data transformation and modelling
• Familiarity with MLflow, Feature Stores, or ML platform integration

🏖️ Benefits

• Employees can work remotely
