Data Engineer

Job not on LinkedIn

🔥 0 minutes ago

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Greenlight Planet

Greenlight Planet

1001 - 5000 employees

Founded 2009

⚡ Energy

🌍 Social Impact

👥 B2C

Energy • Social Impact • B2C

Greenlight Planet is a company that has transitioned to the brand Sun King. It specializes in designing, distributing, installing, and financing solar energy solutions, specifically for the 1. 8 billion people without reliable electricity access. The company provides a range of solar products, such as lanterns, home systems, and AC electricity systems, to meet diverse power needs across Africa and Asia. Sun King's solutions facilitate off-grid power access, offering reliable and sustainable energy alternatives to traditional power sources. The company operates on a global scale, including in regions such as East Africa, South Asia, and Oceania, and also provides pay-as-you-go financing options to make solar energy more accessible and affordable.

📋 Description

• ETL/ELT Pipeline Development: Build, and maintain scalable data pipelines using AWS. Implement both batch and incremental load patterns for BI reporting and application data needs. • Real-Time Data Streaming: Develop and manage real-time data ingestion pipelines using Kafka. Ensure low-latency, fault-tolerant data flow for critical business workflows. • Workflow Orchestration: Build, schedule, and monitor end-to-end data workflows using Apache Airflow. Manage dependencies, retries, and alerting for production DAGs. • Data Warehouse Management: Administer and optimize Amazon Redshift clusters including schema design, query performance tuning, distribution/sort keys, and vacuuming to ensure high availability and cost efficiency. • Data Quality & Observability: Implement automated data quality checks at ingestion and transformation stages. Define validation rules, build alerting for anomalies and discrepancies, and establish SLAs to ensure stakeholders can trust the data they use. • API Integrations: Integrate third-party and internal REST APIs into data pipelines to pull operational and product data into the warehouse. • Cloud Cost Optimization: Monitor and right-size data processing and storage resources across S3, EMR, Redshift, EC2, and Lambda. Proactively identify inefficiencies and propose cost-saving improvements. • BI & Analytics Collaboration: Partner with the BI team to align data models, preprocessing logic, and Redshift schema design with reporting and dashboard needs.

🎯 Requirements

• Bachelor’s degree in Computer Science or a related quantitative field. • 2+ years of experience working as a Data Engineer • Good proficiency in Python and SQL for data transformation and pipeline development • Hands-on experience with Apache Spark (PySpark) for large-scale data processing • Working knowledge of Kafka for real-time data ingestion and stream processing • Hands-on experience managing and maintaining Airflow DAGs in production environments • Familiarity with Redshift performance tuning, schema design, and query optimization • Experience implementing automated data validation and quality checks within pipelines • Detail-oriented with a keen interest in data transformations and their impact on business outcomes • Problem-solving and time management skills • Prior experience in project or team management is preferred, enthusiasm for mentoring and guiding others is a plus.

🏖️ Benefits

• - Professional growth in a dynamic, rapidly expanding, high-social-impact industry • - An open-minded, collaborative culture made up of enthusiastic colleagues who are driven by the challenge of innovation towards profound impact on people and the planet. • - A truly multicultural experience: you will have the chance to work with and learn from people from different geographies, nationalities, and backgrounds. • - Structured, tailored learning and development programs that help you become a better leader, manager, and professional through the Sun King Center for Leadership.

Apply Now

Similar Jobs

🕒 2 days ago

TLC Worldwide

201 - 500

🤝 B2B

Data Engineer at TLC Worldwide building data pipelines and platforms for insights and AI innovation. Collaborating cross-functionally to deliver scalable data solutions in a fully remote setup.

Amazon Redshift

Azure

BigQuery

Cloud

ETL

Google Cloud Platform

Python

SQL

🕒 June 11

Sikich

1001 - 5000

Data Engineer at Sikich optimizing data solutions using Microsoft platforms. Responsible for building robust data pipelines and delivering insight-driven analytics.

🇮🇳 India – Remote

💰 Private Equity Round on 2024-05

⏰ Full Time

🟡 Mid-level

🟠 Senior

🚰 Data Engineer

Azure

Kafka

Python

Scala

Spark

SQL

🕒 June 11

NPS Prism

201 - 500

🤝 B2B

👥 B2C

☁️ SaaS

Data Engineer II responsible for developing ETL/ELT workflows and managing data lakes for NPS Prism. Collaborating with teams to design data solutions on cloud platforms like Azure and AWS.

AWS

Azure

Cloud

ETL

PySpark

Python

SQL

Tableau

🕒 June 6

Forbes Advisor

201 - 500

📱 Media

💸 Finance

Data Engineer (L3) designing scalable data architecture and robust data pipelines for social marketing. Collaborate in a fintech initiative providing insights on personal finance and growth marketing.

Airflow

BigQuery

Cloud

ETL

Microservices

Python

SQL

🕒 June 6

Forbes Advisor

201 - 500

📱 Media

💸 Finance

Data Engineer building and maintaining data pipelines for marketing analytics and support across business teams. Contributing to data ingestion and modelling from various platforms with a focus on Meta Ads.

Airflow

BigQuery

Cloud

ETL

Microservices

Python

SQL