Data Engineer

Job not on LinkedIn

August 30

Apply Now
Logo of Sumble

Sumble

Technology • SaaS • eCommerce

Sumble is a technology company that enhances user experiences by utilizing cookies to streamline browsing, personalize content, and analyze website traffic. It ensures secure logins and compliance with user consent preferences, while also leveraging analytics to improve site performance and offer tailored advertisements. Their services aim to facilitate efficient navigation and feature utilization for visitors, making it a key player in the realm of user experience and web performance optimization.

📋 Description

• Build Sumble's knowledge graph from web data for go-to-market teams, using job posts and resume data to identify org structure, tech stack, and key projects (e.g., GenAI initiatives, cloud migrations) • Join a 15-person team including 10 engineers with experience at Google, Meta, Stack Overflow, and Kaggle • Build mission-critical, flexible, and scalable data pipelines focusing on reliability, data consistency, and data recovery • Explore and define standards for data access and analytics patterns; develop evolving data warehouse and data lake solutions • Work with SQL, orchestrators, and data modeling; tech stack includes Python, FastAPI, TypeScript, GCP, PostgreSQL/AlloyDB, PyTorch/Huggingface/vLLM, Prefect, and Cloud Run • Tackle noisy datasets, expensive analytics computations, growing data/model complexity, and create UX supporting both aggregated and granular source data

🎯 Requirements

• Located within Americas timezones

🏖️ Benefits

• Medical, dental, and vision (US) • 401k (US) • Target 4 weeks PTO

Apply Now

Similar Jobs

August 29

Experienced Database & Cloud Advisory Specialist providing guidance on SQL Server, Oracle, and GCP cloud services. Key role in database management and cloud migration strategies.

Cloud

Google Cloud Platform

MongoDB

Oracle

SQL

August 29

Senior Data Engineer builds data pipelines and data infrastructure at Hershey. Ensures data quality and scalability across teams.

ETL

Java

JavaScript

MongoDB

MySQL

NoSQL

Postgres

Python

SQL

August 27

Senior Data Engineer building 8am analytics infrastructure and ETL pipelines. Mentor engineers, integrate data across products for business insights.

Airflow

Amazon Redshift

AWS

Cloud

ETL

Kafka

NoSQL

PySpark

Python

Spark

SQL

Terraform

August 26

Senior Hadoop Big Data engineer building ETL and data pipelines using Hadoop, Spark, PySpark and AWS at CubeTechUS

AWS

Hadoop

HDFS

Kafka

PySpark

Scala

Spark

SQL

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com