The single source of truth for B2B data
people data • data science • data • DaaS • B2B Data
51 - 200
March 20
The single source of truth for B2B data
people data • data science • data • DaaS • B2B Data
51 - 200
• Build infrastructure for ingestion, transformation, and loading an exponentially increasing volume of data from a variety of sources using Spark, SQL, AWS, and Databricks • Building an organic entity resolution framework capable of correctly merging hundreds of billions of individual entities into a number of clean, consumable datasets • Developing CI/CD pipelines and anomaly detection systems capable of continuously improving the quality of data we're pushing into production • Devising solutions to largely-undefined data engineering and data science problems • Work with stakeholders in Engineering and Product to assist with data-related technical issues and support their infrastructure needs
• 5-7+ years industry experience with clear examples of strategic technical problem solving and implementation • Strong software development fundamentals • Experience with Python Expertise with Apache Spark (Java, Scala, and/or Python-based) • Experience with SQL • Experience building scalable data processing systems (e.g., cleaning, transformation) from the ground up • Experience using developer-oriented data pipeline and workflow orchestration (e.g., Airflow (preferred), dbt, dagster or similar) • Knowledge of modern data design and storage patterns (e.g., incremental updating, partitioning and segmentation, rebuilds and backfills) • Experience working in Databricks (including delta live tables, data lakehouse patterns, etc.) • Experience with cloud computing services (AWS (preferred), GCP, Azure or similar) • Experience with data warehousing (e.g., Databricks, Snowflake, Redshift, BigQuery, or similar) • Understanding of modern data storage formats and tools (e.g., parquet, ORC, Avro, Delta Lake)
• Stock • Competitive Salaries • Unlimited paid time off • Medical, dental, & vision insurance • Health, fitness, and office stipends • The permanent ability to work wherever and however you want
Apply NowMarch 20
11 - 50
March 20
51 - 200
March 20
51 - 200
March 20
1001 - 5000
🇺🇸 United States – Remote
💵 $77.9k - $137.9k / year
💰 $1.6G Series H on 2021-08
⏰ Full Time
🟠 Senior
🚰 Data Engineer
🗽 H1B Visa Sponsor