phData is the perfect mix of services and automation to create solid data platforms, outstanding data products, and value-generating machine learning systems in the cloud. phData guides the world’s largest brands in cloud data platforms, data engineering, data science, and machine learning.
Spark • Impala • Kafka • Kudu • Scala
201 - 500
💰 $2.5M Seed Round on 2018-03
March 20
Airflow
AWS
Azure
Cassandra
Cloud
ElasticSearch
GCP
Hadoop
HDFS
Informatica
Java
Kafka
Matillion
NoSQL
Python
Scala
Spark
SQL
• At least 4 years of experience as a Software Engineer, Data Engineer, or Data Analyst
• Ability to develop end-to-end technical solutions and take them into production, and to help ensure performance, security, scalability, and robust data integration
• Programming expertise in Java, Python, and/or Scala
• Core cloud data platforms, including Snowflake, AWS, Azure, Databricks, and GCP
• SQL and the ability to write, debug, and optimize SQL queries
• Client-facing written and verbal communication skills and experience
• Ability to create and deliver detailed presentations
• Detailed solution documentation (e.g. POCs and roadmaps, sequence diagrams, class hierarchies, logical system views)
• 4-year Bachelor's degree in Computer Science or a related field
• Production experience in core data platforms: Snowflake, AWS, Azure, GCP, Hadoop, Databricks
• Cloud and distributed data storage: S3, ADLS, HDFS, GCS, Kudu, ElasticSearch/Solr, Cassandra, or other NoSQL storage systems
• Data integration technologies: Spark, Kafka, event/streaming, StreamSets, Matillion, Fivetran, NiFi, AWS Database Migration Service, Azure Data Factory, Informatica Intelligent Cloud Services (IICS), Google Dataproc, or other data integration technologies
• Multiple data sources (e.g. queues, relational databases, files, search, API)
• Complete software development lifecycle experience, including design, documentation, implementation, testing, and deployment
• Automated data transformation and data curation: dbt, Spark, Spark Streaming, automated pipelines
• Workflow management and orchestration: Airflow, AWS Managed Airflow, Luigi, NiFi
• Remote-First Work Environment
• Casual, award-winning small-business work environment
• Collaborative culture that prizes autonomy, creativity, and transparency
• Competitive comp, excellent benefits, generous PTO plan plus 10 Holidays (and other cool perks)
• Accelerated learning and professional development through advanced training and certifications
June 1, 2023