phData is the perfect mix of services and automation to create solid data platforms, outstanding data products, and value-generating machine learning systems in the cloud. phData guides the world’s largest brands in cloud data platforms, data engineering, data science, and machine learning.
Spark • Impala • Kafka • Kudu • Scala
201 - 500
💰 $2.5M Seed Round on 2018-03
May 7
phData is the perfect mix of services and automation to create solid data platforms, outstanding data products, and value-generating machine learning systems in the cloud. phData guides the world’s largest brands in cloud data platforms, data engineering, data science, and machine learning.
Spark • Impala • Kafka • Kudu • Scala
201 - 500
💰 $2.5M Seed Round on 2018-03
• Collaborate closely with diverse stakeholders to gather business requirements and translate them into technical specifications to inform model design and implementation • Conduct data manipulation required for model pre- and post-processing (preparing the data) • Author articulate reports, visualizations, and presentations to communicate your process and results to stakeholders with varying levels of technical understanding • Engage with interdisciplinary development teams, such as data engineers and machine learning engineers, to leverage MLOPs frameworks to support the longevity of models (work across functional teams) • Proactively and independently engage with customers and take ownership for implementing data science best practices unique to each project • Drive thought leadership in the data science community by actively contributing to open source projects, sharing insights through engaging talks, and advancing global data science practices
• Advanced degree or evidence of exceptional ability in engineering, computer science, mathematics, physics, chemistry, or operations research • 3-5+ years hands-on experience in building models and developing algorithms for real-world applications and data-driven optimizations • Depth of knowledge in advanced mathematics, machine learning, and statistics • Strong computer science fundamentals: data structures, algorithms, distributed systems • Mastery of one or more data analysis languages such as R, Python, PySpark, and SQL • Experience with data science tools including Python scripting, NumPy, SciPy, matplotlib, scikit-learn, Jupyter notebooks, bash scripting • Experience with XGBoost, Facebook Prophet, Random Forests, and other major data science algorithms and packages • Preferred - familiarity developing on cloud platforms such as AWS, Azure, and Databricks • Exemplary verbal and written communication skills, ability to relate technical results and business outcomes to technical & non-technical audiences • Able to work under pressure while managing competing demands and tight deadlines • Well organized with meticulous attention to detail
• Remote-First Work Environment • Casual, award-winning small-business work environment • Collaborative culture that prizes autonomy, creativity, and transparency • Competitive comp, excellent benefits, 4 weeks PTO plus 10 Holidays (and other cool perks) • Accelerated learning and professional development through advanced training and certifications
Apply NowMay 7
1001 - 5000
May 7
11 - 50
May 7
501 - 1000
🇺🇸 United States – Remote
💵 $177.8k - $209.3k / year
💰 Funding Round on 2017-01
⏰ Full Time
🟠 Senior
📊 Data Scientist
🗽 H1B Visa Sponsor
May 7
1001 - 5000
🇺🇸 United States – Remote
💵 $80k - $120k / year
💰 $35.2M Venture Round on 2021-03
⏰ Full Time
🟠 Senior
📊 Data Scientist
🗽 H1B Visa Sponsor
May 4
201 - 500
🇺🇸 United States – Remote
💵 $140k - $180k / year
💰 Post-IPO Equity on 2022-11
⏰ Full Time
🟠 Senior
📊 Data Scientist
🗽 H1B Visa Sponsor