phData

Website LinkedIn All Job Openings

phData is the perfect mix of services and automation to create solid data platforms, outstanding data products, and value-generating machine learning systems in the cloud. phData guides the world’s largest brands in cloud data platforms, data engineering, data science, and machine learning.

Spark • Impala • Kafka • Kudu • Scala

201 - 500

💰 $2.5M Seed Round on 2018-03

Senior Data Scientist

May 7

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

📊 Data Scientist

🗽 H1B Visa Sponsor

Apply Now

phData

Website LinkedIn All Job Openings

Spark • Impala • Kafka • Kudu • Scala

201 - 500

💰 $2.5M Seed Round on 2018-03

Description

• Collaborate closely with diverse stakeholders to gather business requirements and translate them into technical specifications to inform model design and implementation • Conduct data manipulation required for model pre- and post-processing (preparing the data) • Author articulate reports, visualizations, and presentations to communicate your process and results to stakeholders with varying levels of technical understanding • Engage with interdisciplinary development teams, such as data engineers and machine learning engineers, to leverage MLOPs frameworks to support the longevity of models (work across functional teams) • Proactively and independently engage with customers and take ownership for implementing data science best practices unique to each project • Drive thought leadership in the data science community by actively contributing to open source projects, sharing insights through engaging talks, and advancing global data science practices

Requirements

• Advanced degree or evidence of exceptional ability in engineering, computer science, mathematics, physics, chemistry, or operations research • 3-5+ years hands-on experience in building models and developing algorithms for real-world applications and data-driven optimizations • Depth of knowledge in advanced mathematics, machine learning, and statistics • Strong computer science fundamentals: data structures, algorithms, distributed systems • Mastery of one or more data analysis languages such as R, Python, PySpark, and SQL • Experience with data science tools including Python scripting, NumPy, SciPy, matplotlib, scikit-learn, Jupyter notebooks, bash scripting • Experience with XGBoost, Facebook Prophet, Random Forests, and other major data science algorithms and packages • Preferred - familiarity developing on cloud platforms such as AWS, Azure, and Databricks • Exemplary verbal and written communication skills, ability to relate technical results and business outcomes to technical & non-technical audiences • Able to work under pressure while managing competing demands and tight deadlines • Well organized with meticulous attention to detail

Benefits

• Remote-First Work Environment • Casual, award-winning small-business work environment • Collaborative culture that prizes autonomy, creativity, and transparency • Competitive comp, excellent benefits, 4 weeks PTO plus 10 Holidays (and other cool perks) • Accelerated learning and professional development through advanced training and certifications

Apply Now