Data Scientist II – Big Data R&D, Identity Graph, KYC

🕒 April 24

🏄 California – Remote


💵 $140k - $170k / year

⏰ Full Time

🟢 Junior

🟡 Mid-level

📊 Data Scientist

🦅 H1B Visa Sponsor



Socure

501 - 1000 employees

Founded 2012

🤖 Artificial Intelligence

🔐 Security

💸 Finance

💰 $450M Series E on 2021-11


Socure is a platform for digital identity verification and trust. Using predictive analytics, artificial intelligence, and machine learning, Socure draws on online and offline data, including email, phone, address, IP, and device information, to verify identities in real time. Its solutions address onboarding, login authentication, account takeover prevention, and contact center operations. The platform is used to combat identity fraud, support compliance, and improve user experience in industries such as financial services, eCommerce, online gaming, and crypto.

📋 Description

• Contribute to the design and implementation of machine learning, data mining, statistical, and graph-based algorithms to analyze very large datasets for identity verification and anomaly detection.
• Analyze large datasets to help develop and refine entity-resolution and identity-matching algorithms that drive Socure’s KYC and compliance solutions.
• Build and maintain components of data-processing pipelines (ETL, feature generation, normalization) using tools such as Spark/PySpark and AWS (e.g., EMR, S3).
• Support senior data scientists with feature engineering, data exploration, error analysis, and A/B test setup for new models and signals.
• Help evaluate new third-party and internal data sources: profile data quality, design offline experiments, and summarize impact on coverage and model performance.
• Implement and maintain SQL and Python/R code for data extraction, transformation, and validation; contribute to code reviews and basic testing.
• Provide analytical support to compliance and regulatory product teams, including ad hoc investigations, simple dashboards, and data deep dives.
• Communicate findings in a clear, structured way to peers and cross-functional partners (Product, Engineering, Client Analysis), focusing on key insights and trade-offs.
• Work effectively in a fast-paced, cross-functional environment; demonstrate ownership of well-scoped tasks and follow through to completion.

🎯 Requirements

• Master’s degree with 2+ years of experience, or Ph.D. with 1+ year of experience, in a data science or analytics role, or equivalent practical experience.
• Proficiency in at least one general-purpose programming language used in data science (Python or Scala).
• Solid experience writing and optimizing SQL for large datasets; comfort working in data lake / warehouse environments.
• Hands-on experience with Spark or PySpark and common ML libraries (e.g., scikit-learn, XGBoost); TensorFlow/PyTorch is a plus.
• Familiarity with UNIX environments and the AWS ecosystem (e.g., EMR, S3); Databricks experience is a plus.
• Working knowledge of supervised/unsupervised ML and basic statistics (similarity measures, clustering, evaluation metrics).
• Exposure to graph techniques or graph databases (Neo4j, AWS Neptune, GraphFrames) is a strong plus.
• Bonus: experience with Elasticsearch or DynamoDB, or with workflow tools such as Airflow for automating data pipelines.
• Ability to break down loosely defined problems, ask good clarifying questions, and iterate quickly with feedback.

🏖️ Benefits

• Offers Equity
• Offers Bonus

