Senior Data Scientist – International eKYC, Identity Graph

🕒 April 24

🇺🇸 United States – Remote

💵 $140k - $170k / year

⏰ Full Time

🟠 Senior

📊 Data Scientist

🦅 H1B Visa Sponsor

info
Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Socure

Socure

501 - 1000 employees

Founded 2012

🤖 Artificial Intelligence

🔐 Security

💸 Finance

💰 $450M Series E on 2021-11

Artificial Intelligence • Security • Finance

Socure is a leading platform for digital identity verification and trust. Utilizing advanced predictive analytics, artificial intelligence, and machine learning technologies, Socure leverages vast online and offline data intelligence including email, phone, address, IP, and device information to verify identities in real-time. Their solutions address challenges in onboarding, login authentication, account takeover prevention, and contact center operations. Socure's AI-powered platform excels in combating identity fraud, ensuring compliance, and enhancing user experiences across various industries such as financial services, eCommerce, online gaming, and crypto.

📋 Description

• Lead the design, development, and deployment of ML and graph-based algorithms for international entity resolution, identity trust scoring, and anomaly detection across heterogeneous, country‑specific datasets. • Architect reusable matching and linking frameworks that work across multiple ID schemes (e.g., national ID numbers, passports, voter IDs, mobile accounts, bank accounts) and local name/address conventions. • Develop probabilistic and rule‑augmented models that handle noisy, sparse, or partially labeled international data while maintaining explainability and regulatory defensibility. • Define and evolve the international extension of Socure’s identity graph: schema design, linkage strategies, quality tiers, and confidence scoring that can be leveraged by multiple products (Verify, KYC, watchlists, fraud). • Design and implement robust data quality and monitoring frameworks for international identity data (coverage, stability, drift, regional bias, label quality) and integrate them into modeling and production monitoring workflows. • Own experimentation strategy for major international eKYC initiatives: Design offline evaluations and online A/B tests that reflect local ground truth constraints and data sparsity. • Define success metrics that balance approval rates, fraud capture, and regulatory/operational constraints per market. • Analyze lift, stability, and fairness trade‑offs and drive go/no‑go decisions with Product and Engineering. • Contribute to model governance documentation and support responses to regulators and large enterprise customers regarding model logic, data provenance, fairness, and monitoring for international markets.

🎯 Requirements

• Master’s or Ph.D. in Computer Science, Data Science, Machine Learning, Statistics, Mathematics, or a related field, or equivalent practical experience. • 6+ years of hands-on applied ML / data science experience (4+ with Ph.D.), including owning production models and pipelines in high‑stakes domains (fraud, risk, identity, payments, credit, or similar). • Significant prior work on international or multi‑region products is strongly preferred (e.g., cross‑country KYC, credit risk, payments, or compliance systems). • Expert‑level proficiency in Python and SQL, with extensive experience in distributed data processing (Spark/PySpark, Databricks or similar) on very large datasets. • Deep experience designing, training, and deploying models for classification, ranking, anomaly detection, and/or graph learning, including: • Feature engineering for noisy/heterogeneous identity data. • Robust evaluation under label sparsity and feedback delays. • Calibration and thresholding tailored to regional risk and regulatory constraints. • Proven expertise with graph technologies (e.g., Neo4j, AWS Neptune, GraphFrames, DGL, PyTorch Geometric) and graph algorithms (entity resolution, link prediction, community detection, label propagation) at scale.

🏖️ Benefits

• Offers Equity • Offers Bonus

Apply Now

Similar Jobs

🕒 April 24

Foundation

11 - 50

₿ Crypto

🏪 Marketplace

🛍️ eCommerce

Senior Data Scientist at City of Hope analyzing large healthcare datasets to improve cancer care delivery. Collaborating with administrative and clinical teams, applying machine learning techniques.

🕒 April 24

OneStudyTeam

201 - 500

⚕️ Healthcare Insurance

🧬 Biotechnology

💊 Pharmaceuticals

Senior Data Scientist advancing data-driven solutions for clinical trials at OneStudyTeam. Collaborating with cross-functional teams to improve patient enrollment and trial management through statistical models and machine learning algorithms.

🕒 April 23

Cushman & Wakefield

10,000+ employees

🏠 Real Estate

🏢 Enterprise

Senior Director leading execution of AI strategy and managing a multidisciplinary team at Cushman & Wakefield. Overseeing AI innovation, governance, and operational rigor.

🕒 April 23

Advarra

501 - 1000

☁️ SaaS

💊 Pharmaceuticals

AI Data Scientist focusing on optimizing and operationalizing machine learning models for Advarra’s Braid platform. Collaborating with teams to enhance clinical and operational data leveraging advanced AI techniques.

🕒 April 23

Paramount

10,000+ employees

📱 Media

👥 B2C

Senior Data Scientist leading evaluation strategies for dynamic personalization surfaces at Paramount. Collaborating to ensure visual optimizations are statistically sound and causally effective.