Senior Data Scientist - International eKYC, Identity Graph
Listed on 2026-05-31
-
IT/Tech
Data Scientist, Cybersecurity
Why Socure?
Socure is building the identity trust infrastructure for the digital economy — verifying 100% of good identities in real time and stopping fraud before it starts. The mission is big, the problems are complex, and the impact is felt by businesses, governments, and millions of people every day.
We hire people who want that level of responsibility. People who move fast, think critically, act like owners, and care deeply about solving customer problems with precision. If you want predictability or narrow scope, this won’t be your place. If you want to help build the future of identity with a team that holds a high bar for itself — keep reading.
About the RoleThe Big Data R&D team builds the core entity‑resolution and graph‑based intelligence that underpins Socure’s Verify and KYC products. As a Senior Data Scientist focused on international eKYC
, you will be a technical leader driving the next generation of global identity verification solutions. You will design and deploy ML and graph-based systems tailored to diverse international markets, regulations, and data ecosystems—covering government IDs, telco and credit bureaus, mobile‑first data, and non‑traditional signals.
You will own complex, cross‑product initiatives such as international identity graph evolution, probabilistic matching for non‑US identities, and scalable evaluation frameworks that account for regional regulatory and fairness constraints. You will closely partner with Product, Engineering, Compliance, and GTM teams to launch and scale eKYC solutions across multiple countries and regions.
What You'll DoInternational eKYC Modeling & Entity Resolution
Lead the design, development, and deployment of ML and graph‑based algorithms for international entity resolution, identity trust scoring, and anomaly detection across heterogeneous, country‑specific datasets.
Architect reusable matching and linking frameworks that work across multiple (e.g., national , passports, voter IDs, mobile accounts, bank accounts) and local name/address conventions.
Develop probabilistic and rule‑augmented models that handle noisy, sparse, or partially labeled international data while maintaining explainability and regulatory defensibility.
Global Identity Graph & Data Quality
Define and evolve the international extension of Socure’s identity graph: schema design, linkage strategies, quality tiers, and confidence scoring that can be leveraged by multiple products (Verify, KYC, watchlists, fraud).
Design and implement robust data quality and monitoring frameworks for international identity data (coverage, stability, drift, regional bias, label quality) and integrate them into modeling and production monitoring workflows.
Build scalable approaches for handling linguistic and cultural variation (e.g., transliteration, multi‑script names, address normalization, local naming patterns) in the identity graph and matching pipelines.
Evaluation, Experimentation, and Model Governance
Own experimentation strategy for major international eKYC initiatives:
Design offline evaluations and online A/B tests that reflect local ground truth constraints and data sparsity.
Define success metrics that balance approval rates, fraud capture, and regulatory/operational constraints per market.
Analyze lift, stability, and fairness trade‑offs and drive go/no‑go decisions with Product and Engineering.
Define and maintain evaluation frameworks specific to international eKYC (e.g., regional coverage maps, cross‑border identity leakage, local demographic impact, regulatory thresholds).
Contribute to model governance documentation and support responses to regulators and large enterprise customers regarding model logic, data provenance, fairness, and monitoring for international markets.
Data Source Strategy & Vendor Evaluation (International)
Lead the evaluation and integration of international data vendors (e.g., bureaus, telcos, public records, alternative data):
Design benchmarking methodologies for signal quality, incremental value, stability, and fairness by country/segment.
Quantify ROI and trade‑offs across multiple vendors and data types; provide clear recommendations that influence…
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: