Data Engineer Expert-level SQL
Listed on 2026-06-07
-
IT/Tech
Data Engineer, Data Scientist
Overview
Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues around the world, and where you’ll be able to reimagine what’s possible. Join us and help the world’s leading organizations unlock the value of technology and build a more sustainable, more inclusive world.
Onsite :
New Jersey
- Design and maintain high-throughput ingestion pipelines for transaction signals, behavioral events, and third-party identity graphs - including Live Ramp RampID, UID2, GCLID chains, and household device graphs
- Implement identity resolution logic at scale: deterministic matching, probabilistic graph construction, and household + device-level cluster assembly across 1B+ data points
- Build and maintain data clean room connectors and privacy-preserving data exchange pipelines (AWS Clean Rooms, Live Ramp DCR, Google ADH, or equivalent)
- Develop integrations between activation platforms (email, CDP, DSP) and the identity graph layer - supporting real-time audience push and match rate monitoring
- Design medallion-architecture or equivalent data models optimized for cohort-level LTV/CAC attribution and multi-touch attribution across owned, paid, and clean room channels
- Build automated QC and reconciliation frameworks - deduplication, compliance validation, and data lineage tracking - capable of reducing manual reconciliation cycles from weeks to hours
- Implement PII governance controls at the pipeline layer: redacted s, consent signal propagation, and guardrail validation aligned to GLBA, Fair Lending, UDAAP, and TCPA/CAN-SPAM
- Integrate LLM-based APIs (e.g., Anthropic Claude, OpenAI, Vertex AI) for AI-powered signal enrichment, audience brief generation, and compliance pre-screening within pipeline workflows
- Build serverless microservices and API bridge layers connecting clean room outputs to activation destinations - using any major serverless or edge compute platform
- Maintain and evolve authentication, email notification, and managed database services supporting platform-facing APIs and client-facing tooling
- 5+ years of data engineering experience
- Expert-level SQL across at least one major cloud data warehouse:
Snowflake, Google Big Query, Amazon Redshift, or Azure Synapse - Proficiency in Python for pipeline development, transformation logic, and data quality automation
- Hands-on experience with at least one clean room technology: AWS Clean Rooms, Live Ramp DCR, Google ADH, Info Sum, or equivalent privacy-preserving data collaboration platform
- Deep understanding of identity resolution concepts: deterministic matching, probabilistic graph construction, household-level aggregation, and device graph assembly
- Strong PII governance knowledge: data residency, consent frameworks, and financial services regulatory requirements (GLBA, Fair Lending, UDAAP)
- Experience integrating with DSPs, CDPs, or marketing activation platforms at the data layer
- Ability to operate in client-facing consulting delivery contexts - translating business requirements into technical specifications
- Experience with graph database technologies - Neo4j, Amazon Neptune, or Tiger Graph - for identity graph storage and traversal
- Familiarity with Live Ramp Embedded Identity, UID2 token handling, or walled garden attribution integrations (Google ADH, Meta CAPI, Amazon Attribution)
- Working knowledge of LLM APIs for structured data enrichment and AI-assisted pipeline workflows
The base compensation range for this role in the posted location is: $100000 to $130000
Capgemini provides compensation range information in accordance with applicable national, state, provincial, and local pay transparency laws. The base compensation range listed for this position reflects the minimum and maximum target compensation Capgemini, in good faith, believes it may pay for the role at the time of this posting. This range may be subject to change as permitted by law.
The actual compensation offered to any candidate may fall outside of the posted range…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).