×
Register Here to Apply for Jobs or Post Jobs. X

Senior Data Lakehouse Architect; Databricks), Vice President

Job in Quincy, Norfolk County, Massachusetts, 02171, USA
Listing for: State Street
Full Time position
Listed on 2026-05-19
Job specializations:
  • IT/Tech
    Data Engineering
Salary/Wage Range or Industry Benchmark: 150000 - 200000 USD Yearly USD 150000.00 200000.00 YEAR
Job Description & How to Apply Below
Position: Senior Data Lakehouse Architect (Databricks), Vice President

About the Role

We are seeking a Senior Data Lakehouse Architect to design and lead the build-out of a Legal Data Lakehouse platform on AWS and Databricks
. This role will drive the architecture, engineering, and governance of scalable, secure, and compliant data capabilities supporting legal operations, contract intelligence, eDiscovery, and AI/ML use cases.

Why This Role Is Important

State Street’s Legal function operates across a wide range of contracts, matters, regulatory obligations, documents, and workflows. Building a modern Legal Data Lakehouse is critical to creating a trusted, governed foundation that brings these data sources together—making legal information easier to access, analyze, and use at scale.

Responsibilities
  • Architecture & Platform Design:
    Define and implement the end‑to‑end Legal Data Lakehouse architecture using Databricks (Delta Lake, Unity Catalog, Workflows) on AWS.
  • Design multi‑layered data architecture (Bronze, Silver, Gold) to support contract metadata and document ingestion, legal matter management data, eDiscovery datasets, and external regulatory and compliance feeds.
  • Establish scalable ingestion frameworks (batch and streaming) for structured and unstructured legal data (PDFs, contracts, emails).
  • Data Engineering & Integration:
    Lead development of ETL/ELT pipelines using Databricks, Spark, and Python/SQL; integrate with enterprise platforms including contract lifecycle management systems, AI platforms, LLM pipelines, and document repositories.
  • Design patterns for extracting structured data from unstructured legal documents and persisting into Delta Lake; enable downstream integration with enterprise data platforms, analytics tools, and AI/ML pipelines.
  • Governance, Security & Compliance:
    Implement data governance frameworks using Databricks Unity Catalog and AWS‑native controls (IAM, KMS); establish fine‑grained access controls, data lineage and auditability; ensure compliance with data privacy regulations (e.g., GDPR) and internal security and audit requirements; partner with IAM teams to integrate with enterprise identity providers (Entra  / Azure AD).
  • AI/ML & Advanced Analytics Enablement:
    Architect data models supporting contract analytics, clause extraction, and obligation tracking; legal AI use cases (contract review, litigation insights, compliance monitoring, legal spend analytics); design search and retrieval architectures (RAG) for enterprise legal knowledge bases; enable entity extraction and knowledge graph frameworks; integrate with LLM/GenAI platforms for document summarization, Q&A, and workflow automation.
  • Dev Ops & Platform Operations:
    Establish CI/CD pipelines and infrastructure‑as‑code (Terraform, Git‑based workflows); define standards for code quality and versioning, environment promotion (Dev / QA / Prod); implement observability and alerting for platform health and reliability.
  • Leadership & Stakeholder Engagement:
    Partner with Legal and Technology leadership to define platform roadmap and priorities; provide architectural governance and design oversight; mentor data engineers and platform teams; translate business and legal requirements into scalable, enterprise‑grade solutions; operate within a federated data and platform model, collaborating across engineering, security, and domain teams.
Qualifications
  • 10+ years of experience in data architecture, engineering, or analytics platforms.
  • 5+ years of hands‑on experience with Databricks and Apache Spark.
  • Strong experience with AWS‑based data platforms.
  • Expertise in data governance, security, and compliance in regulated environments.
  • Experience working with unstructured data and NLP/document processing pipelines.
Education & Certifications
  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, Information Systems, or a related discipline.
  • Relevant certifications strongly preferred:
    Databricks Certified Data Engineer / Architect; AWS Certified Solutions Architect (Associate or Professional).
Preferred Qualifications (Core)
  • Strong hands‑on experience with the Databricks Lakehouse platform, including Delta Lake, Unity Catalog, Workflows, and MLflow.
  • Deep expertise in AWS data platform services (S3,…
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary