Data Engineer (Databricks), Assistant Vice President

Job in Quincy, Norfolk County, Massachusetts, 02171, USA
Listing for: STATE STREET CORPORATION
Full Time position
Listed on 2026-05-18
Job specializations:
  • IT/Tech
    Data Engineer, Data Science Manager
Salary/Wage Range or Industry Benchmark: 110,000 - 177,500 USD yearly

Corporate Functions Technology (Legal & Security)

Who We Are Looking For

We are seeking a Data Engineer to design, build, and support a modern Legal Data Lakehouse platform on AWS and Databricks. This role will focus on developing scalable, high‑performance data pipelines and enabling trusted, governed data capabilities supporting legal operations, compliance analytics, reporting, and AI/ML use cases.

The ideal candidate brings strong hands‑on experience in Databricks, AWS data platforms, and enterprise data engineering practices, with experience delivering solutions in regulated environments aligned to security, compliance, and audit requirements.

Why This Role Is Important to Us

State Street's Legal, Security, and Compliance functions operate across contracts, matters, regulatory obligations, documents, and workflows distributed across multiple systems. Building robust data pipelines ensures data is accessible, reliable, and usable for analytics and decision‑making.

This role is critical to enabling a scalable and governed data foundation supporting legal analytics and operational reporting while strengthening data quality, auditability, and consistency.

What You Will Be Responsible For
  • Design, build, and maintain scalable data pipelines using PySpark, Python, and Spark SQL
  • Develop and optimize ETL/ELT workflows on Databricks using Delta Lake
  • Implement Lakehouse architecture (Bronze/Silver/Gold layers) for enterprise data platforms
  • Build and manage Databricks Jobs, Workflows, and Notebooks for batch and streaming workloads
  • Develop reusable frameworks for data ingestion, processing, and orchestration
  • Containerize data workloads using Docker and automate processes via scripting
  • Integrate Databricks data pipelines with Power Platform solutions (Power Apps, Power Automate)
  • Enable data exposure for business users via APIs, connectors, and curated datasets

Data Platform & Database Management
  • Design and optimize data lakehouse architectures using Databricks
  • Integrate data from SQL Server, Oracle, and other enterprise source systems
  • Apply advanced data modeling techniques (dimensional modeling, partitioning, optimization)
  • Work with structured and semi‑structured data (e.g., JSON, Parquet)
  • Tune performance using caching, indexing, and Spark optimization techniques
  • Publish curated datasets for consumption in Power BI dashboards, Power Apps, and Power Automate workflows

Testing & Troubleshooting
  • Ensure data quality using unit testing, validation frameworks, and automated checks
  • Monitor and troubleshoot distributed Spark workloads and pipelines
  • Analyze logs and resolve production issues across Databricks and cloud environments
  • Maintain data lineage, consistency, and audit readiness

Ownership & Collaboration
  • Collaborate with Legal, Security, Compliance, and Enterprise Data teams to deliver scalable solutions
  • Translate business requirements into robust data engineering designs
  • Act as a Subject Matter Expert (SME) in Databricks and Lakehouse architecture
  • Lead initiatives with minimal supervision and take full ownership of deliverables

Governance, Security & Compliance
  • Support implementation of data governance frameworks using Databricks Unity Catalog and AWS controls (IAM, KMS)
  • Ensure adherence to:
    • Data privacy and regulatory requirements (e.g., GDPR)
    • Internal security and audit standards
  • Implement and maintain:
    • Data access controls
    • Data classification and handling standards
  • Collaborate with IAM and security teams to ensure secure data access

Delivery & Documentation
  • Design and maintain CI/CD pipelines using Harness, Azure DevOps, or GitHub
  • Automate deployment of Databricks assets using Databricks Repos and CLI
  • Monitor, schedule, and optimize workflows using Databricks orchestration tools
  • Maintain clear documentation including architecture, data flows, and runbooks
  • Continuously improve performance, scalability, and cost efficiency

What We Value
  • Strong analytical thinking and problem‑solving skills
  • Hands‑on expertise in data engineering and distributed data processing
  • Ability to work in a fast‑paced, enterprise environment
  • Effective communication and collaboration with cross‑functional teams
  • Ownership…