Data Engineer (Databricks), Assistant Vice President
Listed on 2026-05-18
IT/Tech
Data Engineer, Data Science Manager
Corporate Functions Technology (Legal & Security)
Who We Are Looking For
We are seeking a Data Engineer to design, build, and support a modern Legal Data Lakehouse platform on AWS and Databricks. This role will focus on developing scalable, high‑performance data pipelines and enabling trusted, governed data capabilities supporting legal operations, compliance analytics, reporting, and AI/ML use cases.
The ideal candidate brings strong hands‑on experience in Databricks, AWS data platforms, and enterprise data engineering practices, with experience delivering solutions in regulated environments aligned to security, compliance, and audit requirements.
Why This Role Is Important to Us
State Street's Legal, Security, and Compliance functions operate across contracts, matters, regulatory obligations, documents, and workflows distributed across multiple systems. Building robust data pipelines ensures data is accessible, reliable, and usable for analytics and decision‑making.
This role is critical to enabling a scalable and governed data foundation supporting legal analytics and operational reporting while strengthening data quality, auditability, and consistency.
What You Will Be Responsible For
- Design, build, and maintain scalable data pipelines using PySpark, Python, and Spark SQL
- Develop and optimize ETL/ELT workflows on Databricks using Delta Lake
- Implement Lakehouse architecture (Bronze/Silver/Gold layers) for enterprise data platforms
- Build and manage Databricks Jobs, Workflows, and Notebooks for batch and streaming workloads
- Develop reusable frameworks for data ingestion, processing, and orchestration
- Containerize data workloads using Docker and automate processes via scripting
- Integrate Databricks data pipelines with Power Platform solutions (Power Apps, Power Automate)
- Enable data exposure for business users via APIs, connectors, and curated datasets
- Design and optimize data lakehouse architectures using Databricks
- Integrate data from SQL Server, Oracle, and other enterprise source systems
- Apply advanced data modeling techniques (dimensional modeling, partitioning, optimization)
- Work with structured and semi‑structured data (e.g., JSON, Parquet)
- Tune performance using caching, indexing, and Spark optimization techniques
- Publish curated datasets for consumption in Power BI dashboards, Power Apps, and Power Automate workflows
- Ensure data quality using unit testing, validation frameworks, and automated checks
- Monitor and troubleshoot distributed Spark workloads and pipelines
- Analyze logs and resolve production issues across Databricks and cloud environments
- Maintain data lineage, consistency, and audit readiness
- Collaborate with Legal, Security, Compliance, and Enterprise Data teams to deliver scalable solutions
- Translate business requirements into robust data engineering designs
- Act as a Subject Matter Expert (SME) in Databricks and Lakehouse architecture
- Lead initiatives with minimal supervision and take full ownership of deliverables
- Support implementation of data governance frameworks using Databricks Unity Catalog and AWS controls (IAM, KMS)
- Ensure adherence to:
  - Data privacy and regulatory requirements (e.g., GDPR)
  - Internal security and audit standards
- Implement and maintain:
  - Data access controls
  - Data classification and handling standards
- Collaborate with IAM and security teams to ensure secure data access
- Design and maintain CI/CD pipelines using Harness, Azure DevOps, or GitHub
- Automate deployment of Databricks assets using Databricks Repos and CLI
- Monitor, schedule, and optimize workflows using Databricks orchestration tools
- Maintain clear documentation including architecture, data flows, and runbooks
- Continuously improve performance, scalability, and cost efficiency
- Strong analytical thinking and problem‑solving skills
- Hands‑on expertise in data engineering and distributed data processing
- Ability to work in a fast‑paced, enterprise environment
- Effective communication and collaboration with cross‑functional teams
- Ownership…