Data Platform Architect; Databricks
Listed on 2026-06-03
-
IT/Tech
Data Engineering, Cloud Computing, Systems Engineer
Hartford, CT, hybrid (on-site 3 days per week)
Department:
Data & Analytics Platform
Business Unit:
Infrastructure and Cloud Services
Reports To:
Senior Director, Data Platform
We are seeking a Staff Data Platform Architect to serve as the primary technical consultant and strategist for our enterprise Databricks ecosystem. This is a high-impact, senior individual contributor role focused on driving technical excellence, automation, and fiscal efficiency. Unlike a traditional administrator, you will act as an internal consultant to our extensive Databricks team, providing the blueprint for scalable pipelines, advanced automation, and long-term capacity forecasting.
You will bridge the gap between complex infrastructure (Unix/Linux) and modern AI/ML workflows, ensuring our platform is both cutting-edge and cost-effective.
- Strategic Consultation & Architecture
- Act as the Technical Authority for Databricks, advising engineering teams on Unity Catalog governance, workspace topology, and complex migration patterns.
- Consult on the design of high-performance data pipelines, specifically optimizing Delta Live Tables (DLT) and structured streaming for scale.
- Partner with teams using Ab Initio and Fivetran to ensure seamless integration and architectural alignment across the multi-platform ecosystem.
- Platform Optimization & Financial Forecasting
- Capacity Planning:
Own the forecasting of DBU consumption and partner with leadership on multi-year contract utilization and commitment management. - Cost Engineering:
Design and implement sophisticated cost-attribution models (chargeback/showback) and proactively identify “leaks” in compute spend. - Performance Tuning:
Define enterprise standards for Z-ordering, partitioning, and compute strategy to maximize performance-per-dollar.
- Capacity Planning:
- Advanced Automation & AI Operations
- Architect “self-healing” infrastructure through Python and Bash automation, reducing manual toil for the wider engineering team.
- Consult on the operationalization of ML models, leveraging MLflow and Model Serving to move experiments into production.
- Guide the integration of Generative AI and LLM-backed workflows into the standard data engineering lifecycle.
- Infrastructure & Linux Engineering
- Provide deep-tier expertise for the Unix/Linux environments underpinning our compute nodes.
- Develop advanced automation scripts for cluster lifecycle management, monitoring, and security hardening.
- Experience:
7+ years in Data Engineering/Platform roles, with at least 4 years of deep architectural experience in Databricks. - The “Consultant” Mindset:
Proven ability to advise multiple teams, influence technical roadmaps, and communicate complex trade-offs to senior leadership. - Technical Depth:
Mastery of Unity Catalog, Delta Lake, and PySpark. - Systems Expertise:
Strong proficiency in Unix/Linux systems administration and shell scripting (Bash) for infrastructure automation. - Financial Acumen:
Experience managing cloud consumption (DBUs), forecasting usage, and implementing cost-governance tools. - Tooling:
High proficiency with Git-based CI/CD and experience in Oracle environments.
- Hands-on experience with Infrastructure-as-Code (Terraform/Ansible) for Databricks provider.
- Exposure to Ab Initio or Fivetran in a large-scale enterprise environment.
- Background in highly regulated industries (e.g., Finance or Insurance).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).