Databricks Architect Los Angeles CA; Hybrid
Job in
Los Angeles, Los Angeles County, California, 90079, USA
Listed on 2026-06-17
Listing for:
Tech Mirrors
Full Time, Contract
position Listed on 2026-06-17
Job specializations:
-
IT/Tech
Data Engineering, Cloud Computing: Infrastructure & Operations
Job Description & How to Apply Below
Databricks Architect
Location:
Los Angeles CA (Hybrid)
Contract
Role OverviewThe Databricks Architect is responsible for designing, implementing, and optimizing scalable data engineering and analytics solutions on the Databricks Lakehouse Platform on AWS. This role requires deep expertise in distributed data processing, Delta Lake–based architectures, and modern data engineering best practices. The architect will partner with cross‑functional teams to define data strategies, ensure platform reliability, and enable advanced analytics, ML, and BI workloads across the enterprise.
MustDemonstrate (Critical Architectural Capabilities)
- Designing Databricks-based Lakehouse architectures on AWS
- Clear separation of compute layer vs. serving layer
- Low‑latency API/data delivery strategy (cannot rely solely on Spark)
- Caching strategies for performance acceleration and cost efficiency
- Data partitioning and optimization strategy, including file‑size tuning
- Ability to handle multi‑terabyte structured time‑series datasets
- Skill in distilling architectural significance from complex requirements
- Strong curiosity and requirement‑probing mindset
- Player‑coach leadership style (hands‑on engineering + design guidance)
- Architect end‑to‑end Databricks Lakehouse solutions on AWS for ingestion, processing, storage, and consumption.
- Define and implement Delta Lake patterns including Medallion Architecture (Bronze/Silver/Gold).
- Lead design of scalable data pipelines using PySpark, Spark SQL, Workflows, and Delta Live Pipelines.
- Architect solutions for structured, semi‑structured, and time‑series workloads.
- Ensure architectures support low‑latency delivery, serving‑layer separation, and high performance.
- Build robust ETL/ELT pipelines using Databricks Notebooks, Jobs, and Workflows.
- Implement streaming and incremental data processing as needed using Structured Streaming.
- Optimize Spark jobs with partitioning, caching, ZORDER, file compaction, and shuffle reduction.
- Implement CI/CD automation using Databricks Repos, Git Lab/Git Hub, and Infrastructure‑as‑Code.
- Architect Databricks solutions using AWS‑native services including: S3 (data storage), Glue Catalog (metadata governance), IAM (identity & access control), Lambda / API Gateway (low‑latency serving mechanisms), Kinesis (streaming ingestion).
- Ensure security, governance, and compliance via Unity Catalog, RBAC, and encryption standards.
- Monitor workloads and optimize cluster sizing, autoscaling, and cost controls.
- Partner with data engineers, ML engineers, BI teams, and business stakeholders.
- Serve as a Databricks SME, defining best practices, standards, and architectural patterns.
- Conduct architectural reviews and guide teams on solution choices.
- Lead PoCs, evaluate new Databricks features, and drive platform adoption across teams.
- Define standards for data quality, lineage, observability, and operational governance.
- Implement automated testing frameworks for pipelines and notebooks.
- Establish monitoring dashboards, performance baselines, and reliability KPIs.
- 7+ years in data engineering or data architecture.
- 3+ years hands‑on with Databricks.
- Strong expertise in Spark, PySpark, SQL, and distributed data systems.
- Deep understanding of Delta Lake features (ACID, OPTIMIZE, ZORDER, Auto Loader).
- Experience with workflows/orchestration and Databricks REST APIs.
- Hands‑on expertise with AWS, specifically: S3, Glue / Glue Catalog, IAM, Lambda, Kinesis.
- Familiarity with CI/CD, Git, Dev Ops, and IaC (Terraform preferred).
- Strong analytical and problem‑solving abilities.
- Excellent communication and stakeholder management.
- Ability to lead design discussions and guide engineering teams.
- Strong documentation and architectural blueprinting abilities.
- Databricks Certifications such as:
Databricks Certified Data Engineer Professional, Databricks Certified Machine Learning Professional, Databricks Lakehouse Fundamentals. - Experience with MLflow, Feature Store, or MLOps pipelines.
- Experience in regulated industries (BFSI, healthcare, etc.).
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×