×
Register Here to Apply for Jobs or Post Jobs. X

Data Quality Engineer; Databricks

Job in Abu Dhabi, UAE/Dubai
Listing for: Datamatics Technologies LLC
Full Time position
Listed on 2026-06-13
Job specializations:
  • Software Development
    Data Engineering
Salary/Wage Range or Industry Benchmark: 120000 - 200000 AED Yearly AED 120000.00 200000.00 YEAR
Job Description & How to Apply Below
Position: Data Quality Engineer (Databricks)

Job Description – Data Quality Engineer (Databricks) – x 4 Positions

Location: Abu Dhabi, UAE – Onsite (Open to Relocate)

Duration: 6 months (Extendable to One Year)

Experience: 5 to 7 Years

Project start date: 1st July – Immediate joiners will be preferred

Role Overview

The Data Quality Engineer will be responsible for designing, implementing, and operating ADC's enterprise data quality framework within the Databricks platform. The role will deliver automated profiling, quality rule execution, cleansing, monitoring, remediation support, and quality reporting capabilities across 170 datasets and 1,346 prioritised Critical Data Elements (CDEs). Working closely with Data Modellers, Data Catalogue Specialists, business data owners, and platform engineers, the Data Quality Engineer will establish scalable and reusable quality controls that improve trust, accuracy, completeness, consistency, timeliness, validity, and uniqueness across ADC's data estate.

Key Responsibilities Databricks Platform Configuration and Administration
  • Configure and manage the Databricks environment supporting enterprise data quality operations.
  • Establish and maintain compute clusters, PySpark notebook frameworks, Delta Lake storage structures, and Unity Catalog integration.
  • Optimise platform performance for large‑scale profiling and rule execution across all in-scope datasets and CDEs.
  • Implement development, testing, and production deployment standards for data quality assets.
Data Profiling and Quality Assessment
  • Design and develop AI‑assisted profiling notebooks using PySpark.
  • Perform baseline data quality assessments across the six quality dimensions: completeness, accuracy, consistency, validity, timeliness, uniqueness.
  • Capture and analyse null value rates, duplicate records, invalid values, format violations, outliers, and schema drift.
  • Produce quality profiling outputs for all prioritised CDEs and datasets.
Data Quality Rule Factory Development
  • Design and implement a reusable Data Quality Rule Factory.
  • Build parameterised PySpark‑based rule templates capable of supporting large‑scale rule deployment.
  • Enable automated generation and management of approximately 6,730 data quality rules without manual rule‑by‑rule development.
  • Ensure rules are reusable, configurable, and maintainable across multiple datasets and domains.
Data Quality Controls and Pipeline Integration
  • Deploy quality rules as reusable Databricks Jobs integrated into Delta Lake processing pipelines.
  • Embed quality controls within Bronze, Silver, and Gold processing stages.
  • Implement automated quality gates preventing data progression where defined thresholds are not met.
  • Maintain rule traceability and execution history for audit and governance purposes.
Data Cleansing and Quality Improvement
  • Develop automated remediation and cleansing pipelines using PySpark.
  • Implement standardisation routines, data enrichment processes, deduplication logic, and schema harmonisation controls.
  • Deploy machine learning models managed through MLflow for anomaly detection, exact duplicate detection, and fuzzy matching/duplicate identification.
  • Ensure all AI and ML recommendations are explainable, auditable, and routed through human‑in‑the‑loop validation processes where required.
Exception Management and Reprocessing
  • Design and manage exception handling processes for failed quality records.
  • Implement quarantine Delta Lake tables serving as the Failed Record Register.
  • Capture and maintain failure reason, associated CDE, rule reference, processing timestamp, and resolution status.
  • Develop reprocessing workflows to support correction and controlled re‑ingestion of remediated records.
Data Quality Metrics and Reporting
  • Develop Delta Lake metric aggregation structures supporting enterprise quality reporting.
  • Calculate and publish Data Quality Index (DQI) scores, dimension‑level quality metrics, rule pass/fail rates, dataset compliance scores, and SLA adherence indicators.
  • Provide curated outputs to support Power BI quality dashboards and executive reporting.
Monitoring, Alerting and Predictive Quality Management
  • Configure automated quality monitoring and alerting mechanisms.
  • Implement threshold‑based notifications using…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary