Principal Data Engineer
Job in
Cape Town, 7100, South Africa
Listing for:
Vivo Energy
Full Time
position
Listed on 2026-07-02
Job specializations:
-
Software Development
Data Engineering
Job Description & How to Apply Below
Role Overview
Reporting to the Data & Analytics Lead, the Principal Data Engineer is a hands-on role, combining the building and optimisation of production data pipelines with the technical leadership needed to raise engineering standards and develop the people around them. The role functions as a key interface between a) business stakeholders and department heads, b) the wider data and analytics team, and c) owners of source systems and data in Finance and IT.
Key Responsibilities
Designing and building robust, production-grade data pipelines on Databricks, making appropriate use of current platform capabilities such as Spark Declarative Pipelines (SDP)Working “under the hood” to optimise the platform for performance and cost — tuning compute, jobs and queries, managing storage and table layout, and keeping platform spend under controlDefining and maintaining consistent business metrics and a semantic layer (for example, using Metric Views) so reporting is built on trusted, reusable definitionsIntegrating data from core business systems, including SAP S/4
HANA, and other sources, using tools such as Fivetran (both SaaS connectors and HVR)Setting and upholding engineering standards across the team — code quality, testing, documentation, CI/CD and data governanceCoaching and mentoring junior and mid-level engineers, reviewing their work and helping them developPartnering with business stakeholders to understand their needs, shape practical solutions and ensure the platform delivers genuine valueEngaging data owners in Finance and IT to ensure data is well understood, valid and fit for purpose, and initiating data quality improvements where requiredRequirements
Skills & Experience
Experience
At least 3 years of hands-on, daily Databricks experience, covering both pipeline development and under-the-hood performance and cost optimisationUp to date with recent platform developments, such as Spark Declarative Pipelines (SDP) and Metric ViewsStrong proficiency in programming languages commonly used in data engineering, such as Python, SQL and SparkAdvanced experience with data manipulation, data modelling, database design and query optimizationExperience managing or coaching junior developersA track record of managing and influencing business stakeholdersExperience working with SAP S/4
HANA data sets, Fivetran (including both SaaS connectors and HVR), Power BI semantic modelling would be beneficialKey Competencies
Combining deep, hands-on engineering skill with sound judgement about cost, performance and long-term maintainabilityCoaching, mentoring and raising the capability of less experienced engineersCollaborating, communicating confidently and influencing business stakeholdersBreaking down complex technical concepts and explaining them simply to non-technical audiencesStaying current with a fast-moving platform and bringing new capabilities into everyday practiceTaking ownership and driving work independently, from concept through to productionImportant to note:
This role requires full-time office-based attendance, five days per week.
To apply
Qualified candidates to apply by uploading a cover letter and a recent resume by close on business 31 July 2026
.
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here: