Data Engineer (SC Cleared)

Job in Newcastle upon Tyne, Newcastle, Tyne and Wear, SY7, England, UK
Listing for: Scrumconnect Consulting
Full Time position
Listed on 2026-05-18
Job specializations:
  • IT/Tech
    Data Engineer, Cloud Computing, Big Data, AWS
Salary/Wage Range or Industry Benchmark: GBP 60,000 - 80,000 per year
Job Description & How to Apply Below
Position: Data Engineer (SC Cleared)
Location: Newcastle upon Tyne

Job Description

A hands‑on data engineering role within a large‑scale cloud data programme, responsible for building, maintaining, and troubleshooting data pipelines using Apache Spark, PySpark, Apache Airflow, and a broad suite of AWS services. You will apply strong analytical and engineering skills to deliver trusted, well‑governed data assets in a modern, cloud‑native environment.

  • Python
  • AWS
  • Cloud Data Pipelines
About Scrumconnect

Scrumconnect is a leading UK technology consultancy delivering digital transformation across public and private sectors, contributing to over 20% of the UK’s major citizen‑facing public services. We specialise in cloud engineering, data platforms, and agile delivery, helping clients build scalable, secure, and user‑centred digital solutions that create real impact.

Security Clearance

Active SC clearance is a mandatory, non‑negotiable requirement. Candidates must hold current, in‑date Security Check (SC) clearance at the time of application. Sponsorship is not available. Applications without active SC clearance will not be considered.

Working Arrangement

This role is hybrid. Candidates must be willing and able to travel to the London office three days per week. Remaining days may be worked remotely from anywhere in the UK.

About the Role

You will work as a Data Engineer on a complex, cloud‑based data programme — designing, building, and maintaining data pipelines that process large volumes of data across a modern AWS‑native stack. Using Apache Spark and PySpark for distributed data processing, Apache Airflow for orchestration, and a range of AWS services for storage, compute, and analytics, you will help deliver reliable, well‑governed data assets to downstream users.

You will apply strong data analysis skills to identify root causes of data issues, work with dimensional data models and slowly changing dimensions, and implement infrastructure as code using Terraform. Familiarity with engineering best practices and the ability to translate customer expectations into applied technical functionality are key to success in this role.

Key Responsibilities
  • Build and maintain scalable data pipelines using Apache Spark and PySpark, processing and transforming large datasets across distributed cloud infrastructure.
  • Configure and manage Apache Airflow DAGs for task orchestration, ensuring reliable scheduling, monitoring, and execution of data processing workflows.
  • Perform data analysis to identify and resolve root causes of pipeline failures and data quality issues, including reviewing EMR output logs and CloudWatch metrics.
  • Apply understanding of dimensional data models and slowly changing dimensions (SCD) to design and maintain well‑structured, analytically trusted data assets.
  • Provision and manage cloud infrastructure using Terraform. Containerise solutions using Docker and manage deployments through GitLab CI/CD pipelines and release tagging.
  • Apply understanding of both server‑side and client‑side encryption patterns within AWS. Work within IAM policies and data governance standards appropriate to a regulated government environment.
Technical Skills Required

Languages & Analytics
  • Python — primary language for pipeline development and data processing
  • SQL — used for querying, transformation, and validation across data stores
  • PySpark — for distributed data processing using Apache Spark on AWS EMR
Data Processing & Orchestration
  • Apache Spark — understanding of distributed data processing architecture and execution
  • Apache Airflow — configuring DAGs and managing task orchestration at scale
  • Jupyter Notebooks — for exploratory data analysis and pipeline prototyping
  • Understanding of dimensional data models and slowly changing dimensions (SCD Types 1, 2, and 3)
  • Data analysis skills to identify root cause of issues within pipelines and data assets
AWS Services
  • Amazon EMR — running Spark workloads and reviewing output logs
  • Amazon Athena — ad hoc querying of data in S3
  • Amazon Textract and Comprehend — familiarity with AI/ML document extraction and NLP services
  • AWS S3, IAM, CloudWatch, EC2, ECR — core platform services used day‑to‑day
  • AWS console proficiency — navigating, configuring, and monitoring services
  • Understanding of server‑side and client‑side encryption within AWS
Infrastructure, DevOps & Delivery
  • Terraform — Infrastructure as Code for provisioning and managing AWS environments
  • Docker — containerisation of data engineering solutions
  • GitLab — source code management, CI/CD pipeline configuration, release tagging, and component versioning
  • Familiarity with engineering best practices
  • Ability to translate customer expectations into applied, functional technical solutions
Technology Stack at a Glance
  • Python
  • PySpark
  • SQL
  • Apache Spark
  • Apache Airflow
  • Jupyter Notebooks
  • Dimensional Modelling / SCD
  • AWS EMR
  • Amazon Athena
  • AWS S3
  • AWS IAM
  • AWS CloudWatch
  • AWS EC2 / ECR
  • Amazon Textract
  • Amazon Comprehend
  • Terraform
  • Docker
  • GitLab CI/CD