More jobs:
Data Ingestion and Enrichment team
Job in
Greater London, London, Greater London, W1B, England, UK
Listed on 2026-05-30
Listing for:
Preply
Full Time
position Listed on 2026-05-30
Job specializations:
-
Software Development
Data Engineering
Job Description & How to Apply Below
Location: Greater London
Senior II Data Engineer – Data Ingestion and Enrichment
We build and maintain Preply’s data foundations that support analytics, machine learning, and product features. The role combines hands‑on engineering with technical leadership, driving the delivery and quality of ingestion pipelines and data products across the organization.
Responsibilities- Design, build, and own Preply’s data lake and data‑as‑a‑product, ensuring clear ownership, schemas, and quality expectations for every dataset.
- Develop and operate scalable batch and streaming ingestion pipelines that support both real‑time and analytical use cases.
- Define and enforce data contracts, embed validation, anomaly detection, and quality checks early in the ingestion lifecycle.
- Create enrichment logic that joins, standardises, and contextualises data across domains, supporting historical tracking and dataset versioning.
- Instrument pipelines with observability for freshness, latency, data quality, and cost; contribute to SLOs, alerting, and incident response playbooks.
- Apply consistent access control, classification, and privacy protections at ingestion time, masking or anonymising sensitive data by default.
- Contribute to standardised ingestion templates, libraries, and platform tooling to enable teams to onboard new data sources independently.
- Collaborate closely with Product, Backend, Analytics, and ML partners to align on ingestion requirements, trade‑offs, and priorities.
- Experience driving architectural patterns of a large, high‑scale application (e.g., APIs, data pipelines, efficient algorithms).
- Solid experience in platform or data engineering teams with evidence of leading multi‑stakeholder deliveries.
- Familiarity with cloud platforms (AWS/GCP) and modern Dev Ops practices.
- Hands‑on experience designing real‑time and batch data processing infrastructures using Spark, Flink, Kafka, Debezium, etc.
- Expertise with orchestration tools such as Airflow, dbt, or similar.
- Strong problem‑solving skills and a proactive, innovative mindset focused on continuous improvement.
- Strong communication and cross‑functional collaboration skills (English level B2+).
- Proven track record scaling data infrastructures within fast‑growing startups.
- Experience with Terraform/Kubernetes for data tooling.
- SQL proficiency.
- An open, collaborative, dynamic, and diverse culture.
- A generous monthly allowance for lessons on , a Learning & Development budget, and time off for self‑development.
- A competitive financial package with equity, leave allowance, and health insurance.
- Access to free mental‑health support platforms.
- The opportunity to unlock the potential of learners and tutors through language learning and teaching in 175 countries.
is committed to creating an inclusive environment where people of diverse backgrounds can thrive. We consider all applications for employment without regard to race, color, religion, gender identity or expression, sexual orientation, national origin, disability, age or veteran status.
#J-18808-LjbffrNote that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×