Data Ingestion and Enrichment Team Job, London Area, Greater London, England, UK, Software Development

Position: Staff Data Ingestion and Enrichment team
Location: Greater London

Senior II Data Engineer – Data Ingestion and Enrichment

We build and maintain Preply’s data foundations that support analytics, machine learning, and product features. The role combines hands‑on engineering with technical leadership, driving the delivery and quality of ingestion pipelines and data products across the organization.

Responsibilities

Design, build, and own Preply’s data lake and data‑as‑a‑product, ensuring clear ownership, schemas, and quality expectations for every dataset.
Develop and operate scalable batch and streaming ingestion pipelines that support both real‑time and analytical use cases.
Define and enforce data contracts, embedding validation, anomaly detection, and quality checks early in the ingestion lifecycle.
Create enrichment logic that joins, standardises, and contextualises data across domains, supporting historical tracking and dataset versioning.
Instrument pipelines with observability for freshness, latency, data quality, and cost; contribute to SLOs, alerting, and incident response playbooks.
Apply consistent access control, classification, and privacy protections at ingestion time, masking or anonymising sensitive data by default.
Contribute to standardised ingestion templates, libraries, and platform tooling to enable teams to onboard new data sources independently.
Collaborate closely with Product, Backend, Analytics, and ML partners to align on ingestion requirements, trade‑offs, and priorities.

Qualifications

Experience driving architectural patterns of a large, high‑scale application (e.g., APIs, data pipelines, efficient algorithms).
Solid experience in platform or data engineering teams with evidence of leading multi‑stakeholder deliveries.
Familiarity with cloud platforms (AWS/GCP) and modern DevOps practices.
Hands‑on experience designing real‑time and batch data processing infrastructures using Spark, Flink, Kafka, Debezium, etc.
Expertise with orchestration and transformation tools such as Airflow, dbt, or similar.
Strong problem‑solving skills and a proactive, innovative mindset focused on continuous improvement.
Strong communication and cross‑functional collaboration skills (English level B2+).

Nice to Have

Proven track record scaling data infrastructures within fast‑growing startups.
Experience with Terraform/Kubernetes for data tooling.
SQL proficiency.

Benefits

An open, collaborative, dynamic, and diverse culture.
A generous monthly allowance for lessons on Preply, a Learning & Development budget, and time off for self‑development.
A competitive financial package with equity, leave allowance, and health insurance.
Access to free mental‑health support platforms.
The opportunity to unlock the potential of learners and tutors through language learning and teaching in 175 countries.

Diversity, Equity, and Inclusion

Preply is committed to creating an inclusive environment where people of diverse backgrounds can thrive. We consider all applications for employment without regard to race, color, religion, gender identity or expression, sexual orientation, national origin, disability, age, or veteran status.
