
Senior Data Engineer - Pathogen

Job in Oxford, Oxfordshire, OX1, England, UK
Listing for: Ellison Institute of Technology Oxford
Full Time position
Listed on 2025-11-15
Job specializations:
  • IT/Tech
    Data Engineer, Data Analyst

Senior Data Engineer, EIT Pathogen Program

The Ellison Institute of Technology (EIT) tackles humanity's greatest challenges by turning science and technology into impactful global solutions. Focused on areas such as health, food security, sustainable agriculture, climate change, clean energy, and robotics in an era of artificial intelligence, EIT blends groundbreaking research with practical applications to deliver lasting results.

A cornerstone of EIT's mission is its upcoming 300,000-square‑foot research facility at the Oxford Science Park, set to open in 2027. This campus will feature advanced labs, an oncology and preventative care clinic, and collaborative spaces to strengthen its partnership with the University of Oxford. It will also host the Ellison Scholars, driving innovation for societal benefit.

The Pathogen Mission highlights EIT's transformative approach, using Whole Genome Sequencing (WGS) and Oracle's cloud technology to create a global pathogen metagenomics system. This initiative aims to improve diagnostics, provide early epidemic warnings, and guide treatments by profiling antimicrobial resistance. The goal is to deliver certified diagnostic tools for widespread use in labs, hospitals, and public health.

EIT fosters a culture of collaboration, innovation, and resilience, valuing diverse expertise to drive sustainable solutions to humanity's enduring challenges.

Key Responsibilities
  • Ensure data in the platform is acquired, processed, curated, and made accessible to scientists, digital analytics products, bioinformatics, and AI at a high standard of quality and availability.
  • Ensure data access adheres to FAIR principles (Findable, Accessible, Interoperable, and Reusable).
  • Ensure data is secured and compliant with regulatory, legal, and data sharing requirements.
  • Ensure efficient, performant, and high‑quality pipelines for data ingestion into the platform.
  • Contribute to building data management components, including reference data management, de‑identification, data curation, pathogen and technical metadata catalogues, and data access controls.
  • Ensure efficient, secure, scalable, available, and performant data storage components, including genomic variant storage, clinical data stores, and clinical imaging.
  • Ensure robust ingest services capable of seamlessly integrating data from distributed sequencing devices, including real‑time telemetry streams.
  • Ensure data is processed to enable optimal access and consumption by digital analysis products, bioinformatics pipelines, and researchers/scientists.
Requirements
Essential Knowledge, Skills and Experience
  • Deep experience in building modern data platforms using cloud‑based architectures and tools.
  • Experience delivering data engineering solutions on cloud platforms, preferably Oracle Cloud Infrastructure (OCI), AWS, or Azure.
  • Proficient in Python and workflow orchestration tools such as Airflow or Prefect.
  • Expert in data modeling, ETL, and SQL.
  • Experience with real‑time analytics from telemetry and event‑based streaming (e.g., Kafka).
  • Experience managing operational data stores with high availability, performance, and scalability.
  • Expertise in data lakes, lakehouses, Apache Iceberg, and data mesh architectures.
  • Proven ability to build, deliver, and support modern data platforms at scale.
  • Strong knowledge of data governance, data quality, and data cataloguing.
  • Experience with modern database technologies, including Iceberg, NoSQL, and vector databases.
  • Embraces innovation and works closely with scientists and partners to explore cutting‑edge technology.
  • Knowledge of master data, metadata, and reference data management.
  • Understanding of Agile practices and sprint‑based methodologies.
  • Active contributor to knowledge sharing and collaboration.
Desirable Knowledge, Skills and Experience
  • Familiarity with genomics and associated data standards.
  • Experience with healthcare clinical data and standards such as OMOP and SNOMED.
  • Familiarity with containerization tools such as Docker and Kubernetes.
  • Familiarity with Git and CI/CD workflows.
Key Attributes
  • Strong collaborator with excellent communication skills.
  • Comfortable working in a fast‑paced, dynamic environment.
  • Eagerness to learn and…
Position Requirements
10+ years of work experience