×
Register Here to Apply for Jobs or Post Jobs. X

Senior Scientific Data Engineer, Data Platform

Job in Stroude, Egham, Surrey County, TW20, England, UK
Listing for: Recursion
Full Time position
Listed on 2025-12-17
Job specializations:
  • IT/Tech
    Data Engineer, Big Data
Job Description & How to Apply Below
Location: Stroude

Senior Scientific Data Engineer, Data Platform

London, England;
Milton Park, England

Your work will change lives. Including your own.

Recursion is a leading, clinical-stage Tech Bio company decoding biology to industrialize drug discovery. Central to its mission is the Recursion Operating System (OS), a platform built across diverse technologies that continuously expands one of the world’s largest proprietary biological, chemical and patient-centric datasets. Recursion leverages sophisticated machine-learning algorithms to distill from its dataset a collection of trillions of searchable relationships across biology and chemistry unconstrained by human bias.

By commanding massive experimental scale—up to millions of wet lab experiments weekly—and massive computational scale—owning and operating one of the most powerful supercomputers in the world—Recursion is uniting technology, biology, chemistry and patient-centric data to advance the future of medicine.

In this role, you will:

  • Build, scale, and operate a data platform
    . You will be a member of the platform team responsible for building, operating, and tuning a data platform that allows users to discover and query across the breadth of our data at Recursion, which includes a chemistry library of billions of compounds, petabytes of cellular microscopy images taken in millions of different experimental contexts, and millions of assay results, all supporting Recursion’s drug discovery.
  • Build relatability into a heterogeneous dataset. At Recursion, we generate datasets based on a wide swath of diverse biological models and treatment approaches. You'll work with biologists, chemists, and data scientists to build relatability and query-ability into these datasets so they can be used in the future to answer the sorts of questions we haven't even thought of asking yet.
  • Act as a mentor, coach, and sponsor. You will share your technical knowledge and experiences, delivering impact, learning, and growth across teams at Recursion.

The Team You’ll Join

  • You will join the Data Platform that builds and maintains our Data Lake/house, the scientific data products we own, and the pipelines that feed them. The team you will join is responsible for all our data products that are composed of public and 3rd party data feeds (e.g. ChEMBL, patents, reactions, chemical vendor catalogues), which includes the infrastructure to build them and the expertise in how to use and integrate them with the rest of the company.

    This team solves the problem of making our diverse data discoverable, queryable, and relatable across datasets, while continuing to add new data sets and data modalities as we grow. This will require collaboration with many different groups in order to share best practices for usage of our data products, and to receive feedback to keep our products fit for purpose.

The Experience You’ll Need

  • A degree in drug-discovery related science (e.g. Chemistry, Biology) - you will need to make informed choices on the scientific data you work with
  • Excited about the possibilities of biological and chemical data transforming drug discovery, and owning the data products that enable them
  • 5+ years of deep experience in modern, cloud-based data engineering tools to build data platforms that enable the discovery, query, and processing of large datasets.
  • Up to date on industry trends and tools. You understand the tradeoffs between different data platform architectures and technologies like a data lake, a data warehouse and can draw on this knowledge to develop data platforms solutions for Recursion.
  • Excitement to learn parts of our tech stack that you might not already know. Our current tech stack includes:
    Python, dbt, Prefect, Big Query, Data stream, Five Tran, Postgre

    SQL, Agentic coding, GCS, Kubernetes, CI/CD, Infrastructure as Code. Our cloud services are provided by Google Cloud Platform.
  • Experience working collaboratively on projects with significant ambiguity and technical complexity, ideally spanning multiple systems and involving diverse technologies.
  • A people-first mindset. Despite the deadlines, we always prioritize supporting our coworkers in their growth and experience.
  • A drive…
Position Requirements
10+ Years work experience
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary