Research Engineer
Listed on 2026-05-31
-
IT/Tech
AI Engineer (Applied/Software), Machine Learning/ ML Engineer, Data Scientist, Data Engineering
The Singapore-ETH Centre is a research centre focused on sustainable solutions and innovative technologies.
Project backgroundThe healthcare systems of the future must harness data effectively to support clinicians, allowing them to focus on patient care while leveraging AI to detect patterns beyond human perception, enhance diagnostic accuracy, optimise workflows, and improve risk assessment and communication. Developing AI models that address these needs is particularly urgent in ageing societies, where rising patient numbers coincide with increasing workforce constraints.
We are developing the AI for Science Instrumentation Gym, which is designed to bridge this gap by placing data-driven hypothesis generation at the centre of its mission. It introduces a critical intermediate step: the tokenization and cartography of scientific data. Through tokenization, complex data is transformed into coarse-grained, interpretable units. Through cartography, these units are organised into latent spaces that can be explored as structured landscapes.
In this way, machine learning becomes a tool for mapping high-dimensional data into forms that scientists can navigate, interpret, and use to generate new hypotheses.
As a Research Engineer you will be one of the Gym Leads for the ML platform for the Scientific Instrumentation, and you will be working on:
- Data pipelines – ingestion, metadata schemas, and provenance tracking for electron microscopy, X‑ray, and light microscopy datasets
- Model interfaces – modular Python APIs that let representation models, tokenizers, and downstream models be swapped and composed
- Baseline tokenization and cartography pipelines, built collaboratively with ML researchers and domain scientists
- LLM‑ and RAG‑assisted interfaces for dataset navigation, workflow discovery, and user interaction
- Forward‑compatibility to HPC – ensuring S‑Gym and M‑Gym components can be lifted into the L‑Gym tier without fundamental redesign
- Master’s degree in a relevant subject (e.g., Artificial Intelligence, Data Science)
- Strong Python skills and experience with a modern ML framework (PyTorch preferred)
- Experience building data pipelines and reproducible ML workflows
- Comfort with GPU‑based compute and basic scientific data formats
- Track record of building modular, maintainable software
- Interest in working at the intersection of science, ML, and systems engineering
- Experience with scientific imaging data
- Familiarity with LLMs and retrieval‑augmented generation
- Experience deploying ML services on HPC or cloud GPU infrastructure
- Accredited with 5 Tripartite Standards by the Tripartite Alliance for Fair & Progressive Employment Practices (TAFEP) Singapore
- A diverse workplace with 32 nationalities, offering ample opportunities for mutual learning
- Positive and inclusive working environment
- 25 days of annual leave for fixed‑term contracts
- 1 day of Birthday Leave
- Commitment to supporting physical and mental wellness
- Comprehensive healthcare insurance coverage
- Flexible hybrid work arrangement (up to 2 days per week from home)
In line with our values, ETH Zurich encourages an inclusive culture. We promote equality of opportunity, value diversity, and nurture a working and learning environment in which the rights and dignity of all our staff and students are respected. Visit our Equal Opportunities and Diversity website to find out how we ensure a fair and open environment that allows everyone to grow and flourish.
#J-18808-LjbffrTo Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: