Lead Research Software Engineer, Portable AI Performance Engineering
Listed on 2026-05-31
-
Software Development
AI Engineer, Data Scientist, Machine Learning/ ML Engineer, Software Engineer
Lead Research Software Engineer, Portable AI Performance Engineering
MA Green High Performance Computing Center – serves as a hands‑on research software engineering professional and lead for applied performance engineering for AI workloads. The role works closely with research groups and leading computer industry collaborators to evaluate, adapt, and enhance the portable performance of complex AI research workloads on state‑of‑the‑art hardware. The focus is heavy on optimizing existing NVIDIA GPU‑based workloads for top‑tier AMD GPUs, such as MI355X and beyond, and on analyzing and profiling research AI workloads to identify performance bottlenecks and portability challenges;
and porting and optimizing complex AI models and scientific code to run efficiently on AMD MI355X GPUs using ROCm, HIP, and related translation tools.
- Bachelor’s degree or equivalent with a minimum of five years of work experience in deeply technical fields and/or computational research experience;
- Strong proficiency in Python and C++, with deep familiarity with AI/ML frameworks (PyTorch, Tensor Flow, JAX);
- Hands‑on experience with GPU programming models (e.g., CUDA, HIP, or OpenCL);
- Experience with performance profiling and benchmarking tools on Linux‑based High‑Performance Computing systems;
- Excellent communication skills;
- Ability to collaborate effectively with academic researchers and industry partners;
- Self‑motivated with the ability to work independently in a remote or hybrid environment.
- Direct experience with the AMD ROCm software stack and translating CUDA code to HIP;
- Familiarity with AI agentic tools and Large Language Models (LLMs) used for code generation and refactoring;
- Background in supporting large‑scale, domain‑specific scientific research (e.g., physics, biology, climate science) on institutional clusters;
- Direct experience with one or more open‑source schedulers and provisioners;
- Experience with Linux container technologies such as LXC, Apptainer and systemd‑nspawn;
- Advanced degree in a relevant technical field.
Lead Software Engineer must comply with all relevant MGHPCC security policies.
This is a two‑year term position.
3/13/2026
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).