HPC/AI Specialist
Listed on 2026-05-22
-
IT/Tech
AI Engineer, Data Scientist, Data Analyst, Machine Learning/ ML Engineer
Overview
The National Energy Research Scientific Computing Center (NERSC) at Berkeley Lab seeks enthusiastic engineers to join the NERSC Science Acceleration Program (NESAP). You will collaborate with scientific teams to enable the solution of deep, meaningful problems across all program areas funded by the Department of Energy Office of Science under DOE's Project Genesis mission.
The Challenge:
Enabling advanced science at scale on supercomputers. In 2027, NERSC will deploy its next generation HPC system “Doudna” — a system optimized for science workflows involving modelling and simulation, AI and experimental data analysis. The system includes next-generation NVIDIA Vera-Rubin superchips, a high-speed interconnect, and an all-flash VAST file system. NESAP is about employing cutting-edge computational science techniques and advanced performance analysis tools to develop highly scalable, capable and performance complex workflows to meet exciting challenges in Simulation, Data Analysis, and Machine Learning on Doudna and future NERSC supercomputing systems.
join Berkeley Lab?
We invest in our employees by offering a total rewards package you can count on:
- Exceptional health and retirement benefits, including pension or 401K-style plans
- Opportunities to grow in your career – Tuition Assistance Program
- A culture where you’ll belong – we are invested in our teams
- Winter Holiday Shutdown every year (in addition to accrued vacation and sick time)
- Parental bonding leave (for both mothers and fathers)
- Pet insurance
- At the CSE3 level:
Collaborate with NERSC staff and code teams to port and optimize software for the Doudna system and future platforms. - Develop and refine advanced workflows, algorithms, and models with domain experts for HPC applications.
- Research and leverage NERSC's integrated HPC/AI/data ecosystem.
- Support widely used scientific workflows on NERSC systems.
- Provide performance engineering expertise, training, and user support.
- Stay current with AI/HPC advancements through community engagement.
- Help evaluate and shape future supercomputing technologies.
- Partner with scientists and industry to drive impactful research.
- Lead or coordinate work on complex projects, applying sound judgment and advanced analysis.
- Mentor early-career staff in computing techniques and projects.
- Track emerging HPC/AI trends and translate them into opportunities for NERSC users.
- Develop strategies to balance performance and productivity for the science community.
- Resolve complex, ambiguous issues using advanced analysis and independent judgment.
- At CSE3 level:
Typically requires a Bachelor's degree and a minimum of 8 years of related experience; or 6 years of experience and a Master’s degree; or an equivalent combination of education and experience. - Wide-ranging experience in data management, storage and I/O as applied to scientific data.
- Experience with the development and performance optimization of scientific software in the HPC or AI context, including algorithms design or applied mathematics.
- Contributions to scientific and/or open source software projects, public code repositories, publication record.
- Ability to troubleshoot and resolve complex issues in creative and effective ways.
- Ability to network and collaborate with key contacts outside their own area of expertise.
- Excellent oral and written communication skills.
- Excellent software development skills.
- Proven ability to work productively both independently and as part of an interdisciplinary team balancing divergent objectives involving research, code development, supporting software and consulting with scientists.
- Typically requires a Bachelor's degree and a minimum of 12 years of related experience; or 8 years of experience and a Master’s degree; or equivalent experience.
- Broad expertise and/or unique knowledge in data management, storage and I/O as applied to scientific data is required.
- Ability to work on and resolve significant and unique issues where analysis of situations or data requires an evaluation of intangibles.
- Ability to exercise independent judgment in methods, techniques…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).