
HPC Scientific Software Engineer; IT@JH Research Computing

Job in Baltimore, Anne Arundel County, Maryland, 21276, USA
Listing for: The Johns Hopkins University
Full Time position
Listed on 2025-12-07
Job specializations:
  • IT/Tech
    AI Engineer, Systems Engineer
Job Description & How to Apply Below
Position: HPC Scientific Software Engineer (IT@JH Research Computing)

IT@JH Research Computing is seeking an HPC Scientific Software Engineer who will support faculty, researchers, and students engaged in high‑performance and AI‑driven research across Johns Hopkins University. The position is responsible for deploying, optimizing, and maintaining scientific software and computational workflows on advanced HPC systems and related infrastructure. Working primarily within Linux‑based environments, the engineer manages and troubleshoots complex software stacks, containerized applications, and GPU‑accelerated workloads using tools such as SLURM, EasyBuild, and Spack.

The role combines ticket‑based user support with long‑term project work, collaborating closely with interdisciplinary research groups to enhance system performance, streamline data‑intensive workflows, and integrate cutting‑edge technologies. The position operates with significant independence while coordinating regularly with systems engineers and research computing leadership to ensure reliable, high‑efficiency computing resources that advance the university’s scientific mission.

Specific Duties & Responsibilities
Software Deployment and Design
  • Develop and refine deployment strategies for scientific software on HPC and AI systems.
  • Design computational workflows, select optimal software configurations, and use tools like Ansible for automation.
  • Assist teams in implementing, tuning, and optimizing AI models and gateway applications (for example, XDMoD, ColdFront, Open OnDemand, CryoSPARC Live, SBGrid, AI agents).
Performance Optimization
  • Analyze and optimize the performance of AI models and HPC applications, focusing on GPU‑enabled computing.
  • Implement parallel processing, distributed computing, and resource management techniques for efficient job execution.
Integration and Optimization
  • Develop, debug, and maintain software tools, libraries, and frameworks supporting HPC and AI workloads.
  • Collaborate with the system team and software vendors (such as NVIDIA, Intel, MATLAB) to optimize systems for maximum performance.
  • Utilize CUDA, DNN, TensorRT, and Intel compilers to enhance system performance.
HPC Scientific Software Support
  • Manage and support scientific software deployment across HPC, cloud‑based, and colocation facilities.
  • Oversee installation, configuration, and maintenance of HPC packages with tools like CMake, Make, EasyBuild, Spack, and Lua module files.
Collaboration and Mentorship
  • Work closely with cross‑functional teams, including researchers, data scientists, and software developers, to address complex HPC/AI challenges.
  • Mentor junior engineers and foster a culture of continuous learning.
Technical Support, Training Workshops, and Troubleshooting
  • Resolve complex technical issues and perform root cause analysis for HPC/AI software challenges.
  • Implement effective solutions to prevent recurrence and improve system reliability.
  • Provide training workshops for researchers and students, focusing on troubleshooting, optimizing workflows, and effectively using HPC systems.
Learning and Development
  • Stay current with advances in HPC and AI technologies and methodologies.
  • Incorporate new research findings into existing systems to improve performance and capabilities.
Container Orchestration
  • Develop and manage container orchestration strategies to ensure scalability, reliability, and security of applications.
  • Oversee the container lifecycle from creation and deployment to scaling and removal.
Documentation and Compliance
  • Create comprehensive documentation for system designs, performance metrics, and project status.
  • Ensure compliance with security and regulatory standards for all HPC and AI systems.
Minimum Qualifications
  • Master’s Degree in a quantitative discipline.
  • Five (5) years of experience in HPC user support, software deployment, and performance optimization within an academic or research environment.
  • Experience in scientific computing environments and applications.
  • Hands‑on experience with SLURM for job scheduling.
  • Proficiency in…