Senior Computing System Administrator

Job in New Haven, New Haven County, Connecticut, 06540, USA
Listing for: Yale University
Full Time position
Listed on 2026-01-02
Job specializations:
  • IT/Tech
    Systems Engineer, IT Support
Salary/Wage Range or Industry Benchmark: USD 90,000 per year
Job Description & How to Apply Below
Position: Senior High Performance Computing System Administrator

Salary Range: $90,000.00 - $

Overview

Yale Center for Research Computing (YCRC) is seeking a versatile system administrator/engineer to help ensure that Yale students and faculty have the AI high‑performance computing infrastructure they need to advance discovery and scholarship. The role focuses on GPU infrastructure enhancements as part of Yale’s investment in AI.

As an experienced subject matter expert, you will lead the design, deployment, and support of YCRC’s AI‑focused research cluster and storage infrastructure. This position is primarily systems‑focused but includes researcher interaction. You will stay current on accelerator and HPC technologies and advise on trade‑offs in memory, precision, networking, power, and cost.

This is a hybrid position, requiring a minimum of two days per week on site at YCRC’s office on the Yale campus. Equipment maintenance may be required on‑site. Infrastructure is hosted at a Yale data center in West Haven, CT and at the Massachusetts Green High‑Performance Computing Center in Holyoke, MA.

Principal Responsibilities
  • Design, implement, and advance core HPC systems such as the HPC provisioning system, the resource‑management system, account/user lifecycle management, and user authentication and authorization systems.
  • Design, deploy, configure, and support HPC clusters, including compute, networking, parallel storage, and backup.
  • Install, administer, and maintain hardware, system software, networking, accounts, and security measures.
  • Diagnose and correct system issues, whether they affect correct operation or performance.
  • Develop and maintain documentation.
  • Research developments in HPC architecture and new technologies, processes, and methodologies.
  • Determine specifications for new systems and tailor them to meet research needs.

    Required Skills and Abilities
    • Experience with accelerators such as GPUs for AI, including expertise with system‑level trade‑offs in accelerator‑based memory, precision, and interconnects.
    • Expertise in administration of HPC Linux clusters, including provisioning and management tools and batch schedulers.
    • Experience with high‑speed networking such as InfiniBand and high‑speed Ethernet.
    • Experience with large storage systems and parallel file systems such as GPFS and Lustre.
    • Expertise in Linux system administration, including operating system, networking, storage, and security.
    • Expertise in automation and scripting in at least one scripting language.
    • Ability to work in a team environment in a fast‑moving technology field. Excellent verbal and written communication skills.
    • Ability to interact well with team members and end users, and to work independently across units.
    • Attention to detail and the care needed to support a system used by hundreds of research users.

    Preferred Skills and Abilities
    • Demonstrated ability to specify, install, configure, and support multi‑node GPU systems and tune them for AI applications.
    • Experience designing, implementing, and maintaining a local, customized implementation of a core HPC system such as the provisioning system, resource‑management system, or user authentication and authorization systems.
    • Experience supporting technology in a research environment.
    • Expertise in configuration, deployment, support, and backup of large‑scale parallel storage systems.
    • Experience administering high‑speed networking such as InfiniBand or high‑speed Ethernet in a cluster environment.
    • Expertise in computer security, preferably in large, multi‑user Linux environments.
    • Experience in a data‑center environment, installing and troubleshooting hardware.
    • Professional certifications related to the above.
    • Graduate degree in a related field.

    Required Education and Experience

    Bachelor's Degree in a related field and a minimum of six years of related work experience, or an equivalent combination of education and experience.

    Equal Opportunity Employer Statement

    Yale University is committed to basing judgments concerning the admission, education, and employment of individuals upon their qualifications and abilities and seeks to attract to its faculty, staff, and student body qualified persons from a broad range of backgrounds and perspectives. In accordance with federal and…

    Position Requirements
    10+ Years work experience