Sr. HPC Systems Architect; Storage

Job in Ann Arbor, Washtenaw County, Michigan, 48113, USA
Listing for: Dormont Manufacturing Co
Full Time position
Listed on 2026-07-02
Job specializations:
  • IT/Tech
    Systems Engineer, Unix/Linux, IT Infrastructure
Salary/Wage Range or Industry Benchmark: USD 129,600 - 220,300 per year
Position: Sr. HPC Systems Architect (Storage)

Company Overview

KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice‑controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel displays.

The innovative ideas and devices that are advancing humanity all begin with inspiration, research and development. KLA places an above‑average focus on innovation, investing 15% of sales back into R&D. Our expert teams of physicists, engineers, data scientists and problem‑solvers work together with the world’s leading technology providers to accelerate the delivery of tomorrow’s electronic devices. Life here is exciting and our teams thrive on tackling really hard problems.

There is never a dull moment with us.

Job Description/Preferred Qualifications

This role provides senior technical leadership for the architecture, deployment, and long‑term scalability of large‑scale HPC storage and compute platforms. It owns systems end‑to‑end, from early architectural definition through full production, partnering across engineering, manufacturing, and strategic vendors to deliver highly available, high‑performance infrastructure at scale.

The scope emphasizes deep technical ownership, architectural decision‑making, and solving sophisticated infrastructure challenges in live production environments. This work directly shapes critically important HPC platforms built for adaptability, scale, and operational excellence, driving real‑world impact across core products and technologies.

Job duties include, but are not limited to:
  • Lead the design, implementation, and ongoing support of high‑performance compute (HPC) clusters, taking accountability for system performance, reliability, and scalability
  • Serve as a technical authority for HPC storage, with deep hands‑on expertise in parallel file systems such as Lustre, GPFS, and BeeGFS
  • Apply sophisticated systems knowledge across CPU and GPU architectures, high‑bandwidth interconnects, and robust storage subsystems to deliver balanced, high‑performance solutions
  • Lead the creation of hardware BOMs for HPC clusters, working directly with vendors and coordinating hardware release activities
  • Design, configure, and optimize Linux operating systems for HPC environments
  • Translate project specifications and performance requirements into subsystem and system‑level designs, driving execution while meeting technical and schedule commitments
  • Support the design, release, and transition of new systems to manufacturing and customers, providing high‑quality golden images, procedures, scripts, and documentation
  • Lead EOL part re‑qualification activities to ensure long‑term system viability and supportability
Qualifications include, but are not limited to:
  • Proven experience with HPC systems, storage, or large‑scale Linux infrastructure
  • Deep, hands‑on expertise in HPC storage and Linux‑based infrastructure
  • Strong, distro‑agnostic Linux experience (Rocky, RHEL, SuSE, Ubuntu)
  • Proven experience designing and operating large‑scale parallel storage systems
  • Strong understanding of HPC hardware platforms (servers, GPUs, networking, storage, BIOS/BMC)
  • Advanced Linux systems knowledge (PXE/netboot, systemd, HA concepts)
  • Solid networking fundamentals (TCP/IP, DNS, DHCP, LDAP, HTTP)
  • Strong scripting skills in Shell and Python
  • Experience with configuration management and automation (Salt, Puppet, Chef, etc.)
Preferred Qualifications:
  • Strong DevOps and automation mentality (CI/CD pipelines, Git, infrastructure as code)
  • Experience with containers for HPC (Singularity, Docker)
  • Monitoring and observability experience (Prometheus, Grafana)
  • Familiarity with Apache/Nginx and supporting infrastructure services

Minimum Qualifications

Requires a minimum of 8 years of related experience with a Bachelor’s degree; or 6 years and a Master’s degree; or a PhD with 3 years of experience; or equivalent experience.

Base Pay Range: $ - $ Annually

Primary Location: USA-MI-Ann…
