
HPC Infrastructure Engineer

Remote / Online - Candidates ideally in
Palo Alto, Santa Clara County, California, 94306, USA
Listing for: Guardant Health
Full Time, Remote/Work from Home position
Listed on 2026-01-30
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing
Salary/Wage Range or Industry Benchmark: 149,400 - 205,400 USD yearly
Job Description & How to Apply Below
Staff HPC Infrastructure Engineer
Locations: Palo Alto, CA
Time type: Full time
Posted on: Posted Yesterday
Job requisition: R-100428

Company Description

Guardant Health is a leading precision oncology company focused on guarding wellness and giving every person more time free from cancer. Founded in 2012, Guardant is transforming patient care and accelerating new cancer therapies by providing critical insights into what drives disease through its advanced blood and tissue tests, real-world data and AI analytics. Guardant tests help improve outcomes across all stages of care, including screening to find cancer early, monitoring for recurrence in early-stage cancer, and treatment selection for patients with advanced cancer.

For more information, visitand follow the company on,and.

About the Role:

You enjoy an agile, fast-paced, and highly technical environment. You are a self-driven, accomplished technologist who strives to continually improve your skills, your value to the company, and the computational infrastructure. You are dedicated to engineering excellence yet pragmatic and flexible. You can maintain the day-to-day support SLA while running key projects that move the business forward.

Essential Duties and Responsibilities:

· Act as a technical lead in day-to-day operations
· Help manage the HPC interconnects
· Help integrate the HPC systems with the bandwidth on-demand system
· Help integrate the HPC system with the single namespace storage system
· Help integrate cloud bursting as part of the HPC abstraction work
· Work with the networking infrastructure team to manage and optimize the connectivity to and from the HPC systems and locales
· Help manage multiple HPC clusters and cluster file systems.
· Help research, develop and implement the next generation HPC solution
· Troubleshoot the production system stack down to the source-code level (e.g., shell scripts, Python, and others).
· Maintain, monitor, and support the infrastructure environment and/or facilities.
· Use and maintain enhanced production monitoring and additional capability.
· Support improvements for increased system reliability and performance.
· Support multiple systems or applications of medium to high complexity (complexity defined by size, technology used, and system feeds and interfaces) with multiple concurrent users, ensuring control, integrity, and accessibility.
· Support systems at remote locations, including internationally
· Work with offsite consultants to maintain the infrastructure
· Work with vendors to troubleshoot, upgrade and repair systems as needed
· Participate in a 24/7 on-call rotation

Required Qualifications:

· B.S. in Computer Science or related field
· 4+ years of TCP/IP networking experience
· 2+ years of RDMA networking experience
· 4+ years of Linux/Unix administration, knowledge of Unix network protocols, TCP/IP network fundamentals, core infrastructure technologies and virtualization
· 2+ years of large-scale data storage and compute clusters (HPC) infrastructure
· 2+ years working in and with on-premises and cloud-based (AWS, Google, IBM and Azure) data centers
· 2+ years of building software release and ops processes and automation toolsets
· 2+ years providing documentation of system administration

Preferred Qualifications:

· Cisco Certified Network Professional certification
· Experience with Arista and compatible networking, up to and including 400 Gb/s links
· Experience with Mellanox InfiniBand fabric
· Experience administering IBM’s General Parallel File System
· Experience administering the SLURM scheduler
· Experience using Warewulf
· Experience with cloud bursting technologies
· Experience with wide area file systems
· Experience with Docker and container technologies
· Experience with Kubernetes
· Operating infrastructure compliant with HIPAA and SOX standards

Hybrid Work Model:

This section is applicable to onsite employees who are eligible for hybrid work location as specified by management and related policies. Guardant has defined days for in-person/onsite collaboration and work-from-home days for individual-focused…