HPC Systems Engineer
Listed on 2026-06-12
-
Engineering
Systems Engineer
About Houston
Exxon Mobil's state-of-the-art campus north of Houston serves as home to its Upstream, Product Solutions and Low Carbon Solutions businesses and their associated service groups. The facility opened in 2014 and accommodates more than 10,000 employees and visitors.
By bringing many global functional groups together, the campus provides employees with the tools and capabilities needed today, and in the future, to achieve business objectives and accelerate the discovery of new resources, technologies and products. It was designed to foster improved collaboration, creativity and innovation and enhance the company’s ability to attract, develop and retain the top talent in the industry.
The campus is located in Spring, Texas, on 385 wooded acres immediately to the west of Interstate Highway 45 (I-45), at the intersection of I-45 and the Hardy Toll Road, approximately 25 miles from the cultural vibrancy of downtown Houston.
The campus was constructed to the highest standards of energy efficiency and environmental stewardship. Its design incorporates extensive research into best practices in building and workplace design through extensive benchmarking of the world’s top academic, research, and corporate facilities.
What Role You Will Play in Our TeamThe HPC Systems Engineer role has the overall responsibility to work within a small team of highly skilled HPC specialists to provide a performant, reliable, and secure high-performance computing (HPC) environment. The HPC Systems Engineer will be involved in various aspects of designing and engineering our HPC system as well as be responsible for managing day-to-day operations and maintenance activities, including general troubleshooting of any issues that may arise, monitoring overall system health, performing system maintenance tasks, and evaluating new hardware and system software.
Location:
Spring, TX, USA
- Establish strategies for overall support of the systems portfolio (4 supercomputers and large parallel file systems across the domain)
- Evaluate new hardware and software and understand potential benefits/impacts it can have in the scientific computing and seismic imaging environments
- Perform software installations and upgrades, inclusive of operating system in world class HPC environment
- Monitor overall system performance and health and partner with 3rd parties on hardware support as needed
- Provide support for the management of data in the environment (140+ Petabytes of parallel file system)
- Advanced consulting with users to resolve problems and ensure they can effectively utilize the systems
- Interact with both business customers and technical teams that are globally distributed and within varied time zones
- Engaging with vendors for problem resolution of existing infrastructure and discussion of roadmaps and new technologies for evaluations
- Foster a supportive work environment and maintain open, productive interactions among team and across organizations
- Build and maintain cross-organizational contacts to facilitate execution of work
- B.E./B.Tech in Computer Science or related degree area (e.g., Computer Engineering, Information Systems) or equivalent skills work experience.
- Excellent technical, analytical, and communication skills.
- A minimum of 10 years of hands‑on Linux experience (e.g., RHEL, CentOS) and production infrastructure support (e.g., networking, storage, monitoring, compute, installation, configuration, maintenance, upgrade, retirement).
- A minimum of 5 years’ experience in HPC technologies (e.g., installation, configuration, maintenance, upgrade, retirement, problem resolution) such as parallel/distributed files systems (e.g., Lustre, GPFS), high speed interconnect fabrics (e.g., Infiniband, Omni‑Path), and HPC batch scheduling software suites (e.g., PBSPro, SLURM).
- Proficiency in technical writing and documentation of solutions.
- Solid understanding of data center operations fundamentals in networking, cooling, and power.
- Works well in a team environment.
- Self‑motivated.
- Strong IT skills in infrastructure and applications.
- Experience with supporting large scale…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).