Principal/Sr Principal HPC Systems Engineer - R10213786
Listed on 2025-12-27
-
IT/Tech
Systems Engineer, Cybersecurity
Principal/Sr Principal HPC Systems Engineer - R
Northrop Grumman
LocationBaltimore, MD (Onsite)
Relocation AssistanceRelocation assistance may be available.
Security ClearanceA US Government security clearance per customer's requirements.
Travel10% of the time.
DescriptionAt Northrop Grumman, our employees have incredible opportunities to work on revolutionary systems that impact people's lives around the world today, and for generations to come. Our pioneering and inventive spirit has enabled us to be at the forefront of many technological advancements in our nation's history - from the first flight across the Atlantic Ocean, to stealth bombers, to landing on the moon.
We look for people who have bold new ideas, courage and a pioneering spirit to join forces to invent the future, and have fun along the way. Our culture thrives on intellectual curiosity, cognitive diversity and bringing your whole self to work – and we have an insatiable drive to do what others think is impossible. Our employees are not only part of history, they're making history.
Northrop Grumman Mission Systems is a trusted provider of mission‑enabling solutions for global security. Our Engineering and Sciences (E&S) organization pushes the boundaries of innovation, redefines engineering capabilities, and drives advances in various sciences. Our team is chartered with providing the skills, innovative technologies to develop, design, produce and sustain optimized product lines across the sector while providing a decisive advantage to the warfighter.
Come be a part of our mission!
- Oversee design, deployment, and lifecycle operation of a high‑performance compute cluster
- Lead team of HPC Systems Administrators
- Assess and respond to customer requests for cluster modifications, including oversight of requirement gathering and analysis, planning, implementation, verification/validation, and production deployment maintenance
- Investigate, diagnose, and resolve acute system faults
- Ensure system performance aligns with customer requirements and remain within technical, schedule, and cost constraints
- Maintain software deployments
- Maintain security compliance
- Monitor and maintain hardware
- Contribute to design of new high‑performance compute clusters
- Interface with user support staff
- Assess new technology for benefits and risks by performing trade studies of technological function, value proposition, and deployment timeline
- Assess and report on cluster operational risks and propose, plan, and deploy mitigation strategies
- A degree in a STEM area with a minimum of 5 years of experience with a bachelor’s degree, 3 years with a master’s, or 0 years with a PhD.
- Demonstrated experience maintaining computational hardware through its lifecycle
- Demonstrated experience analyzing and responding to customer requirements
- Strong Linux systems administration proficiency (RHEL nice to have)
- Strong knowledge and experience with concepts of high-performance computing system operations, including cluster management (Ansible), multi-user login environments, job scheduling (SLURM), and networked file systems
- Strong knowledge and experience maintaining compliance with Security Technical Implementation Guides (STIGs)
- Strong knowledge and experience with compiling software
- Strong knowledge and experience monitoring and maintaining high-performance compute cluster hardware
- Experience directing technical work of a small team of Linux Systems Administrators
- Strong written and verbal communication skills
- Candidate Must be a U.S. Citizen
- Active US Government security clearance per customers requirements
- Bachelor’s degree in a STEM discipline with 8 years' relevant experience; 6 years' experience with a Master’s degree in a STEM discipline; 4 years with PhD in a STEM discipline.
- Demonstrated experience maintaining computational hardware through its lifecycle
- Demonstrated experience analyzing and responding to customer requirements
- Strong Linux systems administration proficiency (RHEL nice to have)
- Strong knowledge and experience with concepts of high-performance computing system operations, including cluster management (Ansible),…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).