Reliability Engineer - AI & Hyperscale Server NPI and Mfg
Listed on 2026-05-30
-
Engineering
Systems Engineer
About the role
Our reliability team evaluates, develops, designs and implements software and product reliability test regimens to ensure ZT products of the highest quality are delivered to our customers. We are looking for a passionate Reliability Engineer with exceptional knowledge and experience in developing and manufacturing scalable infrastructures, working with technologies used for building hyperscale cloud services.
What you’ll do- Apply Design for Reliability principles to ensure cloud hardware developed and delivered to data centers meets specified use‑conditions and stresses, and meets its design intent.
- Act as an internal consultant on all reliability matters and interface with program management, vendors, and design engineering on key reliability programs/issues; support software/script development needs of the reliability team.
- Create or revise reliability engineering guidelines to improve product field performance through design enhancements that meet reliability goals.
- Use performance evaluation and prediction principles to improve reliability and maintainability of cloud infrastructure servers.
- Identify, collect, analyze, and manage various types of data to minimize failures and improve product performance.
- Develop scripts that represent expected environment and operational conditions.
- Collaborate with other development functional teams and internal stakeholders regarding the application of Design for Reliability principles to ensure products meet customer expectations.
- Minimum B.S. in Electrical Engineering, Computer Science/Engineering, or Software Development and 5+ years of relevant work experience (or MS degree and 3+ years).
- Knowledge of computer systems/hardware structure, switch/network interfaces.
- Knowledge and/or experience with programming languages such as Python or Unix (Bash and/or Power Shell).
- Knowledge of statistical and probability techniques and reliability modeling.
- Ability to communicate, collaborate and lead cross‑functionally to resolve issues, including those with customers.
- Fundamental knowledge of computer architecture, server architecture at the block level, and hardware‑firmware‑OS interactions.
- Working knowledge of PCBA (printed circuit board assembly) design, fabrication, and validation testing.
- Experience using tools such as Relia Soft and JMP statistical software packages.
- Working knowledge of electronic components/devices and their failure modes and mechanisms.
- Knowledge of industry standards, IPC, JEDEC, Telcordia, and MIL‑STD.
- The typical base salary for this position is expected to be between $105,000 and $140,000 per year. Final base salary will be determined on an individual basis taking into consideration experience, skills, knowledge, education and/or certifications.
- Base salary is just one component of ZT Systems total rewards philosophy.
- Other rewards may include bonus, paid time off, generous 401(k) match, tuition reimbursement, wellbeing resources, and more.
ZT Group Int’l. is an Equal Opportunity Employer and prohibits discrimination and harassment of any kind. ZT Systems provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws.
Certain positions may require U.S. citizenship or permanent residency status, as applicable.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).