Reliability Engineer - AI & Hyperscale Server NPI and Mfg
Listed on 2026-02-23
-
Engineering
Systems Engineer
About the Role
Our reliability team is responsible for evaluating, developing, designing, and implementing software and product reliability test regimens to ensure ZT products of the highest quality are delivered to our customers.
We are looking for a passionate Reliability Engineer with exceptional knowledge and experience developing and manufacturing scalable infrastructures. You will be working with the latest technologies that go into building a hyperscale cloud services.
What You’ll DoThe successful candidate will be responsible for using Design for Reliability principles to ensure the cloud hardware developed and delivered to data centers meet specified use‑conditions and stresses to assure its design intent. Act as the internal consultant on all reliability matters and interface with program management, vendors, and design engineering on key reliability programs and issues; supporting the software/script development needs of the reliability team.
This will include the creation or revision of reliability engineering guidelines to improve product field performance through design enhancements to meet reliability goals. Uses principles of performance evaluation and prediction to improve the reliability and maintainability of Cloud Infrastructure servers. Identifies, collects, analyzes, and manages various types of data to minimize failures and improve product performance. Develop scripts that represent the expected environment and operational conditions.
Collaborate with other development functional teams and internal stakeholders regarding the application of Design for Reliability principles to ensure products meet customer expectations.
- Minimum B.S. in Electrical Engineering, Computer with Science/Engineering, or Software development and 5+ years of relevant work experience (alternatively, a MS degree and 3+ years of experience).
- Knowledge of computer systems/hardware structure, as well as switch/network interfaces.
- Knowledge and/or experience with programming languages like Python or Unix (Bash and/or Power Shell).
- Knowledge of statistical & probability techniques and reliability modeling.
- Ability to communicate, collaborate and lead cross‑functionally to resolve issues, including those with customers.
- Fundamental knowledge of Computer Architecture, Server architecture at the block level, and Hardware/Firmware/OS interactions.
- Working knowledge of PCBA (printed circuit board assembly) design, fabrication, and validation testing.
- Experience using tools such as Relia Soft & JMP statistical software packages.
- Working knowledge of electronic components/devices and their failure modes & failure mechanisms.
- Knowledge of industry standards, IPC, JEDEC, Telcordia, and MIL‑STD.
ZT Systems assesses market data to ensure a competitive compensation package. The typical base salary for this position is expected to be between $105,000 and $140,000 per year. If hired, the final base salary will be determined on an individual basis taking into consideration experience, skills, knowledge, education and/or certifications.
Base salary is just one component of ZT Systems total rewards philosophy. We take pride in offering a wide range of benefits and perks that appeal to the variety of needs across our diverse employee base. Other rewards may include bonus, paid time off, generous 401(k) match, tuition reimbursement, wellbeing resources, and more.
What We OfferAt ZT Systems, a Sanmina Company, we believe that investing in our people is key to our continued growth and innovation. When you join our team, you’ll gain access to a comprehensive and inclusive benefits package designed to support your well‑being, financial security, and professional development—both now and in the future.
Health & Wellness- Comprehensive medical, dental, and vision coverage with access to leading providers.
- Mental health resources and employee wellness support programs.
- Company‑paid life and disability insurance.
- Paid time off (PTO) and company‑paid holidays.
- Parental leave and family care support programs.
- Structured training…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).