Manager, Reliability Engineering, NA
Listed on 2026-06-27
-
Engineering
Systems Engineer, Operations Manager, Electrical Engineering
About Vantage Data Centers
Vantage Data Centers powers, cools, protects and connects the technology of the world's well-known hyperscalers, cloud providers and large enterprises. Developing and operating across North America, EMEA and Asia Pacific, Vantage has evolved data center design in innovative ways to deliver dramatic gains in reliability, efficiency and sustainability in flexible environments that can scale as quickly as the market demands.
Reliability Engineering DepartmentThe Reliability Engineering Team is responsible for the overall operating health of critical systems across Vantage global facilities. For each of the major systems Electrical, Mechanical, and Controls, the reliability engineering team ensures success in the commissioning stages of new construction, evaluates and improves the reliability and performance of existing critical infrastructure, sustains equipment operational availability through maintenance program design, provides ongoing technical support to the Site Operations Teams, and offers systems reliability and maintainability feedback to the Design Engineering teams for future design considerations.
Position OverviewThis role can be based in our eastern U.S. data center campuses:
Ashburn, VA;
New Albany, OH; or new sites in the Atlanta, GA market. It is a hybrid role with three days in the office and two days home‑based. Business travel of up to 25% is required.
The Manager of Reliability Engineering is a hands‑on leadership role responsible for guiding a team of engineers in the execution of reliability‑focused initiatives across Vantage's data center operations. The manager ensures system uptime, performance, and operational excellence by applying reliability engineering principles and supporting the implementation of preventive and predictive maintenance programs. The role works closely with cross‑functional teams to drive improvements and ensure consistency in reliability practices.
EssentialJob Functions
- Lead a team of reliability engineers in the day‑to‑day execution of reliability programs and initiatives.
- Support the implementation of reliability strategies that align with organizational goals and operational needs.
- Coordinate with Operations, Engineering, and Construction teams to ensure reliability considerations are integrated into facility design and maintenance planning.
- Oversee the execution of root cause analysis (RCA), failure mode effects analysis (FMEA), and other reliability tools to identify and address system vulnerabilities.
- Monitor system performance using data analytics and reliability metrics to identify trends and recommend improvements.
- Ensure team adherence to industry standards, safety protocols, and regulatory requirements related to reliability and maintenance.
- Collaborate with vendors and service providers to support reliability initiatives and ensure quality of service.
- Contribute to the development and standardization of reliability engineering processes across multiple sites.
- Provide coaching, feedback, and development opportunities to team members to build technical and leadership capabilities.
- Prepare and present operational updates and reliability reports to senior leadership as needed.
- Handle additional duties as assigned by management.
- Bachelor's degree in Engineering, Mechanical, Electrical, or a related field, required.
- Five or more years of experience in reliability engineering, maintenance, or operations, preferably in mission‑critical or data center environments.
- Experience leading or supervising technical teams in an engineering or operations setting.
- Working knowledge of reliability engineering principles and tools such as FMEA, RCA, and predictive maintenance.
- Familiarity with data analysis tools and monitoring systems used in reliability programs.
- Strong organizational and problem‑solving skills with the ability to manage multiple priorities.
- Effective communication skills with the ability to collaborate across departments and present technical information clearly.
- Demonstrated ability to lead through influence and foster a team‑oriented environment.
- Travel required is expected to be up to 25%, but may increase…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).