×
Register Here to Apply for Jobs or Post Jobs. X

Data Center Facility Operations Reliability Engineer

Job in Helena, Lewis and Clark County, Montana, 59604, USA
Listing for: Meta
Full Time position
Listed on 2026-01-02
Job specializations:
  • Engineering
    Systems Engineer, Electrical Engineering
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below

Overview

Summary: Meta was built to help people connect and share, and over the last decade, our tools have played a critical part in changing how people around the world communicate with one another. With over two billion people using the service and hundreds of offices around the globe, a career at Meta offers countless ways to make an impact in a fast-growing organization.

Our Data Centers are the foundation upon which our rapidly scaling infrastructure efficiently operates to deliver our advanced services. Meta is seeking an experienced and self-driven Reliability Lead to join our Asset Management & Reliability team within Facility Operations. This person will work at the leading edge of Facility Operations to identify and manage asset reliability risks and various stages of end-to-end asset lifecycle for the Data Center Operations.

Managing stakeholders spread across time zones is a significant challenge and key to the success of our individual projects and overall asset management, quality and reliability program.

Responsibilities
  • Prevent operational gaps in reliability engineering expertise across all asset management activities
  • Proactively review, identify, and mitigate risks of equipment failures, unscheduled downtime, and reactive maintenance
  • Ensure all new assets are methodically and consistently onboarded into Meta’s asset management ecosystem. Maintain rigorous asset onboarding processes to enable accurate tracking and seamless integration into maintenance programs
  • Establish and maintain a robust asset criticality framework to prioritize resources and mitigate risk
  • Lead Failure Mode and Effects Analysis (FMEA) to predict failure modes, prioritize risks, and develop preventive actions. Develop and execute Reliability Centered Maintenance (RCM) programs to balance cost, risk, and performance
  • Assess operational risks associated with asset failures, maintenance strategies, and process deviations
  • Develop, maintain, and update the Global Maintenance Library of plans, procedures, and best practices
  • Govern the review and implementation of changes to maintenance strategies and procedures
  • Ensure all maintenance changes are data-driven, risk assessed, and systematically implemented
  • Support accurate accounting of asset depreciation and amortization through timely asset tracking
  • Serve as a subject matter expert and technical lead for Enterprise Asset Management (EAM) implementation and optimization
  • Create and maintain asset useful life models to forecast replacement needs and optimize total cost of ownership
  • Provide technical leadership for condition-based, time-based, and specialized reliability maintenance initiatives
  • Analyze asset health metrics and KPIs to identify risks, predict failures, and measure reliability improvements
  • Collaborate with Operations and Maintenance to optimize scheduling and execution of maintenance activities
  • Mentor staff in reliability methodologies and foster a environment of proactive asset management
  • Sustain continuous improvement of asset management work streams and processes
  • 25% to 50% travel domestically and internationally
Minimum Qualifications
  • Bachelor’s degree in Mechanical, Electrical Reliability Engineering or similar technical discipline
  • 10+ years of experience in reliability engineering (related to electrical or mechanical cooling equipment)
  • Experienced in Reliability Centered Maintenance (RCM) and Failure Maintenance Effect Analysis (FMEA) activities for maintenance /process/equipment design optimization to meet reliability requirements
  • Proficient in usage of EAM solutions to extract data and develop meaningful insights
  • Certifications in Maintenance & Reliability such as CMRP, CRL, CRE
  • Knowledgeable of relevant ISO standards (ISO 14224, ISO 17359, ISO 55000)
  • Experience with Program/Project management and cross-functional team management
Preferred Qualifications
  • Experience with data center equipment such as critical cooling systems, generators, main switchboards, network gear
  • Proficient in data analysis techniques that can include Process Control, Reliability modeling and prediction, Fault Tree Analysis, Weibull Tree Analysis, Six Sigma (6σ) Methodology
  • Proficient in developing…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary