Datacenter Site Manager, Ops
Listed on 2026-02-15
-
IT/Tech
Systems Engineer, IT Support, Cloud Computing
Locations: On-site roles available in Buffalo, NY, Houston, TX, & Aberdeen, TX
A cutting-edge infrastructure technology company, recognized for driving innovations in high-performance computing, is seeking a skilled and strategic Datacenter Site Manager, Ops to oversee mission-critical operations across one of its flagship campus locations. This organization powers next-generation compute infrastructure for a range of industry-leading AI, defense, and enterprise clients. As they scale, they are investing in operational leadership to ensure optimal performance and reliability across their growing datacenter footprint.
This is an opportunity to lead from the front—managing high-performing teams, championing uptime and reliability, and executing a clear vision for datacenter excellence in one of the most technically advanced environments in the world.
Key Responsibilities- Lead daily operations of multiple high-density datacenter sites within a campus, overseeing a team of technicians, logistics specialists, and facilities professionals.
- Develop team capabilities through performance coaching, structured mentoring, and workforce planning strategies to support 24x7x365 operations.
- Maintain operational excellence through proactive incident management, process improvements, and shift scheduling for multi-site teams.
- Ensure data center reliability, power and cooling efficiency, hardware lifecycle performance, and ticket resolution meet or exceed internal SLA commitments.
- Collaborate cross-functionally with engineering, supply chain, construction, and program management to execute strategic objectives.
- Own and report on operational KPIs including uptime, MTTR, inventory control, cost optimization, and equipment utilization.
- Serve as escalation point for critical incidents while driving root cause analyses and leading continuous improvement efforts.
- Spearhead automation, sustainability, and safety initiatives in a mission-critical environment.
- Bachelor’s degree in engineering, Operations, Business, or equivalent experience.
- 5+ years of experience in datacenter operations or similar mission-critical environments, including at least 2 years in people management roles with teams of 10 or more.
- Deep understanding of datacenter hardware infrastructure, power and cooling systems, server management, and network fundamentals.
- Proven track record of managing 24x7 operations, meeting aggressive uptime goals, and leading large-scale incident response efforts.
- Strong organizational skills with the ability to manage priorities across multiple active facilities.
- Comfortable navigating cross-functional environments and effectively communicating at the executive level.
- 8+ years in mission-critical infrastructure, ideally with experience in hyperscale datacenter environments or 50MW+ campuses.
- Experience with ITIL or similar operational frameworks, incident/change management, and service delivery in complex technical environments.
- Familiarity with power usage effectiveness (PUE), sustainability metrics, and energy efficiency strategies in large-scale operations.
- Background in scaling teams within rapidly evolving, startup-like environments is a plus.
- Join a company that is directly enabling the future of AI, science, and advanced computation.
- Work alongside high-caliber technical leaders who value ownership, autonomy, and relentless innovation.
- Competitive base salary depending on experience and location.
- Comprehensive benefits package including medical, dental, vision, and retirement plans.
- Generous PTO policy and an environment that prioritizes impact over bureaucracy.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).