Site Lead, Data Center Operations
Listed on 2026-02-07
-
IT/Tech
Data Engineer, Cloud Computing
Overview
Independently responsibility for one or more data centers, leading performance analyses across key operational areas and proactively monitoring facility health to implement significant enhancements. Drives process improvements by partnering across functions and regions, leads on-the-ground teams in incident resolution, manages escalated technical issues, and utilizes advanced automation and monitoring tools to mitigate risks. Maintains an up-to-date knowledge base, executes incident management protocols, and conducts root cause analysis to improve operations.
Oversees new region builds and expansions, serves as the main liaison for expansion projects, and provides oversight for installations, repairs, inventory, and logistics—directing component upgrades and infrastructure changes to optimize data center efficiency and stability.
Responsibilities include supporting ongoing operations, expansion projects, and ensuring data center reliability through proactive management and coordination with cross-functional teams.
ResponsibilitiesKey Responsibilities
- Data Center Site Portfolio Management: Independently responsibility for at least one and occasionally multiple Data Centers.
- Performance Monitoring and Analysis: Leads performance trend analyses related to capacity, temperature, availability, cleanliness, and other aspects. Identifies significant patterns, and suggests operational improvements.
- Issue Management and Automation: Proactively monitors facility health (power, cooling, security) and develops and implements major enhancements. Leads the on-the-ground resources to resolve incidents and performs accurate communication on execution. Oversees and provides support for escalated complex technical issues. Triages and/or escalates issues, and implements advanced automation, scheduling, and monitoring tools to mitigate potential problems effectively. Identifies, documents, and validates issues, processes, and solutions, ensuring the data center knowledge base is comprehensive and up-to-date.
Prepares for, and when needed executes incident or crisis management protocols in alignment with business continuity plans. Performs Root Cause Analysis (RCA) following crises or incidents, and updates documentation to capture process improvements.
- Data Center Expansion Support: Leads and oversees new region builds and expansion activities, both onsite and remotely. Acts as primary liaison with project teams and data center engineering, ensuring all timelines and capacity needs are strategically managed for expansion projects and site builds. Collaborates closely with project teams on critical aspects of expansion projects and site builds to deliver high standards.
- Installation and Maintenance: Provides oversight for installations, repairs, inventory management, and logistics tasks. Directs efforts to replace and upgrade components. Advises on high-level purchases or upgrades for data centers and oversees implementation. Leads planning and execution of rack deployments, installations, and network physical infrastructure upgrades/changes. Ensures proactive maintenance of the Data Center facility with regard to efficiency and stability (e.g. containment, air flow & pressure, power trains).
Responsibilities
- Planning & Execution: Manages and coordinates moderately complex tasks, monitoring timelines and deliverables to ensure timely completion and adherence to requirements for a moderately-sized project or initiative. Efficiently delegates, monitors, and prioritizes work across multiple projects, providing technical oversight and adjusting plans to address shifts in resources or timelines.
- Collaboration & Partnership: Collaborates across the organization to align on expectations and achieve shared objectives. Leverages understanding of business leaders, stakeholders, and/or customers to ensure proposed solutions meet their needs. Supports inclusivity by actively seeking and listening to diverse perspectives, ensuring others feel heard and respected.
- Problem Solving: Identifies and addresses moderately complex issues by analyzing a wide range of data and/or information to identify solutions in accordance…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).