×
Register Here to Apply for Jobs or Post Jobs. X

Director, Data Center Reliability Engineering

Job in Jackson, Hinds County, Mississippi, 39203, USA
Listing for: Oracle
Full Time position
Listed on 2026-06-25
Job specializations:
  • IT/Tech
    Systems Engineer, Data Science Manager
Job Description & How to Apply Below
** Job Description*
* ** Key Responsibilities*
* + Lead reliability engineering and analytics teams across multiple sites.

+ Standardize and enforce FMEA, RCA, and continuous improvement methodologies.

+ Oversee deployment of monitoring, analytics, and automation tools supporting reliability programs.

+ Define, track, and report reliability KPIs to executive and global operations leadership.

+ Ensure corrective actions are implemented, verified, and sustained.

+ Develop engineers and analysts in disciplined, data-driven problem solving.

** Ideal Candidate Profile*
* + Senior experience in reliability engineering, maintenance engineering, or uptime-critical environments.

+ Strong background in analytics, RCA rigor, and reliability frameworks.

** Skills and Competencies*
* + Strong technical leadership and stakeholder influence.

+ Comfortable translating analysis into executive-level decisions.

** Why Oracle Cloud Infrastructure?*
* + Global impact at scale:
Contribute directly to how mission-critical OCI data centers operate across regions and continents, influencing infrastructure reliability, security, sustainability, and long-term capacity growth.

+ Technically rigorous environment:
Work alongside experienced engineers, automation specialists, and compliance teams in a rapidly scaling hyperscale cloud infrastructure, where disciplined execution and technical depth matter.

+ Culture built on operational excellence:
Join an organization that values safety, process rigor, clear accountability, and continuous improvement as foundational to protecting uptime and customer trust.

+ Long-term career development:
Benefit from internal mobility, role-based technical training, and development opportunities designed for professionals building long-term careers in cloud infrastructure and facilities operations.

** Responsibilities*
* ** Key Responsibilities*
* ** Data Center Site Portfolio Management:*
* -Data Center country leader and typically has responsibility for one or more sites & teams in a region.

** Performance Monitoring and Analysis:*
* -Sets strategic direction for data center operations performance monitoring, collaborates with executive leadership.

-Defines strategic direction for network performance evaluation, collaborates with executive leadership.

-Establishes strategic direction for analysis of physical, power, and cooling capacity, in collaboration with executive leadership.

-Defines the strategic direction for continuous improvement, collaborates with executive leadership to achieve KPIs and objectives.

** Issue Management and Automation:*
* -Oversees all aspects of support for escalated complex technical issues across multiple data centers.

-Defines and enforces strategies for issue triage, leveraging advanced automation, scheduling, and monitoring tools.

-Identifies, documents, and standardizes issues, processes, and solutions, ensuring the data center knowledge base is comprehensive, accurate, and strategically aligned with department goals.

-Oversees the implementation of strategy for incident or crisis management protocols in alignment with business continuity plans.

-Establishes best practices for conducting Root Cause Analysis (RCA) following crises or incidents, and updates documentation to capture process improvements.

** Data Center Expansion Support:*
* -Sets the strategic direction and oversees the entire process of new region builds and expansion activities, both onsite and remotely.

-Acts as the primary liaison with senior project teams and data center engineering leadership, organizing resources and ensuring strategic timelines and long-term capacity needs are effectively managed for all expansion projects and site builds.

-Collaborates at the highest level with project teams to ensure the delivery of world-class standards across all expansion projects and site builds.

** Installation and Maintenance:*
* -Directs all aspects of installations, repairs, inventory management, and logistics tasks across several data centers.

-Establishes standards and best practices for component replacements and upgrades.

-Advises on and manages large-scale purchases or upgrades for data centers.

-Ensures implementation of proactive maintenance and lifecycle management strategies of the Data Center facilities with regard to efficiency and stability (e.g. containment, air flow & pressure, power trains).

** Core Responsibilities*
* ** Planning & Execution:*
* -Oversees and guides multiple teams on managing complex projects or initiatives, monitoring timelines, deliverables, and budgets when applicable to ensure strategic objectives are met. Serves as a role model for appropriately delegating work, setting priorities, and ensuring alignment with business needs. Coaches others on adjusting resources or project timelines in anticipation of business changes.

** Collaboration & Partnership:*
* -Role models leading cross-functional collaborative efforts to ensure alignment of expectations and strategic objectives. Empowers team to build and maintain…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary