×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Developer

Job in Nashville, Davidson County, Tennessee, 37247, USA
Listing for: Oracle
Full Time position
Listed on 2026-01-02
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer
Job Description & How to Apply Below
Position: Site Reliability Developer 6

Job Description

Oracle is seeking a Strategic Platform Reliability Engineering (SPRE) Architect to strengthen the architectural foundation and operational resilience of key SaaS offerings, ensuring availability, security, and compliance for top-tier customers. The SPRE Architect will lead cross-functional collaboration with SaaS and OCI teams, applying best practices and commercial blueprints to deliver highly available, future-ready cloud services.

Key responsibilities include safeguarding service uptime, driving automation, enhancing monitoring, and responding to critical incidents in a 24×7 environment. The role demands strong leadership, technical acumen across the full technology stack, and proven experience in large-scale service operations, with a focus on proactive system hardening and continuous improvement. Communication at all levels—including C-suite engagement—is essential, along with stakeholder management and mentoring junior team members.

Candidates must demonstrate expertise in compliance, Linux systems, cloud networking, programming/scripting languages, and Dev Ops tools. Experience supporting secure cloud environments and customer-facing web services at scale is required, along with a strong customer service orientation and the ability to thrive in high-pressure, evolving environments.

In summary:
This is a senior and highly technical leadership role, accountable for the design, resilience, compliance, and operational excellence of Oracle’s SaaS services for its most strategic customers.

Top Skills for the SPRE Architect Role:
  • Technical Leadership & Stakeholder Management Proven ability to lead teams, drive cross-functional collaboration, mentor junior members, and engage with stakeholders at all organizational levels, including the C-suite.
  • Cloud Architecture & Operational Excellence Deep expertise in designing, implementing, and maintaining large-scale, highly available, and secure cloud services.
  • Site Reliability Engineering (SRE) Principles Strong background in SRE best practices, including automation, monitoring, incident response, and continuous improvement.
  • Incident Management & Crisis Response Expertise in monitoring, diagnosing, resolving, and communicating about critical service incidents in a 24×7 environment.
  • Compliance & Security Fundamentals Knowledge of compliance standards relevant to enterprise cloud software and experience securing cloud infrastructure.
  • Analytical & Problem-Solving Skills Strong analytical abilities to troubleshoot complex systems, identify root causes, and develop resilient solutions.
  • Communication & Customer Service Orientation Excellent verbal and written communication skills, with the ability to clearly convey technical and business information to diverse audiences and ensure customer satisfaction during high-pressure situations.
  • Experience with Large-Scale, Customer-Facing Web Services Demonstrated experience operating and scaling major web-based services for enterprise customers.
  • Automation & Dev Ops Tools Familiarity with automation frameworks and Dev Ops tools (e.g., JIRA, Confluence) to streamline operations and maintain high availability.
  • Programming & Scripting Proficiency in one or more scripting/programming languages (e.g., Python, Bash, Powershell, Java, Ruby) to automate operational tasks and tooling.
  • Responsibilities
    • Design, develop, and maintain large-scale, highly available, and secure cloud services.
    • Lead cross-team collaboration for service resiliency, compliance, and operational excellence.
    • Safeguard service uptime by monitoring, automating, and responding to incidents around the clock.
    • Apply site reliability engineering best practices for automation, monitoring, and continuous improvement.
    • Resolve critical incidents and communicate effectively with stakeholders at all levels.
    • Ensure compliance and implement security measures for enterprise cloud environments.
    • Mentor junior team members and foster technical leadership within teams.
    • Engage directly with customers and stakeholders, including executives, to align solutions with business requirements.
    • Develop and support capacity planning, architectural standards, and system hardening across…
    To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
    (If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
     
     
     
    Search for further Jobs Here:
    (Try combinations for better Results! Or enter less keywords for broader Results)
    Location
    Increase/decrease your Search Radius (miles)

    Job Posting Language
    Employment Category
    Education (minimum level)
    Filters
    Education Level
    Experience Level (years)
    Posted in last:
    Salary