Site Reliability Developer
Listed on 2026-01-02
IT/Tech
Cloud Computing, Technical Support
Job Description
We are building the next-generation IaaS cloud and the next-generation cloud support experience to go with it. We are assembling a team of energetic, customer-focused site reliability engineers to deliver a world-first, best-in-class customer experience that blends system administration, database engineering, and cloud disciplines. You’ll be part of a team that learns deeply how our cloud platform works so you can be the bridge between Engineering and Operations.
As part of the broader Engineering organization, you will act as the voice of the customer to influence product features and plans to improve customer experience. This role is integral to the success of our customer relationships and is critical to the success of the platform.
This role will support Oracle’s customers.
Responsibilities
- Create and implement processes and solutions to resolve problems faster and provide efficient incident management.
- Act as a point of escalation for incidents and other issues arising within the region
- Troubleshoot and resolve issues across the stack from networking to applications.
- Automate operations and infrastructure management
- Ensure thorough documentation of incidents through company-standard reporting methods.
- Set up alerts and create or update on-call runbooks
- Develop cross-functional relationships with key stakeholders within the company including customer operations and product teams
- Deploy code and execute other changes within the region.
- Operate and perform maintenance on cloud database services running within the region.
- Act as customer champion and translate customer needs to technical requirements or enhancements to the cloud database services.
- Drive and actively participate in the resolution of complex technical issues spanning various services
- Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services.
- Partner with development teams in defining and implementing improvements in service architecture.
- Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack.
- Use a deep understanding of service topology and its dependencies to troubleshoot issues and define mitigations.
- Bring professional curiosity and a desire to develop a deep understanding of services and technologies.
- Demonstrate a clear understanding of automation and orchestration principles.
- Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs).