Site Reliability Developer
Listed on 2026-01-02
-
IT/Tech
Cloud Computing, Systems Engineer
Job Description
Proficiency in languages such as Python, Java, or Bash for automation and tooling.
Strong administration skills in Linux/Unix environments and Oracle database.
Monitoring and optimizing system performance.
Experience with monitoring, alerting, and logging tools (e.g. Splunk, OCI Monitoring).
Root cause analysis, drafting and executing runbooks, and participating in on-call rotations.
Analytical and problem-solving mindset.
Continuous learning and adapting to evolving cloud technologies.
Strong collaboration and communication skills, working closely with other engineers, product, and support teams.
Basic understanding of cloud security best practices.
Analyzing system/application metrics and logs for incident response and performance tuning.
Familiarity with cloud architecture, resource orchestration, and multi-region/multi-availability domain deployments, Kubernetes, and OCI.
ResponsibilitiesWork with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture.
Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation points for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs).
Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Professional curiosity and a desire to develop a deep understanding of services and technologies.
Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
Range and Benefit InformationRange and benefit information provided in this posting are specific to the stated locations only.
US:
Hiring Range in USD from: $79,100 to $158,200 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle’s differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment.
Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).