×
Register Here to Apply for Jobs or Post Jobs. X

Senior Site Reliability Engineer

Job in San Francisco, San Francisco County, California, 94102, USA
Listing for: OutSystems
Full Time position
Listed on 2026-06-02
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, SRE/Site Reliability, Network Engineer
Job Description & How to Apply Below
There are NO limits to your career: come shape the future and be part of a truly unique global culture at Out Systems!

Hybrid Onsite in Menlo Park, CA

Site Reliability Engineering Function

Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals of SRE are to create scalable and highly reliable systems. Our SREs ensure our production systems' reliability, performance, and scalability while enabling rapid development and deployment of new features and services.

SREs at Out Systems work closely with development teams, acting as an extension of the team, in adopting the reliability tenets with the shared goal of meeting Service Level Objectives (SLOs) and thus delivering a smooth and frictionless Customer Experience.

Site Reliability Engineer Role

As an SRE at Out Systems here are your key responsibilities and duties:

Lead and onboard services and teams to the reliability tenets;

Establish and maintain Service Level Objectives (SLOs) and Service Level Agreements (SLAs);

Design and implement scalable, reliable, and secure infrastructure, while ensuring cloud-native best practices;

Collaborate with software development teams to ensure systems are resilient (observable, fault-tolerant, recoverable, scalable) and performant;

Implement monitoring, alerting, logging, and tracing solutions to detect and respond to incidents;

Lead incident response efforts, ensuring quick resolution and minimal downtime, and conduct RCA/post-mortems;

Automate every operational task, with a special focus on fast incident detection & recovery;

Programming in Python supported by Gen AI tooling to accelerate development of mission critical automation and tools.

Foster a culture of continuous improvement and knowledge sharing;

Communicate effectively with stakeholders, providing updates on system reliability and performance;

Participate in on-call rotation to provide 24/7 support for production systems.

Site Reliability Engineering Performance Indicators

The main KPIs that aid in understanding the impact and success of the SRE function at Out Systems are:

SLA and Service Level Objectives (SLO) compliance;

SLO Coverage and Detection Ratio;

MTTA - Mean time to acknowledge;

MTTR - Mean time to resolve.

Qualifications and Skills

To illustrate the desired profile for a Site Reliability Engineer. Nevertheless, the selection of candidates will always vary depending on specific knowledge of the field and prior experience.

Qualifications

BS/MS in Computer Science or Equivalent

6+ years of experience in Site Reliability Engineering, managing infrastructure and services at scale

History of end-to-end project delivery

Experience managing Hadoop and Kubernetes infrastructure and related services, or equivalent experience

Advanced knowledge of Linux, Networking, and Containers

Proficiency in at least one high-level programming language (Python, GoLang etc.).

Strong troubleshooting and debugging skills.

Fluency in English and excellent communication skills.

Soft Skills

Communication - able to communicate effectively (in English) both orally and written showing empathy for the other person;

Collaboration - Proactive collaboration and presentation skills to effectively communicate ideas and represent the deliverables and needs of the SRE team with leadership.

Humbleness - accepts mistakes and acts accordingly, with a humble attitude, apologizing for them and mitigating them ASAP to avoid higher impact.

Accountability - takes ownership of problems and makes sure to see them through. Even if he does not have all the necessary knowledge to move on alone, can involve the right people to reach closure.

Negotiation Skills - has tough and politically complex conversations with colleagues and customers, defusing disagreements and leading towards a mutual agreement and understanding of all parties involved.

Process Oriented - is organized and able to properly follow defined processes, whilst being able to properly challenge inefficient processes and suggest improvements.

Problem-solving - Has a top-down approach to problems, breaking them into smaller pieces and solving…
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary