×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer

Job in San Mateo, San Mateo County, California, 94404, USA
Listing for: Omega Solutions, Inc.
Full Time position
Listed on 2026-07-01
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing: Infrastructure & Operations, SRE/Site Reliability, IT Support
Job Description & How to Apply Below

Site Reliability Engineer

As a Site Reliability Engineer, you have a mindset to maximize system availability through both proactive and reactive means: you build robust technical support and automation to eliminate or minimize incidents, as well as investigate and resolve issues in response to live incidents. You are comfortable working with software engineering teams and supporting their demanding needs to ensure the security, availability and performance of the platform.

Responsibilities:
As a Staff Site Reliability Engineer:

  • You will identify and support all site reliability request related to Visa Cloud Platform services (IaaS/PaaS/Container as a service)
  • You will lead/determine and develop architectural approaches, Infrastructure solutions to improve the availability, scalability, latency and efficiency of Visa Cloud Platform services
  • You will partner closely with software and systems engineers across the organization to ensure services/systems are highly stable and performant
  • Mentor other team members on managing end-to-end availability and performance of mission critical services while working on individual projects priorities, deadlines, and deliverables
  • Strong communication skills with a strong sense of urgency and attention to details

Basic Qualifications:

  • B.S. or higher in Computer Science or other technical discipline, or related practical experience
  • 8+ years' experience in Site Reliability or Production Engineering group for high availability/critical platforms/applications
  • Java experience is must and supporting applications
  • Have strong hands on experience in Linux and Windows systems to patch and troubleshoot issues
  • Expert knowledge in CI/CD and hands on implementation experiences
  • Hands on experience with container related technologies like Docker, Kubernetes
  • Hands on experience on how to monitor software and Infrastructure and its related tools such as Prometheus, Grafana, Splunk and ELK
  • Live in terminal and ability to script/debug in Shell/Power Shell
  • Working knowledge of relational and non-relational databases, including creating and running queries [MySQL and NOSQL]
  • Working knowledge of web/middleware servers like Nginx and Tomcat
  • Experience with configuration management tools such as Chef/Ansible
  • Experience of working with ITIL disciplines (Event, Incident, Problem, & Change)
  • Have an urge to document all the things so you don't need to learn the same thing twice
  • Have an enthusiastic, go-for-it attitude
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary