More jobs:
Site Reliability Engineer
Job in
Oakland, Alameda County, California, 94601, USA
Listed on 2026-07-01
Listing for:
Samprasoft
Full Time
position Listed on 2026-07-01
Job specializations:
-
IT/Tech
Systems Engineer, Cloud Computing: Infrastructure & Operations, SRE/Site Reliability, IT Support
Job Description & How to Apply Below
Site Reliability Engineer
Site Reliability Engineer with well-developed organizational, analytical and problem-solving skills for a multi-year engagement with a foremost Healthcare IT Solutions group based in the San Francisco Bay Area.
A highly motivated professional who can drive incidents to resolution by collaborating with multiple engineering teams.
Analytical mindset and good problem-solving skills, excellent written and verbal communication & ability to work as part of a team.
Excellent multitasking skills with the ability to prioritize projects with high pressure deadlines.
Responsibilities:
- Working closely with counterparts in the Infrastructure and Application teams, the Site Reliability Engineer will help build a more sustainable platform through the development of systems that analyze our environments, predict future problems, and actively support the production environment
- Experience working with automating system administration tasks using scripting tools such as Python or shell (preferred).
- Experience with monitoring and automation tools further analyze real time issues.
- Monitoring and Metrics in Prometheus, Grafana and integrations with Service Now.
- Work with other teams to make sure that the infrastructure and applications that depend on it work together seamlessly.
- Support other team’s infrastructure needs on an as-needed basis.
- Use and develop tools for systems continuous delivery automation.
- System Administration on Linux (CentOS) and Windows Server.
- Proficient with Dev Ops tools and environments like Team City/Jenkins, Git.
- Experience with monitoring implementations and administration.
- Ability to discuss and resolve technical issues and escalations with other technical staff as the needs arise.
- Work to automate detection and resolution of recurring issues in the production environment.
Prerequisites:
- 5 to 7 years of production support experience.
- Working experience in using Tomcat, Git, Splunk, Jenkins & Team City.
- Advanced knowledge of Relational Databases and SQL.
- Experience in Shell Scripting.
- Experience in maintaining monitoring and alert systems.
- Experience troubleshooting relational databases and distributed platforms.
- Experience in maintaining Java applications.
- Experience in Docker orchestration and management.
- Experience with Kubernetes.
Education:
- Bachelor of Science degree in engineering or a related technical area
- Master’s degree preferred
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×