Site Reliability Engineer
Job in
Sunnyvale, Santa Clara County, California, 94087, USA
Listed on 2026-06-12
Listing for:
Diverse Lynx
Full Time position
Job specializations:
- IT/Tech
Systems Engineer, Cloud Computing: Infrastructure & Operations, SRE/Site Reliability, Network Engineer
Job Description & How to Apply Below
Sunnyvale, CA (3x/week onsite)
Duration: 6 months
SRE - Site Reliability Engineer
Responsibilities:
Engage with our product teams to understand requirements, design and implement resilient and scalable infrastructure solutions.
Operate, monitor, and triage all aspects of our production and non-production environments.
Collaborate on code, infrastructure, design reviews, and process enhancements.
Evaluate and integrate new technologies to improve system reliability, security, and performance.
Develop and implement automation to provision, configure, deploy, and monitor services.
Participate in an on-call rotation, providing hands-on technical expertise during service-impacting events.
Contribute to capacity planning, scale testing, and disaster recovery exercises.
Approach operational problems with a software engineering mindset.
Minimum Qualifications:
5+ years in an Infrastructure Ops, Site Reliability Engineering, or DevOps-focused role.
BS degree in computer science or equivalent field with 5+ years of experience.
Knowledge of Linux operating system principles, networking fundamentals, and systems management.
Demonstrable fluency in at least one of the following languages: Java, Python, or Go.
Experience in managing and scaling distributed systems in a public, private, or hybrid cloud environment.
Familiarity with microservices architecture and container orchestration with Kubernetes.
Awareness of key security principles, including encryption and keys (types and exchange protocols).
Understanding of SRE principles, including monitoring, alerting, error budgets, fault analysis, and automation.
Strong sense of ownership, with a desire to communicate and collaborate with other engineers and teams.
Ability to identify and communicate technical and architectural problems, while working with partners and their team to iteratively find solutions.
Experience implementing automation.
Scripting experience in Python.
Enjoy building partnerships.
Apple experience is preferred.
Role Description: We are seeking a DevOps Site Reliability Engineer (SRE) with strong experience in containerization, orchestration, and automation. The ideal candidate will have hands-on expertise in Kubernetes, Docker, and Python, and will be responsible for building scalable infrastructure, automating operations, and ensuring high availability of production systems.
Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.