×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer Sr

Job in Friday Harbor, San Juan County, Washington, 98250, USA
Listing for: Hewlett Packard Enterprise Company
Full Time position
Listed on 2026-02-18
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below
Position: Site Reliability Engineer Sr. Staff
Location: Friday Harbor

Site Reliability Engineer Sr. Staff

Who We Are

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today's complex world. Our culture thrives on finding new and better ways to accelerate what's next.

We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.

Job Description Job Family Definition

Designs, develops, troubleshoots and debugs software programs for software enhancements and new products. Develops software including operating systems, compilers, routers, networks, utilities, databases and Internet-related tools. Determines hardware compatibility and/or influences hardware design.

Management Level Definition

Contributions impact technical components of HPE products, solutions, or services regularly and sustainable. Applies advanced subject matter knowledge to solve complex business issues and is regarded as a subject matter expert. Provides expertise and partnership to functional and technical project teams and may participate in cross-functional initiatives. Exercises significant independent judgment to determine best method for achieving objectives. May provide team leadership and mentoring to others.

In

a typical day as a Site Reliability Engineer Staff, you would

As a Staff Software Engineer, you will play a key role in designing, building, and optimizing cloud infrastructure and deployment systems. Your work will directly impact scalability, security, and operational efficiency across our platforms.

Key responsibilities include:

  • Enhance Infrastructure as Code (IAC) and enforce best practices.
  • Optimize cloud infrastructure for scalability, security, and cost-effectiveness.
  • Develop internal tools to support and streamline cloud platform operations.
  • Improve CI/CD pipelines and deployment workflows using FluxCD and Jenkins.
  • Address container image vulnerabilities and standardize remediation processes.
  • Build Amazon Machine Images (AMIs) aligned with CIS and STIG benchmarks.
  • Strengthen monitoring, alerting, and observability using Prometheus, Grafana, and logging tools.
  • Troubleshoot complex production issues to ensure system reliability and customer satisfaction.
  • Fine-tune distributed systems such as Apache Kafka and Cassandra.
  • Collaborate with development, security, and operations teams to align infrastructure with application needs.
What you need to bring
  • Minimum of 10 years of hands‑on experience in Infra Ops, Dev Ops, or Site Reliability Engineering (SRE).
  • Proficiency with Linux systems, especially Debian-based distributions.
  • Strong experience with cloud platforms such as AWS and GCP.
  • Expertise in Infrastructure as Code tools like Terraform, Packer, and Ansible.
  • Solid programming skills in Python and/or Golang.
  • Deep understanding of containerization (Docker, Container) and orchestration tools (AWS EKS, GCP GKE).
  • Experience with Git Ops workflows.
  • Proven track record in implementing and maintaining CI/CD pipelines.
  • Strong background in security and familiarity with security programs.
  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK).
  • Knowledge of both relational (SQL) and non-relational databases.
  • Excellent problem-solving and debugging skills with a strong sense of ownership.
  • Experience managing distributed systems like Apache Kafka and Cassandra.
  • Effective communicator and collaborative team player.
  • It is mandatory to attend to San Juan office twice a week.
Preferred Qualifications
  • Experience contributing to open-source projects.
  • Background in security engineering or related disciplines.
Additional Skills

Cloud Architectures, Cross Domain Knowledge, Design Thinking, Development Fundamentals, Dev Ops, Distributed Computing, Microservices Fluency, Full Stack Development, Security-First Mindset, Solutions Design, Testing &…

To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary