
ECS Site Reliability Engineer

Job in Seattle, King County, Washington, 98127, USA
Listing for: Alibaba Cloud
Full Time position
Listed on 2026-02-16
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer
Salary/Wage Range or Industry Benchmark: 150000 - 200000 USD Yearly
Job Description & How to Apply Below

Global Talent Acquisition Talent Sourcer

Elastic Compute Service (ECS) is a core product of Alibaba Cloud. The Elastic Compute team is dedicated to building world‑leading cloud computing infrastructure. As a key component of Alibaba Cloud's self‑developed Apsara operating system, Elastic Compute Service (ECS) provides full‑stack computing resources covering virtual machine instances, container services and heterogeneous computing clusters.

Through technological innovation and product optimization, the Alibaba Cloud Elastic Compute team continuously drives advancements in cloud computing technologies, delivering high‑quality computing services to users worldwide. Our goal is not only to support enterprises in achieving elastic scalability but also to deeply empower infrastructure innovation in the new era. Our mission is to build an intelligent foundation of "Computing as a Service," enabling developers to focus on business breakthroughs without worrying about the complex engineering implementations from chips to clusters.

SRE Team

The Alibaba Cloud Elastic Compute Service (ECS) SRE (Site Reliability Engineering) team is a critical force in ensuring system stability and reliability. The SRE team focuses on guaranteeing the high availability, high performance, and robust stability of ECS products through technical expertise and innovation.

The Alibaba Cloud ECS SRE team is not only a core technical safeguard but also a driver of technological innovation and continuous optimization. By leveraging technical capabilities and collaborative teamwork, we ensure the stability and reliability of ECS products, safeguarding global customers' businesses. Additionally, we are committed to advancing cloud computing technologies through knowledge sharing and industry collaboration.

Joining the Alibaba Cloud ECS SRE team offers the opportunity to engage in the development and optimization of world‑leading cloud computing technologies while growing alongside a passionate and creative team.

Responsibilities
  • Stability, Performance Optimisation, Monitoring and Operations:
    Oversee the stability, performance optimisation, monitoring, and operational work for multiple core Alibaba Cloud products (e.g. ECS, ACK, ACS, heterogeneous computing clusters, OOS, Compute Nest), taking responsibility for the online stability of these products.
  • Operations System and Online System Development:
    Engage in the development of operations systems and some online systems. Through tooling, process optimisation, and system improvements, ensure the stability and performance of Alibaba Cloud's Elastic Computing‑related products.
  • Customer and Team Collaboration:
    Work closely with other teams (e.g. R&D, after‑sales support) to ensure efficient technical support and problem resolution.
  • Optionally, take responsibility for one or more core duties based on expertise, while demonstrating cross‑team collaboration skills and system‑level thinking abilities.
Qualifications
  • Bachelor’s degree or higher in Computer Science, Information Technology, or a related field.
  • At least 3 years of experience in system operations or SRE, with familiarity with cloud computing services and core products (e.g. ECS, K8S, heterogeneous computing).
  • Familiarity with the design and optimisation of cloud resource provisioning and delivery systems; experience in serving overseas customers is preferred.
  • In‑depth understanding of the overall architecture and operational mechanisms of the elastic computing product line, with the ability to quickly identify and resolve complex issues.
  • Possession of cloud‑related certifications (e.g. ACP, ACE, or other major cloud vendor certifications).
  • Participation in the architectural design or performance optimisation projects of large cloud platforms.
  • Outstanding contributions in system stability assurance, automation tool development, or cloud‑native domains are highly valued.
Position Highlights
  • Deeply engage in the core operations of Alibaba Cloud's elastic computing product line, ensuring service stability for global users.
  • Explore cutting‑edge technologies in virtualization, containerisation, and cloud‑native computing, driving technological innovation.
  • Gro…