×
Register Here to Apply for Jobs or Post Jobs. X

Senior Site Reliability Engineer

Job in Irvine, Orange County, California, 92713, USA
Listing for: TP-Link
Full Time position
Listed on 2025-12-01
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing
Salary/Wage Range or Industry Benchmark: 140000 - 180000 USD Yearly USD 140000.00 180000.00 YEAR
Job Description & How to Apply Below

At the forefront of the future of connected living, TP-Link's Systems Inc. R&D Center in Irvine, Southern California's innovation hub, spearheads research and development of next‑generation networking, IoT smart home products, and software services. Our team of passionate engineers constantly innovates, engineering solutions that transform the end‑user experience with simpler, smarter, and more reliable connectivity.

We're looking for a passionate and experienced Senior Site Reliability Engineer to join our team and play a crucial role in ensuring our cloud platform's security, reliability, scalability, and operational excellence.

About Us

Headquartered in the United States, TP-Link Systems Inc. is a global provider of reliable networking devices and smart home products, consistently ranked as the world’s top provider of Wi‑Fi devices. The company is committed to delivering innovative products that enhance people’s lives through faster and more reliable connectivity. With a commitment to excellence, TP‑Link serves customers in over 170 countries and continues to grow its global footprint.

We believe technology changes the world for the better! At TP‑Link Systems Inc, we are committed to crafting dependable, high‑performance products to connect users worldwide with the wonders of technology. Embracing professionalism, innovation, excellence, and simplicity, we aim to assist our clients in achieving remarkable global performance and enable consumers to enjoy a seamless, effortless lifestyle.

Responsibilities
  • Serve as a technical SME for implementing and operating Microservices on Kubernetes cloud‑based platforms.
  • Collaborate with the Cloud Technical Development and Dev Ops teams to deploy services to the Multi‑Cloud Platform.
  • Perform Load Tests and Chaos Tests to ensure the scalability and reliability of microservices.
  • Build observability for microservices and cloud platforms like AWS, OCI, Azure, and GCP.
  • Write and execute disaster‑recovery plans in collaboration with the Development and Dev Ops team.
  • Analyze and resolve production risks caused by insufficient resources, such as node groups, CPU, memory, HPA scheduling, and JVM pre‑warming.
  • Write and maintain scripts for automation using languages such as Python, Go, or Bash.
  • Define and maintain the KPIs (SLA/SLO/SLI) for all cloud microservices with development teams to better understand the business.
  • Create and maintain technical documentation, including architecture diagrams, design documents, and standard operating procedures.
  • Guarantee adherence to security and compliance standards, including ISO
    27001, SOC2, and GDPR.
  • Lead incident response efforts to troubleshoot and resolve production issues quickly.
  • Perform post‑incident analysis to identify root causes and potential workarounds/solutions.
  • Assist with product/technology selection, including implementation of POCs.
  • Be fluid and open to change and evolving processes and tools.
  • Help mentor and train less senior members of the team.
  • Ability to be part of on‑call rotation and provide support after work hours and on weekends.
  • Other duties as assigned.
Requirements
  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • 5+ years of experience as a Site Reliability Engineer.
  • Proficiency in programming and scripting languages such as Java, Python, Bash, or Power Shell.
  • Hands‑on experience in SRE, Dev Ops, cloud operations, and cloud security best practices.
  • Strong knowledge of security technologies, including identity and access management, network security, application security, and data protection.
  • Strong problem‑solving and analytical skills, with the ability to work independently and as part of a team.
  • Experience in developing and maintaining technical documentation and implementing compliance requirements.
Additional Skills (Preferred)
  • Expert‑level cloud certifications such as AWS Solutions Architect, Professional, Azure Solutions Architect Expert, and GCP Professional Cloud Architect.
  • Experience with container orchestration technologies (e.g., Kubernetes).
Benefits
  • Salary range: $140,000 - $180,000.
  • Free snacks and drinks, and provided lunch on Fridays.
  • Fully paid medical, dental, and vision…
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary