×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer

Job in San Diego, San Diego County, California, 92189, USA
Listing for: Sony Interactive Entertainment
Full Time position
Listed on 2026-05-16
Job specializations:
  • Software Development
    Cloud Engineer - Software, DevOps, Software Engineer
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below
Position: Staff Site Reliability Engineer

Why Play Station?

Play Station isn't just the best place to play - it's also the best place to work. Today, we're recognized as a global leader in entertainment producing the Play Station family of products and services, including Play Station 5, Play Station 4, Play Station VR, Play Station Plus, acclaimed Play Station software titles from Play Station Studios, and more.

Play Station also strives to create an inclusive environment that empowers employees and embraces diversity. We welcome and encourage everyone who has a passion and curiosity for innovation, technology, and play to explore our open positions and join our growing global team.

The Play Station brand falls under Sony Interactive Entertainment, a wholly‑owned subsidiary of Sony Group Corporation.

Staff Site Reliability Engineer

San Diego, CA

As a member of the Commerce Reliability Engineering team, you will carry the responsibility of keeping our monetization platform highly available and resilient, while continually enabling our service teams to deliver new and exciting product and technical features. Our team strives to iteratively learn, improve and automate our processes every single day, which continually improves operational excellence within our organization. You will be empowered to be a technical leader on our team, helping identify and proactively drive improvements in both process and technology.

Responsibilities
  • Apply, integrate and automate the configuration and ongoing operations of AWS managed services.
  • Identify areas for operational process improvement and automation. Drive the hands‑on development of scripts and tools to automate these processes within our environment.
  • Increase observability on our platform by implementing robust monitoring and alerting patterns across our services. Develop rich, informative dashboards and reports on our services that provide valuable insight, and develop meaningful alerting patterns to drive down the MTTD and MTTR on platform incidents.
  • Collaborate and partner with other SRE teams that specialize in areas such as data services, data platform, and platform hosting to inspire changes and ensure optimal application performance and resiliency across all back‑end services within Play Station.
  • Iteratively lead performance and capacity validation analysis for our commerce platform services. Utilize AWS patterns and technologies such as spot instances, dynamic auto‑scaling and EKS to efficiently make the most of our AWS spend.
  • Review service flows and architecture to influence resiliency, availability and scalability for all services within our platform.
  • Provide rotational on‑call support where you'll respond, detect, triage and resolve production incidents on the commerce and payments platform.
  • Conduct, document and present root cause analysis documents to share incident insights and findings with our broader engineering organization.
Qualifications
  • BS degree in Computer Science, Engineering, or related technical subject area.
  • 7+ years hands‑on AWS experience – integrating, developing and managing applications.
  • 10+ years of relevant work experience in a high‑volume and/or critical production, software environment.
  • 10+ years of hands‑on software engineering or systems engineering experience (Java and/or C++ services).
  • 5+ years of experience with building automation into daily operational processes through one or more programming languages (preferably Python or Go).
  • Strong experience in configuring, tuning and automating operational responsibilities for AWS managed data services including RDS, Dynamo

    DB and Elasticache.
  • Experience with monitoring and log management tools (e.g., Data Dog, Cloud Watch, Splunk).
  • Experience with container technologies and orchestration (e.g., Docker, Kubernetes, EKS, Fargate).
  • Hands‑on experience in triaging and tuning Java cloud applications with integration into AWS.
  • Solid understanding of AWS networking systems and protocols (e.g., ALB, R53, API‑Gateway, TCP/IP, HTTP/HTTPS, DNS).
  • Experience with developing or supporting Continuous Integration and Continuous Delivery/Deployment pipelines (CI/CD).

Please refer to our Candidate Privacy Notice for more information about how we…

To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary