Principal Site Reliability Engineer

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: Early Warning Services LLC
Full Time position
Listed on 2025-11-27
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer, IT Support
Salary/Wage Range or Industry Benchmark: 206,000 - 258,000 USD yearly
Positions located in Scottsdale, San Francisco, Chicago, or New York follow a hybrid work model to allow for a more collaborative working environment.

Candidates responding to this posting must independently possess the eligibility to work in the United States, for any employer, at the date of hire. This position is ineligible for employment Visa sponsorship.
**Overall Purpose**
The Principal Site Reliability Engineer partners with development teams by designing availability and resiliency patterns in applications and infrastructure.
**Essential Functions:**

* Design and implement software and tools to improve performance (availability, scalability, and latency) while delivering end products to customers with the highest efficiency and meeting all security standards.
* Support the company’s commitment to risk management and protecting the integrity and confidentiality of systems and data.
* Build automation and tooling around application management, such as deployments, configuration changes and disaster recovery scenarios.
* Design, implement, and evangelize observability and monitoring systems to proactively detect problems and identify their causes.
* Evaluate capacity of the application on a continuous basis to provide stats to the Product/Business teams and recommend an efficient path to scale for future needs.
* Identify performance bottlenecks and work with cross-functional teams to troubleshoot and resolve issues.
* Serve as a technical liaison for the application and provide documents and runbooks to Level 1 and Level 2 teams.
* Participate in a 24x7 on-call rotation.
* Be a champion of excellent processes; take the initiative in developing repeatable patterns and standard, re-usable work across teams.
* Work directly with application development teams to provide feedback and technical requirements to the software development lifecycle, implementing best-practice microservice design patterns and other modern software development approaches.
* Understand and support the adoption of best-practice microservice design patterns and other modern software reliability approaches and techniques.
* Be a thought leader: a senior point of expertise on site reliability engineering issues, industry trends, and developing technologies. Be a role model to others on the team. Coach and mentor team members.
**Minimum Qualifications**
* Education and experience typically obtained through completion of a Bachelor’s Degree in Business and/or Computer Science or related field.
* 12+ years of related experience managing large complex projects in a technical or software development environment inclusive of post-graduate degree
* Proven ability to lead a team through high-priority incidents and improve the RCA process
* Excellent troubleshooting skills and proven experience resolving technical issues in complex environments
* Hands-on experience in designing and developing using one or more of the following technologies:
    - Python, Go, Java
    - Docker
    - Experience in microservices architecture
    - Messaging frameworks such as Kafka, SQS, or JMS
    - Database technologies such as Oracle, DynamoDB, Aurora, etc.
    - Caching layers such as Redis and memcached
* Strong understanding of Linux administration
* Experience with CI/CD pipeline implementation, including Git, Chef, Maven, Jenkins, etc.
* Strong understanding of networking fundamentals
* Experience in leading cross-functional teams to create technical solutions.
* Proven track record designing and building complex end-to-end systems (full stack developer)
* Background and drug screen
**Preferred Qualifications**
* Good programming skills in one or more of the following languages: Java, Ruby, Python, JavaScript, and Go
* Hands-on experience in supporting applications in a 24x7 customer-facing production environment.
* Working knowledge of AWS, Docker, Kubernetes, Swarm

The base pay scale for this position in:
Phoenix, AZ / Chicago, IL in USD per year is: $172,000 - $215,000.
New York, NY / San Francisco, CA in USD per year is: $206,000 - $258,000.
Additionally, candidates are eligible for a discretionary incentive plan and benefits.
**Physical Requirements**
Employee must be…