
Site Reliability Engineer - Bilingual - Portuguese and English

Job in Town of Poland, Jamestown, Chautauqua County, New York, 14701, USA
Listing for: nClouds
Full Time position
Listed on 2026-02-16
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly
Location: Town of Poland

Summary

The SRE team is responsible for availability, reliability, performance, monitoring, change management, and emergency response for infrastructure and applications, and for reducing manual work by implementing SRE principles and practices. The team works directly with Dev/DevOps teams, Operations teams, Product teams, and other teams to deploy new features and changes and to maintain infrastructure, operations, CI/CD, and IaC, so that availability and reliability targets are met and SLOs and SLAs are protected.

We use a variety of DevOps automation tools such as Ansible, Docker, Kubernetes, Terraform, and Jenkins, along with cloud vendor-specific tools such as ECS, CloudFormation, EKS, OpsWorks, and Elastic Beanstalk. The Site Reliability Engineer is capable of implementing observability, SLOs, SLIs, SLAs, and disaster recovery and backup plans in cloud environments, mainly AWS.

Deliverables
Key Responsibilities:
  • Ensure the availability and reliability of distributed systems.
  • Help the L1 team resolve clients’ infrastructure and system issues, escalations, alerts, tickets, and queries.
  • Work as a bridge between DevOps and other teams to build and maintain resilient systems.
  • Conduct, coordinate, and oversee post-incident root cause analyses and reviews.
  • Build and maintain documentation for all assigned clients and projects.
  • Leverage DevOps and Agile methodologies and standards in day-to-day work.
  • Propose and adopt automation of repetitive tasks to reduce or eliminate toil.
  • Implement and troubleshoot using observability tools such as Datadog, New Relic, Splunk, and CloudWatch.
  • Adopt SRE practices and ensure the team follows them.
  • Maintain AWS managed resources, CI/CD, and IaC.
  • Plan and implement disaster recovery and backup plans for AWS cloud platforms.
  • Proactively work on efficiency and capacity planning.
  • Take a proactive approach to spotting problems, areas for improvement, and performance bottlenecks.
  • Liaise and work closely with the Layer-1 on-call support, DevOps, and Operations teams.
  • Drive availability and reliability by defining and implementing SLIs, SLOs, error budgets, observability, disaster recovery, and backups to detect and mitigate issues.
Qualifications:
  • Bachelor’s degree in computer science (preferred) or an equivalent management, technical, or scientific discipline.
  • Ability to program (structured and OO) in one or more high-level languages, such as Python, Java, C/C++, Ruby, and JavaScript.
  • Clear understanding of SRE principles and practices and of Agile and DevOps methodologies.
  • Experience with the AWS Well-Architected Framework for implementing scalable and reliable infrastructure.
  • Great team player with a flexible approach to work.
  • Excellent written/verbal communication and leadership skills.