Job Description & How to Apply Below
Role: Site Reliability Engineer (SRE)
Location:
Hyderabad – Marriott Office
Work Mode: 5 Days Work From Office
Experience
Required:
7+ Years
Notice Period: Immediate to 15 Days
Budget: Up to 25 LPA
About the Role
we are looking for a highly skilled Site Reliability Engineer (SRE) to manage and enhance the reliability, scalability, and performance of cloud-based production systems. The ideal candidate will have strong experience in AWS, automation, infrastructure as code, and monitoring tools to ensure highly available and resilient systems.
Key Responsibilities:
Design, implement, and maintain scalable and highly available infrastructure on AWS.
Automate infrastructure provisioning and configuration using Terraform and Ansible.
Develop automation scripts using Python and Bash for operational efficiency.
Deploy, manage, and optimize containerized workloads using Kubernetes.
Design, implement, and maintain robust CI/CD pipelines for reliable deployments.
Monitor system health, performance, and availability using tools like Dynatrace, Prometheus, Grafana, and ELK stack.
Perform incident management, root cause analysis, and implement preventive solutions.
Collaborate with development and engineering teams to improve system reliability and performance.
Ensure adherence to cloud security, reliability, and operational best practices.
Required Skills:
7+ years of experience in Site Reliability Engineering, Dev Ops, or related roles.
Strong hands-on expertise in AWS services and scalable architecture design.
Proficiency in Python and Bash scripting for automation.
Hands-on experience with Terraform, Kubernetes, and Ansible.
Strong experience in CI/CD pipeline design and release engineering.
Experience with monitoring and observability tools such as Dynatrace, Prometheus, Grafana, ELK, or similar platforms.
Strong troubleshooting, analytical, and problem-solving skills.
Preferred Qualifications:
Prior experience working as a Site Reliability Engineer (SRE).
Experience managing production environments with high availability.
Strong understanding of cloud security and reliability best practices.
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×