Operations/SRE Engineer
Listed on 2026-07-01
IT/Tech
Systems Engineer, Cloud Computing: Infrastructure & Operations, SRE/Site Reliability, IT Support
Job Title
3+ years of experience in Incident Management, Change Management, and Problem Management.
1-2 years of experience in Infrastructure Support, Configuration, and Release Management.
2-3 years of hands-on experience with tools including Splunk, Grafana, Loki, AppDynamics, or other APM solutions.
2+ years of experience supporting applications built in on-prem and cloud-native environments.
Able to code in Java, SQL, PromQL, Shell, and Python.
Root cause analysis, management communication and client relationship management in partnership with Infrastructure Support team members.
Ensuring all production changes are made in accordance with life-cycle methodology and risk guidelines.
Applicant should have a full understanding of various API and middleware platforms such as Apigee, Vordel, DataPower, and Node.js.
Applicant should have expertise in configuring, supporting, and managing Rancher, Kubernetes, and Docker containerization.
Ability to provide on-call production support and work in a 24/7 support environment.
Understanding and working knowledge of infrastructure environments.
Excellent problem management skills and a relentless drive to find root causes and execute measures that reduce repeat occurrences.
Good communication (verbal/written) and interpersonal skills.
Required Skills:
Reliability
Additional Skills:
Reliability Engineer
This is a high-priority requisition.