Job Description & How to Apply Below
Enhance system performance as a Site Reliability Engineer, leveraging expertise in networking and incident management. Utilize tools like Dynatrace and Grafana for monitoring and observability.
In this senior role, you'll oversee incident management processes and ensure application reliability in a fast-paced environment. Drawing on more than 10 years of experience, you'll perform root cause analysis and focus on long-term fixes that reduce recurring incidents. You'll collaborate directly with development and operations teams to improve CI/CD pipelines and infrastructure management.
Key Responsibilities:
• Lead incident management for high-availability applications
• Monitor systems using Dynatrace, Splunk, and Grafana
• Conduct thorough root cause analysis of outages
• Streamline CI/CD pipelines and automate workflows
• Provide networking expertise for infrastructure design
Requirements:
• Bachelor's or Master's degree in Computer Science
• Over 10 years in an SRE/DevOps role
• Proficient with Dynatrace, Splunk, and Grafana
• Strong skills with network debugging tools such as Wireshark
• Excellent communication and leadership abilities
Drive system reliability and innovative practices as a key SRE contributor.