Senior Site Reliability Engineer; SRE
Job in
Plano, Collin County, Texas, 75086, USA
Listed on 2026-06-05
Listing for:
Apex Systems
Full Time
position Listed on 2026-06-05
Job specializations:
-
IT/Tech
Systems Engineer, Cloud Computing – Infrastructure & Operations
Job Description & How to Apply Below
##
Job Description # Senior Site Reliability Engineer (SRE)
* Perform full-stack triaging of alerts to identify the root cause of application performance and stability issues.
* Work with stakeholders to define and track service level objectives (SLOs) for application features and services.
* Design and develop dashboards and reports to communicate key performance metrics.
* Identify opportunities to improve alerting posture and update alerts accordingly.
* Collaborate with the engineering team to understand application architecture and perform single point of failure analysis.
* Create and derive NFR/Workload models to ensure performance and resiliency are considered early in the software development lifecycle.
* Execute performance and chaos tests, analyzing results with APM tools to identify stability issues.
* Document and present findings, analysis, and results to stakeholders.
* Perform analytics on past incidents to understand root causes and implement automation to prevent recurrence.
* Demonstrate proficiency with Dev Ops tools including JIRA, BMC Remedy, and Service Now.
* Experience triaging production issues using APM tools (Dynatrace, App Dynamics, Prometheus, Grafana, New Relic) and log aggregation tools (Splunk, ELK).
* Expertise in Java and front-end development (React JS, Angular).
* Experience with Apache/Tomcat middleware and Java/RESTful services.
* Backend database experience with Oracle, SQL Server, or Hadoop.
* Proficiency in Python, UNIX, and Perl/Shell scripting.
* Experience with CI/CD tools such as Bitbucket, JFrog Artifactory, Jenkins, and Ansible.
* Familiarity with SRE concepts like SLI/SLOs and error budgets.
* Experience with Agile/Scrum methodology.
* Proficiency in system, network, security, and database operations.
* Experience with tools such as Tanium, Artifactory, and BMC True Sight Orchestration.
* Experience with command-line interfaces (CLI) and third-party API integration.
* Server administration experience with Red Hat Enterprise Linux and Windows Server.
* Understanding of developing fault-tolerant solutions, horizontal scaling, and high availability.
* College Degree or equivalent work experience.
* Everforth Apex is a world-class IT services company that serves thousands of clients across the globe. When you join Everforth Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package.
Our commitment to excellence is reflected in many awards, including Clearly Rated's Best of Staffing(R) in Talent Satisfaction in the United States and Great Place to Work(R) in the United Kingdom and Mexico. Everforth Apex uses a virtual recruiter as part of the application process. Click
* * for more details.
* #J-18808-Ljbffr
Position Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×