Senior Site Reliability Engineer - Observability & Monitoring
Job in Plano, Collin County, Texas, 75023, USA
Listed on 2026-06-06
Listing for: Stratacuity
Full Time position
Job specializations:
* IT/Tech: Systems Engineer, IT Support
Job Description:
Senior Site Reliability Engineer - Observability & Monitoring
Location: Plano, Texas (Onsite)
Employment Type: 12-Month Contract
Role Overview
We are seeking an experienced Observability and Monitoring Site Reliability Engineer to help design, implement, and operationalize monitoring for an enterprise Event Management platform. This role will focus on defining observability coverage, implementing monitoring instrumentation, building operational dashboards, and improving visibility across platform components, integrations, and services. The primary tools for this role are Dynatrace and Splunk.
Key Responsibilities
* Define and implement monitoring and observability coverage for the Event Management platform.
* Establish standards for metrics, logs, traces, events, synthetic checks, and platform telemetry.
* Build monitoring for IBM Cloud Pak for Watson AIOps, Netcool OMNIbus, Netcool Impact, OpenShift, Linux, Kafka-based services, and ServiceNow integration points.
* Design and maintain Dynatrace monitoring for applications, infrastructure, synthetic checks, and platform dependencies.
* Design and maintain Splunk searches, dashboards, alerts, log onboarding patterns, and operational views.
* Create OpenShift and Kubernetes monitoring using available platform metrics, Prometheus, and Grafana.
* Monitor Linux-based platform components, including processes, services, file systems, and resource utilization.
* Monitor Kafka-based integrations, including topic health, consumer lag, and message throughput.
* Provide end-to-end visibility for event flow from platform ingestion through downstream integration.
* Develop runbooks, troubleshooting guides, validation procedures, and operational documentation.
Required Qualifications
Technical Skills:
* Hands-on experience with Dynatrace for infrastructure, application, synthetic, service, and dependency monitoring.
* Hands-on experience with Splunk, including Search Processing Language (SPL), dashboards, alerts, and field extraction.
* Understanding of OpenShift or Kubernetes monitoring concepts.
* Experience monitoring Linux-based services, processes, logs, file systems, and resource utilization.
* Experience defining monitoring coverage for distributed platforms and integration services.
* Experience with REST APIs, JSON, webhooks, and system-to-system integrations.
* Experience with scripting or automation using Python, shell scripting, or PowerShell.
* Ability to troubleshoot issues across application, infrastructure, platform, and integration layers.
* Strong documentation skills for runbooks, monitoring standards, and support procedures.
Preferred Qualifications
* Experience with IBM Cloud Pak for Watson AIOps.
* Experience with IBM Netcool OMNIbus, including Object Server, probes, and gateways.
* Experience with Netcool Impact, including event enrichment and policy logic.
* Experience with Prometheus and Grafana.
* Experience monitoring Kafka, including consumer lag, topic health, and broker health.
* Experience with ServiceNow event, incident, or integration workflows.
* Experience monitoring .NET applications and services.
* Experience with distributed tracing and OpenTelemetry.
* Experience with Git, CI/CD pipelines, and monitoring-as-code or configuration-as-code.
* Familiarity with production change management and regulated enterprise environments.
Everforth Apex is a world-class IT services company that serves thousands of clients across the globe. When you join Everforth Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package.
Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing in Talent Satisfaction in the United States and Great Place to Work in the United Kingdom and Mexico.
Everforth Apex uses a virtual recruiter as part of the application process. By applying for this job, you agree to receive calls, AI-generated calls, text messages, or emails from Everforth Apex, its affiliates, and contracted partners. Frequency varies for text messages. Message and data rates may apply. Carriers are not liable for delayed or undelivered messages. You can reply STOP to cancel and HELP for help.
You can access our privacy policy at
Everforth Apex Benefits Overview:
Everforth Apex offers a range of supplemental benefits, including medical, dental, vision, life, disability, and other insurance plans that offer an optional layer of financial protection. We offer an ESPP (employee stock purchase program) and a 401(k) program that typically allows you to contribute within 30 days of starting, with a company match after 12 months of tenure. Everforth Apex also offers an HSA (Health Savings Account on the HDHP plan), a SupportLinc Employee Assistance Program (EAP) with up to 8 free counseling sessions, a corporate discount savings program and…
Position Requirements
10+ years of work experience