Senior Monitoring Engineer

Job in Fort Worth, Tarrant County, Texas, 76102, USA
Listing for: Relha LLC
Full Time position
Listed on 2026-02-21
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, IT Support
Salary/Wage Range or Industry Benchmark: 80,000 - 100,000 USD Yearly
Job Description & How to Apply Below

We’re seeking a Senior Monitoring Engineer to join a high‑performing Monitoring Engineering team in a fast‑paced finance technology organization. You’ll design, develop, and maintain monitoring and observability solutions that keep core applications and infrastructure healthy and visible. In close partnership with application, platform, and development teams, you will implement alerting systems, dashboards, correlations, and automation—driving reliability, reducing MTTR, and elevating operational awareness.

Critical thinking, system analysis, and proactive troubleshooting are essential to success in this role.

Key Responsibilities

Design, Build, and Maintain Monitoring & Observability Solutions

Develop and maintain instrumentation, telemetry, and alerting for the Enterprise Monitoring Center using industry‑leading tools, such as:

  • Grafana
  • OpsRamp
  • AppDynamics
  • Elastic Stack
  • BigPanda
  • AWS CloudWatch

Implement Observability best practices, ensuring comprehensive coverage of metrics, logs, and traces across critical systems.

Integrate and manage OpenTelemetry for distributed tracing and telemetry data collection, enabling end‑to‑end visibility of business‑critical transactions.

Collaborate with application development teams to define and document observability requirements for each project or release.

Participate in complex initiatives, ensuring accurate and actionable monitoring and tracing are in place for every step of business‑critical workflows.

Define and maintain standardized alert payloads per engineering guidelines, ensuring alerts are actionable.

Partner with Level 2 and Level 3 support teams to reflect process changes in monitoring dashboards.

Maintain and optimize thresholds, ensuring seamless escalations via BigPanda as the central alert hub.

Create and maintain intuitive, actionable dashboards for the Enterprise Monitoring Center and other finance teams.

Ensure dashboards are effectively monitored by Level 1 teams, presenting clear, actionable data that reduces MTTR.

System Validation, Documentation & Automation

Develop and maintain automation scripts to enhance monitoring efficiency and improve team quality of life.

Proactively identify process improvements and learning opportunities; drive continuous improvement.

Contribute to the automation of monitoring, alerting, and operational tasks to streamline workflows and improve overall system reliability.

Qualifications

Education

Bachelor’s degree in Computer Science, IT, or a related field.

Experience

Minimum 4 years in a technology organization, with ≥1 year hands‑on engineering experience in monitoring or production operations.

Required Skills

Strong experience developing instrumentation and alerting for large, complex environments.

Expertise in ≥4 of the following:
OpsRamp, Grafana, AppDynamics, Elastic Stack, InfluxDB, BigPanda, and other monitoring solutions.

Hands‑on experience with Observability concepts and frameworks, including metrics, logs, and traces.

Working knowledge of OpenTelemetry for distributed tracing and telemetry data collection.

Experience with dashboard creation, alert management, and tool configuration.

Excellent verbal and written communication—able to present complex technical issues to both technical and non‑technical stakeholders.

Strong problem‑solving and troubleshooting in high‑pressure environments.

Ability to prioritize and manage multiple tasks in a deadline‑driven setting.

Proven collaboration with cross‑functional teams in large, complex IT environments.

Experience designing and implementing scalable, reliable monitoring solutions.

Experience with agile software development methodologies.

Familiarity with problem diagnosis, performance tuning, capacity planning, and configuration management across the stack, with a focus on continuous improvement.

Preferred Qualifications
  • Experience querying, manipulating, and visualizing time‑series data.
  • Familiarity with Infrastructure as Code tools (e.g., Ansible, Terraform).
  • Strong understanding of how to create actionable, digestible visualizations for Level 1 monitoring teams.
  • Working knowledge of REST APIs, JSON, and ServiceNow.
  • Experience with cloud monitoring—particularly AWS or Azure.
Who We Are

OneMain Financial…

Position Requirements
10+ Years work experience