Sr. Observability Engineer
Listed on 2025-12-27
-
IT/Tech
Systems Engineer, IT Support, Cloud Computing, Cybersecurity
Senior Observability Engineer
The Sr. Observability Engineer plays a critical role in ensuring the reliability, availability, and performance of enterprise systems by designing and implementing observability solutions. This position supports the IT Incident Management Team by providing actionable insights, telemetry, and automation to reduce Mean Time to Detect (MTTD) and Mean Time to Resolve (MTTR). The role combines deep technical expertise in monitoring, logging, and tracing with a strong understanding of SRE principles.
Key Responsibilities- Designs, implements, and maintains observability tools (e.g., Splunk, Prometheus, Grafana, Open Telemetry).
- Develops dashboards, alerts, and automated workflows to support proactive incident detection.
- Partners with IMT to provide real-time telemetry during major incidents.
- Conducts root cause analysis using logs, metrics, and traces.
- Improves incident response processes through automation and data-driven insights.
- Defines and monitors Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets.
- Collaborates with application and infrastructure teams to embed observability into CI/CD pipelines.
- Identifies gaps in monitoring coverage and implements solutions.
- Drives adoption of observability best practices across engineering teams.
- 3 years of technical experience supporting enterprise systems
- Previous experience with observability tools or site reliability engineering
- 5 years of experience with observability tools (Splunk, ELK, Prometheus, Grafana, Open Telemetry)
- Proficiency in scripting languages (Python, Bash) and automation frameworks
- Certifications in SRE, ITIL, or cloud technologies
- Familiarity with cloud platforms (Azure, AWS, or GCP) and container orchestration (Kubernetes)
- Experience with AIOps or machine learning for anomaly detection
- Incident Management (IMT) – Provide Incident Analysis, Run Book, suggest improvements and collaborate with wider group
- Build & Publish operation KPI’s – Sev1 / Sev2, MTTR, MTTD, Incident Volume, Application performance
- CI/CD Tools – Git Hub, Jenkins, Azure Dev Ops
- University (Degree) Preferred
- Physical Requirements:
Sedentary Work
7IC
Salary Range$108,758 - $147,143
Company OverviewEver Bank, N.A. is a nationwide specialty bank providing high-value products and services to consumer and commercial clients nationwide. As a pioneer in online banking, we offer convenient digital access for clients 24/7, in addition to phone banking services and a network of financial centers. The Company’s commitment is to deliver high-performing, high-yield solutions backed by exceptional service, always giving clients the advantage they expect, to make the most of their money.
VEVRAA Federal Contractor;
Member FDIC
- Medical, dental, vision & HSA/FSA
- 401(k) savings
- Paid holidays & generous PTO
- Additional wellness & voluntary benefits
- Tuition reimbursement
- Commuter Benefits
- Life and Disability Insurance
12/31/25
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).