Enterprise Monitoring Engineer
Job in
Springfield, Fairfax County, Virginia, 22161, USA
Listed on 2026-03-03
Listing for:
SAIC
Full Time
Job specializations:
IT/Tech
Systems Engineer, IT Support, Cybersecurity, Cloud Computing
Job Description & How to Apply Below
Location: SPRINGFIELD, VA, US
Category: Information Technology
Subcategory: IT Systems Engineer
Schedule: Full-time
Shift: Day Job
Travel: No
Minimum Clearance Required: Secret
Clearance Level Must Be Able to Obtain: Top Secret
Potential for Remote Work: No
Description
The Enterprise Monitoring Engineer in Springfield, VA is a senior-level technical expert accountable for advanced troubleshooting, performance analysis, and optimization of enterprise monitoring platforms. This position is responsible for the design, implementation, and ongoing enhancement of observability solutions in hybrid environments, including on-premises, cloud, and virtual infrastructure. The engineer serves as the final escalation point for complex monitoring issues, collaborates with other teams to ensure system reliability, and promotes best practices in observability.
Key Responsibilities:
- Serve as the Tier 3 escalation point for issues related to any of the monitoring/observability platforms and tools.
- Lead root cause analysis (RCA) for major incidents and recurring performance issues.
- Maintain, configure, and optimize monitoring tool deployments across cloud (e.g., AWS, Azure), on-premises, and VMware environments.
- Design and implement custom dashboards, synthetic monitoring, and service-level objectives (SLOs).
- Develop and maintain alerting strategies that reduce noise and ensure actionable notifications.
- Work closely with application, infrastructure, DevOps, and security teams to define monitoring requirements and integrate observability into CI/CD pipelines.
- Analyze metrics, logs, and traces to ensure end-to-end service visibility and performance optimization.
- Assist in onboarding applications and teams into the observability platform.
- Provide training and mentorship to Tier 1 and Tier 2 support teams.
- Ensure platform resilience, availability, and compliance with internal standards and SLAs.
- Participate in on-call rotations for high-priority incidents as needed.
Required Education & Experience:
- BS and 9 years of experience; MS and 7 years of experience; additional experience may be accepted in lieu of a degree.
- 5+ years of experience in IT infrastructure, application performance monitoring, or site reliability engineering (SRE).
- 2+ years of hands-on experience with enterprise monitoring and observability practices, including configuration of alerts, dashboards, metrics, logs, and event correlation across complex IT environments (e.g., SolarWinds, Grafana, Prometheus, Splunk, CloudWatch, AppDynamics, Dynatrace, Zabbix, VMware Cloud Foundation, or similar enterprise monitoring tools).
- Solid understanding of observability concepts including metrics, logs, traces, and user experience monitoring.
- Experience supporting complex, distributed systems in cloud and hybrid environments.
- Proficient with scripting and automation (e.g., PowerShell, Python, Bash, or Ansible).
- Strong understanding of networking, Linux/Windows systems, containers, and application architectures (microservices, APIs, etc.).
- Experience curating and implementing dashboards.
- Excellent troubleshooting and problem-solving skills.
- Strong written and verbal communication.
- Ability to work independently and collaboratively across teams.
- Customer-focused mindset and attention to detail.
- Continuous learning and adaptability in a fast-paced environment.
- ship.
- Active secret security clearance with the ability to obtain a top secret clearance.
- Dynatrace Associate or Professional Certification.
- Experience with Dynatrace, including OneAgent deployment, Smartscape, PurePath, and Davis AI.
- Experience with integration of Dynatrace with tools such as ServiceNow, Splunk, Jira, or CI/CD pipelines.
- Experience with other observability tools (e.g., Prometheus, Grafana, New Relic, AppDynamics, Splunk, Elastic).
- Familiarity with DevOps practices and Infrastructure-as-Code (e.g., Terraform).
- Understanding of ITIL framework and change management processes.
Headquartered in Reston, Virginia, SAIC has annual revenues of approximately $4.5 billion.