×
Register Here to Apply for Jobs or Post Jobs. X

Principal AI Site Reliability Engineer, EI Production Services

Job in Roanoke, Denton County, Texas, 76299, USA
Listing for: Fidelity Investments
Full Time position
Listed on 2026-04-29
Job specializations:
  • IT/Tech
    SRE/Site Reliability, Cloud Computing, Systems Engineer
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below

Principal AI Site Reliability Engineer, EI Production Services

Note:

Fidelity will not provide immigration sponsorship for this position.

The Role

The EI Production Services organization at Fidelity is seeking a strategic and proactive Principal AI Site Reliability Engineer (SRE). In this role, you will drive operational excellence, observability, and intelligent automation for mission‑critical contact center applications supporting Wealth and Workplace Investing business units. You will lead efforts to reduce manual toil, enhance associate experience, and improve system reliability by leveraging AI‑driven automation and industry best practices.

Your work will transform the support model for critical contact center applications, reducing downtime and improving associate productivity by enabling faster triage, improved resiliency, and a superior experience for associates and customers.

The Expertise And Skills You Bring
  • Lead initiatives to advance observability, automation, and operational efficiency for critical associate‑facing applications.
  • Drive proactive monitoring and AI‑powered telemetry to minimize reactive incident response and accelerate resolution.
  • Collaborate with engineering and business leaders to prioritize and resolve issues impacting associate experience.
  • Implement automation and self‑service capabilities to reduce manual intervention and improve reliability.
  • Establish and track SLIs/SLOs to measure and optimize system performance.
  • Communicate progress, outcomes, and technical concepts clearly to senior leadership and stakeholders.
  • 10+ years in technology operations, systems engineering, or production support leadership.
  • Proven ability to deliver complex improvement initiatives in large‑scale, high‑availability environments.
  • Deep expertise in IT Service Management (ITSM), incident/problem management, and operational process optimization.
  • Advanced knowledge of observability and monitoring tools (OTEL, Splunk, Data Dog, Prometheus, Grafana).
  • Experience leveraging AI and automation to drive efficiency and reliability.
  • Proficiency in scripting and automation (Python, Bash, Power Shell, or similar).
  • Strong understanding of On‑Prem and Public Cloud (AWS/Azure/GCP) environments.
  • Familiarity with networking, load balancing, and security fundamentals.
  • Agile and Dev Ops mindset with experience in CI/CD and operational automation.
  • Exceptional communication, collaboration, and stakeholder management skills.
  • Data‑driven approach to problem‑solving and progress tracking.
  • Leadership excellence: ability to inspire, mentor, and guide teams toward operational excellence.
  • Optional certifications: ITIL, AWS, SRE‑related credentials.
Certifications

Category:
Information Technology

Location

Most roles at Fidelity are hybrid, requiring associates to work onsite every other week (all business days, M‑F) in a Fidelity office. This does not apply to remote or fully onsite roles.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary