×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer III

Job in Plano, Collin County, Texas, 75086, USA
Listing for: JPMorganChase
Full Time position
Listed on 2026-02-24
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below

Job Description

There’s nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.

Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability.

As a Site Reliability Engineer at JPMorgan Chase within the CORPORATE SECTOR, ENTERPRISE TECHNOLOGY team, you will be instrumental in enhancing intelligent and resilient platform operations for a global financial institution. You will drive the integration of traditional support with modern Site Reliability Engineering (SRE) principles, utilizing agentic AI as a core capability to achieve our vision of a proactive, automated, and customer‑centric reliability function.

This role demands a blend of deep technical expertise, a growth‑oriented mindset, and a strong dedication to operational excellence. You will excel in modern infrastructure and observability, promoting AI‑powered incident management, autonomous runbooks, and support intelligence initiatives.

Job Responsibilities
  • Advocate and embody site reliability principles, fostering a culture of excellence and technical influence within your team.
  • Leverage AI tools to enhance operational effectiveness and automate processes, ensuring high‑quality customer service.
  • Spearhead projects aimed at enhancing the reliability and stability of applications and platforms.
  • Utilize data‑driven analytics and AI technologies to automate detection, diagnosis, resolution processes, elevate service levels and drive continuous improvement.
  • Engage stakeholders to establish realistic service level objectives and error budgets, ensuring alignment with customer expectations.
  • Exhibit technical proficiency in one or more domains, proactively addressing technology‑related bottlenecks.
  • Employ AI‑driven solutions to streamline processes and enhance operational efficiency.
  • Participate in troubleshooting during incidents, demonstrating the ability to swiftly identify and resolve issues to prevent financial losses.
  • Act as a culture carrier by documenting learnings and disseminating knowledge through internal forums and communities of practice.
  • Mentor team members, guiding them in the strategic adoption of AI technologies to enhance operational effectiveness and customer service.
Required Qualifications , Capabilities, And Skills
  • Formal training or certification on site reliability engineering concepts and 2+ years applied experience in areas such as resiliency, scalability, performance and security.
  • Proven success in an SRE or Dev Ops role, with knowledge of service level indicators/objectives (SLIs/SLOs), incident management, blameless postmortem analysis, and systems reliability.
  • Expert with observability stacks (e.g., Prometheus, Grafana, Splunk, Open Telemetry), including deep experience correlating telemetry across services and time.
  • Hands‑on skills in coding (at least one high‑level programming language), cloud platforms (AWS or GCP), container orchestration (Kubernetes), infrastructure as code (Terraform), and resilient CI/CD pipelines.
  • Active experience or deep curiosity in applying AI to operations—such as LLM‑based copilots, anomaly detection, automated runbooks, autonomous agents (e.g., CrewAI, Lang Graph), or Retrieval‑Augmented Generation (RAG) workflows for support.
  • A track record of delivering under pressure. You finish what you start, adapt to uncertainty, and thrive in high‑accountability environments.
  • You deconstruct complexity, organize effectively, and drive clarity into ambiguous operational environments. Documentation and design are second nature.
  • Outstanding communication, empathy, and professionalism—especially during incidents. You recognize that great systems serve real people.
Preferred Qualifications , Capabilities, And Skills
  • Experience with operational and compliance rigor in banking, fintech, or similar.
  • Practical use of LLM frameworks (e.g., Lang Chain, Semantic Kernel), AI orchestration tools, vector databases, or custom agents supporting reliability workflows.
  • Expe…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary