Principal Software Engineer - Performance Engineering
Listed on 2026-06-02
IT/Tech
Systems Engineer
JOB DESCRIPTION
If you are looking for a game-changing career, working for one of the world's leading financial institutions, you've come to the right place.
As a Principal Software Engineer at JPMorgan Chase within the Commercial and Investment Bank Payment Performance Engineering team, you will serve as the technical authority and execution lead for performance engineering across our platforms. You will be hands-on and strategic: defining non-functional requirements (NFRs) and service level objectives (SLOs), embedding performance into CI/CD, scaling automated performance testing, and driving resiliency, capacity, and user experience outcomes across distributed systems. You will set the bar for performance-first design, partner closely with architecture, SRE, and application teams, and ensure performance risks are identified and eliminated before production.
RESPONSIBILITIES
- Define and institutionalize application- and endpoint-level NFRs and SLOs, including p95/p99 latency, throughput, ramp profiles, and error budgets
- Drive proactive performance engineering through early bottleneck detection, architectural guidance, and capacity modeling
- Serve as the final technical authority for performance sign-offs across platform releases
- Design, build, and maintain automated test suites for load, stress, soak, spike, and capacity scenarios
- Virtualize partner dependencies and inject faults to validate components when upstream systems are unavailable
- Expand fully automated, environment-aware performance test execution (on-commit/overnight) with health checks and actionable sanity tests
- Build dashboards and alerts correlating performance test signals with production telemetry against defined SLOs
- Provide actionable reporting on SLO variance, drift, and per-endpoint hotspots using RUM, synthetic, and server-side metrics
- Embed performance gates into CI/CD pipelines (pre-deploy smoke, post-deploy validation, regression detection with auto-fail/notify)
- Lead chaos and resiliency experiments (CPU, memory, network, latency, dependency failures) and validate autoscaling under extreme load
- Apply AI/LLMs to workload and scenario generation, metrics interpretation, and automated reporting with measurable success guardrails
QUALIFICATIONS
- 15+ years of overall engineering experience, with 10+ years in performance engineering for high-traffic distributed systems (web, APIs, microservices, event-driven, data-centric)
- Hands-on software engineering experience with Java/Spring Boot and Kubernetes (self-managed and EKS)
- Deep expertise in workload modeling, queuing theory, and statistical analysis of latency/throughput; fluent with percentile-based SLOs and error budgets
- Proficiency with load and protocol testing tools such as JMeter and BlazeMeter
- Scripting/orchestration skills in Java, Python, or TypeScript for performance automation and execution control
- Experience with service virtualization and fault injection (e.g., WireMock, Mountebank, Toxiproxy), including record-replay and dynamic templating
- Strong observability/APM capabilities using Dynatrace and/or OpenTelemetry, plus RUM and synthetic monitoring approaches
- Experience building dashboards and analysis workflows in Kibana and/or Grafana to drive actionable decisions
- Strong CI/CD and DevOps experience (e.g., Jenkins, GitLab, GitHub Actions) including repeatable sign-offs, artifact/version alignment, and environment promotion
- Infrastructure-as-code and platform delivery experience (e.g., Terraform, CloudFormation) including autoscaling strategies
- Ability to partner across architecture, SRE, and application teams to coach standards adoption and drive release readiness
- Experience with data-platform performance optimization (e.g., Oracle tuning, JDBC pool tuning, Kafka throughput/partitioning, caching strategies)
- Strong systems and cloud performance background (Linux tooling, JVM tuning, containers, AWS primitives such as compute, ALB/NLB, EKS, networking)
- Experience with k6 and other modern cloud-native load testing frameworks
- Familiarity with service mesh technologies (Istio/Linkerd) and traffic-control patterns (rate limiting, backpressure)
- Practical application of LLMs for test generation, anomaly detection, or automated reporting in engineering workflows
- Experience operating in financial-services scale, low-latency systems, and/or regulated environments
- Knowledge of advanced performance tooling (e.g., perf, eBPF) and production-grade troubleshooting practices
This position is subject to Section 19 of the Federal Deposit Insurance Act. As such, an employment offer for this position is contingent on JPMorgan Chase's review of criminal conviction history, including pretrial diversions or program entries.
ABOUT US
JPMorgan Chase, one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world's most prominent corporate, institutional and…