Site Reliability Engineer III: Neovest
Listed on 2026-06-03
IT/Tech
Systems Engineer, SRE/Site Reliability, Cloud Computing, IT Support
Job Description
There’s nothing more exciting than being at the center of a rapidly growing field in technology and applying your skills to drive innovation and modernize the world's most complex and mission-critical systems.
As a Site Reliability Engineer III at JPMorgan Chase within Neovest, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end‑to‑end operations, availability, reliability, and scalability of your application or platform.
You will blend release management discipline with hands‑on engineering to automate deployment and monitoring workflows across cloud and server environments.
Job responsibilities
- Lead and coordinate release planning by maintaining a release calendar, communicating dependencies, and managing release windows across teams
- Design, build, and improve automated continuous integration and continuous delivery pipelines, including gating, rollback, and promotion strategies
- Implement deployment strategies (such as blue/green, canary, and phased rollouts) and coordinate post‑deploy validation, hotfixes, and recovery actions
- Engineer reliability improvements by defining and operationalizing service level indicators and service level objectives, alerts, and error budgets
- Develop and maintain infrastructure, configuration, and network as code to support secure, repeatable environments
- Automate repetitive operational and release tasks using scripting and software engineering best practices
- Troubleshoot complex production issues across applications, cloud infrastructure, and networking layers, partnering with domain experts as needed
- Improve observability through telemetry, dashboards, and actionable alerting using industry‑standard monitoring and logging tools
- Promote site reliability engineering best practices by sharing knowledge, reviewing designs, and contributing to operational readiness across the team
Qualifications, capabilities, and skills
- Formal training or certification in software engineering concepts and 3+ years of applied experience
- 3+ years of experience supporting production services with a focus on availability, reliability, and operational excellence
- Proficiency in at least one programming or scripting language (for example, Python, C#, or Java) used for automation and tooling
- Hands‑on experience designing or operating continuous integration and continuous delivery pipelines, including source control workflows and release controls
- Experience implementing and using observability practices (white‑box and black‑box monitoring, telemetry, and service level objective alerting)
- Experience working in multiple cloud platforms and Windows/Linux server environments
- Familiarity with containers and container orchestration (for example, Kubernetes, Azure Kubernetes Service, or Docker)
- Ability to troubleshoot common networking technologies and issues in distributed environments
- Strong communication skills, with the ability to collaborate across engineering, operations, and non‑technical stakeholders with limited supervision
Preferred qualifications, capabilities, and skills
- Strong understanding of release engineering practices, GitOps, branching strategies, and change management discipline
- Experience with incident management, on‑call operations, capacity planning, and reliability engineering practices
- Experience authoring pipeline configuration (for example, YAML) and infrastructure templates (for example, Terraform, ARM, or Bicep)
- Demonstrated ability to assess risk, remain calm under pressure, and drive clear decision‑making
- Azure experience preferred; Google Cloud Platform exposure is a plus
We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission‑based pay and/or discretionary incentive compensation,…