Production Support Analyst
Job in
Charlotte, Mecklenburg County, North Carolina, 28245, USA
Listed on 2026-06-17
Listing for:
hackajob
Full Time position
Job specializations:
- IT/Tech
IT Support, Systems Engineer, SRE/Site Reliability, Cloud Computing: Infrastructure & Operations
Job Description & How to Apply Below
Responsibilities
Platform & Reliability Engineering
- Embed SRE and production engineering principles into Payments Modernization from design through early life support
- Define and validate non-functional requirements (NFRs) covering resilience, scalability, observability, recovery, and operability
- Drive replay, retry, and exception-handling validation for event-driven payment flows
- Lead capacity and performance testing, including volume growth and peak event scenarios (e.g., FedNow, CHIPS, SWIFT)
- Own Permit-to-Operate readiness across environments (NFR Testing)
- Define cutover, shadow support, and early life support models
- Ensure runbooks, support procedures, on-call readiness, and escalation paths are production-grade before go-live
- Partner with Change Assurance to apply risk-based release controls, canary/blue-green strategies, and rollback automation
- Implement end-to-end observability across Kafka, MongoDB, API layers, and downstream payment components
- Define and monitor SLOs, error budgets, and golden signals
- Reduce alert noise through signal design, correlation, and automation
- Analyze early defects and exception patterns (ACK/NACKs, business errors) to drive stabilization
- Design and execute controlled failure testing (chaos engineering) to validate recovery patterns and blast radius
- Lead blameless RCAs, ensuring corrective actions are owned and recurrence is prevented
- Drive continuous service improvement (CSI) initiatives, including automation, resilience uplift, and technical debt reduction
- Experience levels range from junior (3-5 years) to mid-range (10+ years)
- Service management experience, payments knowledge, and technical knowledge of frameworks and platforms such as Spring Boot, MongoDB, Kafka, Kubernetes, and CI/CD pipelines
- Hands-on experience with UNIX and SQL to assist with troubleshooting
- Knowledge of automation using scripting languages such as Python, Bash, Perl, or Ruby
- Excellent analytical and communication skills
- Ability to prioritize and willingness to take ownership
- Problem-solving mindset and solution enabler
- Strong troubleshooting skills
Applicants must be currently authorized to work in the United States on a full-time basis. The Company will not sponsor applicants for work visas.
Position Requirements
5+ years of work experience