
SRE Manager / SRE Architect

Job in New York, New York County, New York, 10261, USA
Listing for: Qode
Full Time position
Listed on 2026-06-07
Job specializations:
  • IT/Tech
    SRE/Site Reliability, Cloud Computing, IT Project Manager, Systems Engineer
Salary/Wage Range or Industry Benchmark: 140,000 - 180,000 USD yearly
Job Description & How to Apply Below
Position: SRE Manager / SRE Architect
Location: New York

Job Description – SRE Manager / SRE Architect (Hands-on)

Location: New York City, NY / Fort Mill, SC (Hybrid)

Employment Type: Full-Time / Contract

Industry: Financial Services

Position Overview

We are seeking a highly experienced and hands‑on Site Reliability Engineering (SRE) Manager / SRE Architect to lead reliability, availability, performance, and release management initiatives across enterprise‑scale applications and platforms. This role requires a strong blend of SRE, DevOps, Release Management, Cloud Engineering, Automation, and Production Operations expertise.

The ideal candidate will be deeply involved in designing and implementing reliability strategies, driving release governance, improving deployment processes, and ensuring operational excellence across cloud‑native environments.

LaunchDarkly experience is highly preferred but not mandatory.

Key Responsibilities

Site Reliability Engineering (SRE)
  • Design and implement SRE best practices focused on reliability, scalability, performance, and availability.
  • Define and monitor SLIs, SLOs, and error budgets across critical applications and services.
  • Drive proactive monitoring, alerting, observability, and incident management processes.
  • Lead root cause analysis (RCA) efforts and implement preventive measures.
  • Improve system resiliency through automation, self‑healing capabilities, and operational excellence.
  • Establish reliability standards across distributed systems and cloud platforms.
Release Management
  • Own and drive end‑to‑end release management processes across multiple environments.
  • Coordinate application releases across development, QA, UAT, staging, and production environments.
  • Develop release governance, release calendars, deployment strategies, rollback procedures, and change management processes.
  • Partner with development, QA, infrastructure, and business teams to ensure smooth production deployments.
  • Identify and mitigate release risks while minimizing downtime and business impact.
  • Implement deployment automation and continuous delivery best practices.
DevOps & Automation
  • Design and maintain CI/CD pipelines using modern DevOps tools.
  • Automate infrastructure provisioning, deployment, monitoring, and operational workflows.
  • Drive Infrastructure as Code (IaC) adoption using Terraform or similar technologies.
  • Support cloud‑native architectures and containerized application deployments.
  • Partner with engineering teams to improve developer productivity and deployment velocity.
Cloud & Platform Engineering
  • Manage and optimize cloud infrastructure on AWS and/or Azure.
  • Support Kubernetes, container orchestration, and cloud‑native application platforms.
  • Ensure platform scalability, security, compliance, and operational readiness.
  • Drive platform modernization initiatives and operational transformation efforts.
Required Skills & Experience

Core SRE Skills
  • 15+ years of IT experience with a strong focus on SRE, DevOps, Platform Engineering, or Production Support.
  • Extensive hands‑on experience implementing SRE practices in enterprise environments.
  • Strong understanding of:
    • SLI/SLO/Error Budgets
    • Incident Management
    • Problem Management
    • Capacity Planning
    • Reliability Engineering
    • Observability & Monitoring
Release Management
  • Proven experience managing large‑scale production releases.
  • Strong expertise in:
    • Release Planning
    • Release Governance
    • Change Management
    • Deployment Automation
    • Rollback Strategies
    • Production Readiness Reviews
DevOps & Cloud
  • Hands‑on experience with:
    • AWS and/or Azure
    • Kubernetes (EKS, AKS, OpenShift preferred)
    • Docker
    • Terraform
    • GitHub Actions, Jenkins, Azure DevOps, GitLab CI/CD
  • Experience building and maintaining CI/CD pipelines.
Monitoring & Observability
  • Strong experience with:
    • Dynatrace
    • Datadog
    • Splunk
    • Prometheus
    • Grafana
    • ELK Stack
    • CloudWatch
Scripting & Automation
  • Experience with Python, Bash, PowerShell, or similar scripting languages.
  • Strong automation mindset with a focus on operational efficiency.
Nice to Have
  • LaunchDarkly end‑to‑end implementation experience.
  • Feature flag management and progressive delivery strategies.
  • Financial Services, Banking, or Wealth Management domain experience.
  • Experience leading SRE or DevOps transformation initiatives.
  • Cloud certifications (AWS, Azure, Kubernetes).
Preferred Candidate Profile
  • Strong hands‑on SRE leader, not just a people manager.
  • Deep expertise in Release Management and Production Support.
  • Proven background in DevOps, Cloud Engineering, and Platform Reliability.
  • Ability to work with development, infrastructure, security, and business teams.
Keywords

SRE, Site Reliability Engineering, Release Management, DevOps, Terraform, AWS, Azure, Kubernetes, Dynatrace, CI/CD, LaunchDarkly, Production Support, Incident Management, Reliability Engineering, Observability, Platform Engineering, Infrastructure Automation.
