×
Register Here to Apply for Jobs or Post Jobs. X

Manager, SRE FedRAMP

Job in Chicago, Cook County, Illinois, 60290, USA
Listing for: Cisco Systems
Full Time position
Listed on 2026-01-04
Job specializations:
  • IT/Tech
    SRE/Site Reliability, Cloud Computing
Job Description & How to Apply Below
Position: Manager, SRE FedRAMP-33539

Splunk, a Cisco company, is building a safer and more resilient digital world with an end-to-end full stack platform made for a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Come help organizations be their best, while you reach new heights with a team that has your back.

Meet the Team

The Splunk Observability Cloud team provides full-fidelity monitoring and fixing across infrastructure, applications, and user interfaces, in real-time and at any scale, to help our customers keep their services reliable, innovate faster, and deliver great customer experiences. Infrastructure Software Engineers at Splunk are cloud-native systems engineers who use infrastructure-as-code, microservices, automation, and efficient design to build, operate, and scale our products.

You will lead and manage one of the largest and most sophisticated cloud-scale, Bigdata, and microservices platforms in the world. You will be responsible for managing engineers who operate highly available, scalable, and cost-efficient applications with low operational burden by handling and improving the reliability and resiliency of services and infrastructure. You thrive driving initiatives on automation, infrastructure-as-code, reliability engineering, and getting rid of tedious, manual tasks.

  • Lead a team of super smart engineers who are passionate about large scale distributed systems for Splunk Cloud Observability in FedRAMP environments
  • Manage across the organization to deliver quality products that delight Splunk's passionate users.

    Mentor and grow teams of tight-knit engineers who are building a state-of-the-art, cloud-based environment for massive-scale data processing.
  • Partner with our Talent Acquisition team as we recruit, interview and hire the best engineering talent to join Splunk's growing SRE FedRAMP team!
  • Manage engineers to achieve more than they thought possible. You enjoy managing and driving teams to success and are fulfilled through the success of others.
Your Impact

Manage a team working on reliability projects, including:

  • HA, Business Continuity Planning, disaster recovery, backup/restore, RTO, RPO
  • Chaos engineering
  • Application uptime and performance
  • Capacity management & planning
  • SLIs, SLOs, error budgets, and monitoring dashboards
  • Responsible for deployment and operations of large-scale distributed data stores and streaming services
  • Establishing design patterns for monitoring and benchmarking
  • Establishing and documenting production run books and guidelines for developers
  • Tooling, toil reduction, runbooks & automation to handle production environments
  • Incident management and improving MTTD/MTTR for services
  • Cloud cost optimization-5 sentences) A brief description of the role, also include what the employee would do and what makes this role exciting:
Minimum Qualifications
  • 8+ years of experience in handling large-scale cloud-native microservices platforms.
  • 2+ years of strong hands-on management experience managing teams deploying, handling, and monitoring large-scale Kubernetes clusters in the public cloud specifically AWS or GCP
  • Experience with and leading a team in infrastructure automation and scripting using Python and/or Golang.
  • Experience managing remote teams.
  • Strong hands‑on experience in monitoring tools such as Splunk, Prometheus, Grafana, ELK stack, etc. in order to build observability for large-scale microservices deployments.
  • Experience with deployment, operations, and performance management of one or more of the following large-scale clusters such as Cassandra, Kafka, Elastic Search, Mongo

    DB, Zoo Keeper, Redis, etc.
  • Excellent problem‑solving, triaging, and debugging skills in large-scale distributed systems
Preferred Qualifications
  • Familiarity working with and/or managing in compliance environments such as HIPPA, Gov Cloud, State Government, Federal Government, SOC2 or FedRAMP
  • AWS Solutions Architect certification preferred.
  • Confluent Certified Administrator for Apache Kafka and/or Apache Cassandra Administrator Associate certifications are preferred
  • Experience with Infrastructure-as-Code using Terraform, Cloud Formation,…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary