Site Reliability Engineer II
Listed on 2026-06-01
-
IT/Tech
Cloud Computing, SRE/Site Reliability
Hybrid role
, 2 days on site. Role is located in NYC with alternative location Chicago, IL. We are looking for local candidates only. Working days:
Tuesday‑Saturday.
Working hours:
9am‑5pm EST.
Site Reliability Engineer
II (Tuesday‑Saturday). CME Group is seeking an SREII to build, operate, and scale systems in our Markets portfolio. Markets SREs work on products and applications related to CME’s Globex trading platform. Our systems deliver a combination of low‑latency performance and rock‑solid reliability to seamlessly handle the world’s busiest trading days. The successful candidate will work alongside senior engineers to learn how we observe, monitor, automate, and improve Production service reliability.
As we evolve our operations, we are increasingly emphasizing the integration of Artificial Intelligence (AI) and Machine Learning (ML) to drive smarter, more predictive reliability and reduce operational toil.
- Work alongside product teams and senior engineers to assist with building out observability, monitoring, and alerting for key services.
- Implement AI‑driven reliability solutions, including anomaly detection, predictive alerting, and root cause analysis in production environments.
- Collaborate with engineers and product teams to ensure requirements are understood, planned carefully, and implemented safely.
- Participate in on‑call rotation and assist in incident response under guidance from senior engineers.
- Write scripts and tools to reduce toil and improve velocity, including building or integrating intelligent auto‑remediation and capacity forecasting systems.
- Leverage LLMs and Generative AI to enhance incident management, automate runbooks, and streamline log analysis.
- Contribute to disaster recovery (DR) and systems resiliency testing & improvements.
- Support the migration of markets applications to Google Cloud Platform (GCP).
- Collaborate with cross‑functional teams to improve system performance and operational efficiency.
- A keen interest in SRE, automation, and intelligent operations (AIOps).
- Experience with Linux‑based systems.
- Programming and scripting skills (Python, Bash, etc.).
- Strong problem‑solving and analytical abilities.
- Excellent communication and teamwork skills.
- Eagerness to learn and adapt in a fast‑paced trading environment.
- AI/ML for Operations:
Demonstrated hands‑on experience applying AI/ML techniques to improve operational efficiency, reliability, or observability. - AIOps Platforms:
Experience using platforms such as Dynatrace, New Relic, Moogsoft, Big Panda, or integrating open‑source tools (e.g., Prometheus with ML models). - Generative AI Tooling:
Experience with LLMs for operations, incident management, or log analysis (e.g., using Lang Chain, Llama Index, or tools like Pager Duty AIOps). - Cloud Platforms:
Experience with Cloud‑based platforms—Google Cloud Platform (GCP), GCE, and/or GKE is a strong bonus. - Traditional Observability:
Experience with metrics & monitoring tools like Open Telemetry, Splunk, Prometheus, and Grafana. - Systems Architecture:
Experience with Kubernetes and knowledge of working with distributed systems. - Core Concepts:
Basic knowledge of networking (HTTP/TCP/UDP/IP) and message‑oriented middleware. - Industry & Process:
Experience in financial markets and working in an Agile environment.
- Competitive compensation and benefits package. Salary ranges:
Chicago: $93,900–$156,500;
New York/New Jersey: $103,200–$172,000. - Annual target bonus opportunity for all employees.
- Broad‑based equity program for employees.
- Comprehensive health coverage, retirement package (401(k) and active pension plan), paid time off, and mental health benefit.
- Education reimbursement provisions.
CME Group is an equal‑opportunity employer. We consider all potential employees without regard to any protected characteristic. We embrace the unique experiences and skills of our employees to ensure that everyone’s perspectives are acknowledged and valued.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).