Site Reliability Engineer Job, Miami area, Florida, USA, IT/Tech

Position: Staff Site Reliability Engineer

Core Scientific is a leading provider of infrastructure for high-performance compute in North America. Our mission is to accelerate digital innovation by scaling high-value compute rapidly, efficiently, and responsibly.

We transform energy into high-value compute with unmatched efficiency. Core Scientific is a $5 billion publicly traded company (NASDAQ: CORZ).

We power AI, HPC, and other next-generation data center workloads demanding exceptional computing power, in addition to our digital asset mining operations.

We own and operate nine data centers in seven states, housing advanced infrastructure for our customers.

What sets us apart? We have an entrepreneurial culture, a "can-do" and collaborative attitude, and we own and control our infrastructure. These strategic advantages enable us to maintain operational excellence, increase efficiency, and rapidly deploy cutting-edge innovations developed by our team of experts.

Join us and accelerate your career alongside our groundbreaking journey. We seek smart, creative, and collaborative professionals who thrive in a fast-paced, results-driven environment. Ready to be part of something exceptional? Apply today and make an impact at Core Scientific.

Title

Staff Site Reliability Engineer

Reports To

Site Reliability Engineering Manager

The Job

We are seeking a capable, motivated generalist who thrives in a change-controlled, compliant environment and enjoys working across hybrid cloud and on-premises systems. This role partners closely with application architecture and peer engineering teams while contributing hands-on across platform engineering, DevOps, and SRE.

This position is expected to take ownership of complex technical initiatives and see them through to completion—balancing hands-on implementation with effective delegation and cross-team coordination.

Responsibilities

Lead end-to-end delivery of complex technical initiatives, from problem definition and design through implementation, rollout, and operation
Own the design, implementation, and reliability of systems across hybrid cloud and on-premises environments
Take accountability for technical outcomes, including system reliability, scalability, and performance in regulated, change-controlled environments
Drive execution by coordinating work across engineers and teams, delegating effectively while remaining hands-on where needed
Partner with application architecture and peer teams to shape system design and influence technical decisions
Build, deploy, and operate infrastructure and applications using automation and infrastructure as code
Improve observability, monitoring, and incident response practices
Establish and promote best practices for reliability, security, and operational excellence across teams
Mentor engineers and contribute to raising the technical bar across the organization
Foster open, respectful, and professional communication within the team as well as with co-workers, teammates, and leaders across the organization
Perform other duties as assigned

Qualifications

Bachelor's degree in Computer Science or a related field, 7+ years of experience, or equivalent demonstrated impact in SRE, DevOps, or Infrastructure Engineering
Broad technical experience across infrastructure and distributed systems, with the ability to design effective solutions, apply appropriate patterns, and anticipate scaling, reliability, and operational challenges
Strong understanding of distributed systems behavior, including application runtime characteristics, service-to-service communication, networking, and failure modes in production environments
Experience operating in regulated, compliant, or change-controlled environments
Experience working in hybrid environments (AWS preferred; on-premises infrastructure required)
Strong experience with Infrastructure as Code, configuration management, and orchestration tools (Terraform, Helm, Kustomize, Ansible)
Experience with Kubernetes and virtualization technologies
Experience with observability platforms (e.g., Datadog), including building monitoring and alerting integrations
Experience with build and release systems (e.g., GitHub Actions, Makefiles, Python tooling)

Location

To be considered for the role you…