Senior Dev Ops Engineer
Listed on 2026-05-05
-
IT/Tech
Cybersecurity, Systems Engineer
Overview
Bold Thinking. World Changing. At Sky Water, our ingenuity helps improve lives around the world by manufacturing U.S. made semiconductors that are essential ingredients of modern life. Automotive safety enhancements, life-saving medical devices, consumer electronics and American security require semiconductors. Working in our Minnesota headquarters, Florida, or Texas location - employees join together to improve the world.
Explore what's possible. Joining our U.S.
-based team means contributing to and learning about the commercialization of some of the most exciting technologies the world has ever seen. We are turning "science fiction" into everyday reality through technologies such as superconducting, 3D integrated circuits or computer chips, carbon nanotubes, photonic logic devices, micro electro-mechanical systems and other emerging device topologies. We manufacture products for aerospace and defense, medical, automotive, consumer and industrial markets, to name a few.
Our customers include emerging leaders who rely on our intellectual property security and quality manufacturing services.
Step into the future. Sky Water's values of Integrity, Excellence, Collaboration, Empowerment and Growth Mindset guide us to cultivate an empowered, learning environment. We also invest in developing highly skilled, dedicated employees - and employees who are entering the workforce for the first time, from the military, and a variety of educational backgrounds.
Are you bold thinking? Find your place on our team and help us change the world!
About the teamWe build and operate the platforms that powers our enterprise AI/ML
, data engineering
, and reporting/BI workloads. We run in a regulated Google Cloud environment (FedRAMP High), where reliability, security, and operational rigor are non-negotiable.
We are hiring a seasoned Dev Ops engineer who can join our team and be self-sufficient from day one
-owning infrastructure, CI/CD, observability, and security guardrails that keep our AI + data + reporting systems secure, compliant, and reliable.
You will serve as a hands-on engineer and mentor others: you'll standardize environments, reduce toil, harden delivery pipelines, and improve incident response-while working inside the constraints of a FedRAMP High environment.
What you'll do (responsibilities)- Own production reliability for AI + data platforms
- Operate and continuously improve platform reliability for batch + streaming pipelines, reporting SLAs, and ML workloads.
- Define and run SLOs/SLIs
, alerting standards, and incident response processes (on-call, postmortems, measurable follow-ups). - Build runbooks, dashboards, and automation that reduce MTTR and recurring incidents.
- Design and maintain CI/CD for services, pipelines, infra, and (where applicable) model artifacts.
- Implement safe deployment patterns: progressive delivery, automated rollbacks, change controls, and release governance appropriate for regulated environments.
- Own "golden paths" and templates so engineering teams can ship reliably without reinventing the wheel.
- Design and maintain Terraform modules and IaC standards for repeatable GCP provisioning.
- Operate GCP org/folder/project structures, network patterns, and environment separation (dev/stage/prod) aligned to compliance requirements.
- Establish secure baseline configurations and guardrails (policy-as-code where relevant).
- Implement and operate security controls aligned to FedRAMP High / NIST 800-53 High baseline concepts
: IAM hardening, audit logging, encryption, vulnerability management, secure configuration, incident handling, and continuous monitoring. - Partner with compliance/security stakeholders to support audit readiness through evidence automation
, control mapping, and operational documentation.
- GKE platform operations: cluster lifecycle, upgrades, node pools, workload identity, RBAC, network policy, resource governance.
- Centralized logging/monitoring and audit: alert hygiene,…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).