SRE/Dev Ops Engineer; Hybrid,Sunnyvale Job Sunnyvale area,California USA,IT/Tech

Position: SRE/Dev Ops Engineer (Hybrid, Sunnyvale)

About the Role

Crowd Strike's engineering organization depends on shared infrastructure platforms that power critical product capabilities. These platforms need dedicated engineering ownership to operate reliably, scale safely, harden for security, and mature into self‑service capabilities that teams can depend on. You will own production infrastructure that spans multiple cloud providers and regions, serving engineering teams across the organization. The work is equal parts platform engineering and operational excellence — building automation, hardening security, establishing governance, and enabling consuming teams to adopt these platforms effectively.

You will also help shape what comes next as the team's scope grows. We're hiring at all seniority levels — scope and compensation adjust accordingly.

What You'll Do

Run production infrastructure – Deploy, upgrade, and maintain platform services across multiple clouds and regions on Kubernetes.
Build and maintain CI/CD pipelines – Make infrastructure changes safe and fast to ship using Git Ops workflows and release automation.
Build control planes – Create the APIs and tooling that make provisioning and scaling repeatable and self‑service.
Own capacity planning – Track usage, forecast growth, right‑size clusters, and keep infrastructure costs in check.
Build observability – Set up metrics, dashboards, and alerts using Prometheus and Grafana; write runbooks that make on‑call clear and actionable.
Own on‑call and incidents – Join the rotation, resolve issues, write post‑mortems, and turn repeat problems into automation.
Automate everything – Deployments, upgrades, certificate rotations, failover; if you do it by hand more than once, automate it.
Driving system reliability by blending software engineering principles with AI‑driven automation, moving from reactive firefighting to proactive, automated operations.
Harden security – Set up auth, encryption, secret rotation, network policies; keep dependencies patched and CVEs resolved.
Own disaster recovery – Build backup strategies, test failover, and ensure platforms can survive infrastructure failures.
Enable other teams – Provide templates, patterns, and direct support to help engineering teams use platforms reliably.
Collaborate across teams – Work with Infrastructure, SRE, and Data Services on shared operational problems.

What You'll Need

8+ years in Dev Ops, SRE, or platform engineering.
Hands‑on experience running stateful distributed systems on Kubernetes in production.
CI/CD experience – Building and owning pipelines using Git Hub Actions, Jenkins, Tekton, or similar tools.
Infrastructure‑as‑code skills – Terraform, Pulumi, or Crossplane; no manual configuration.
Git Ops experience – ArgoCD or Flux for managing infrastructure deployments.
Observability skills – Prometheus, Grafana, and distributed tracing tools such as Jaeger or Open Telemetry.
Database operations – Backup, restore, schema management, and performance tuning for relational and No

SQL databases.
Security mindset – Implement auth, encryption, secret management, and network policies as part of normal work.
Multi‑cloud or multi‑region experience – Manage infrastructure across providers or regions.
Able to work in our Sunnyvale office 2+ days per week.

Bonus Points

Experience running security platforms or telemetry pipelines at large scale.
Experience building internal developer platforms and self‑service tooling.
Familiarity with service mesh tools such as Istio or Linkerd.
Experience running workflow orchestration platforms like Temporal or Argo Workflows.
Experience running distributed tracing or telemetry infrastructure at scale.
Experience with disaster recovery automation for stateful systems.
Background at a cybersecurity or high‑availability SaaS company.
Contributions to open‑source projects or the broader tech community.
Go proficiency – our platforms and services are primarily written in Go.

Benefits of Working at Crowd Strike

Market leader in compensation and equity awards.
Comprehensive physical and mental wellness programs.
Competitive vacation and holidays for recharge.
Paid parental and adoption leaves.
Professional development opportunities for all…

SRE​/Dev Ops Engineer; Hybrid, Sunnyvale

SRE/Dev Ops Engineer; Hybrid, Sunnyvale