DevOps, SRE & Infrastructure Architect C2C Sunnyvale, CA
Job in
Sunnyvale, Santa Clara County, California, 94087, USA
Listed on 2026-06-26
Listing for:
Tech Mirrors
Full Time
position Listed on 2026-06-26
Job specializations:
-
IT/Tech
Cloud Computing: Infrastructure & Operations, SRE/Site Reliability
Job Description & How to Apply Below
Job Title:
Principal Dev Ops, SRE & Application Infrastructure Architect
Location:
Sunnyvale, CA
Duration:
Contract
Need 12+ Years Candidate
- Infrastructure & Git Ops, Kubernetes & Containerization:
Design, deploy, and optimize secure Docker/Kubernetes (AKS) environments using Helm and ArgoCD. - Networking & Edge:
Manage cloud Ingress, Load Balancers, and end-to-end certificate management (SSL/mTLS). - CI/CD & Automation:
Automate tasks with Shell/Python; build Git Ops pipelines and manage schema migrations via Flyway. - SRE & Observability Reliability:
Own end-to-end production availability and performance; define and track SLAs/SLOs/SLIs and error budgets. - Telemetry:
Build observability stacks using Open Telemetry, Prometheus, Grafana, and Splunk. - Incident Management:
Lead P0/P1 incident response, deep-dive distributed system debugging, RCAs, and on-call rotations. - Application & Database Operations:
Polyglot DB Management — design and operate high-availability cloud database infrastructure (Oracle, Postgres, Cassandra, Couchbase, Redis, Cockroach DB). - Data Replication & DR:
Manage Oracle Golden Gate replication, patching, purging, and execute robust P0 Disaster Recovery/failover strategies. - App Support:
Perform deep-dive troubleshooting in Java application layers, gRPC, REST/HTTP/JSON, and caching/messaging systems (SNS/SQS, Elasticsearch, Solr).
- Orchestration & Dev Ops:
Kubernetes (AKS), Docker, Helm, ArgoCD, Flyway. - Scripting & OS:
Strong Linux/Unix internals, Shell scripting, and Python. - Observability:
Open Telemetry, Prometheus, Grafana, Splunk. - Database & Replication: SQL/PL-SQL (procedures, triggers, tuning), Golden Gate, No
SQL (Cassandra, Couchbase, Redis), and transactional databases (Oracle, Postgres). - App Troubleshooting:
Java application debugging, gRPC, REST, and cloud-native caching/queues.
- Exposure to Ali Cloud or multi-cloud environments.
- Security operations such as automated password rotations and IAM privilege management.
- Strong cross-functional collaboration with security, network, and application teams.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×