Sr. Software Engineer - Cloud SRE & DevOps
Job in
Palo Alto, Santa Clara County, California, 94306, USA
Listed on 2026-05-19
Listing for:
Rockwoods Inc
Full Time
position Listed on 2026-05-19
Job specializations:
-
Software Development
Cloud Engineer - Software, DevOps, AI Engineer (Applied/Software), Software Engineer
Job Description & How to Apply Below
Job Summary
Rockwoods is seeking a Senior Software Engineer to join our Enterprise AI group, serving as the primary engine for internal AI integration. You will architect the fundamental patterns and platforms that streamline AI adoption, working at the nexus of distributed systems and applied AI to enhance company-wide engineering efficiency.
Key Responsibilities- Lead Platform Initiatives: Manage the full lifecycle of major infrastructure projects, ensuring success from initial design to post-deployment.
- Master Kubernetes Ecosystems: Orchestrate complex container life cycles, networking, and multi-tenant clusters at a granular level.
- Optimize Cloud Architecture: Build and refine high-performance distributed services leveraging AWS or GCP.
- Uphold Technical Standards: Standardize engineering quality through rigorous code reviews, CI/CD automation, and robust design patterns.
- Cross-Functional Synergy: Partner with AI, Security, and Product teams to ensure infrastructure aligns with broader corporate goals.
- Technical Mentorship: Act as a force multiplier by coaching peers through complex architectural trade-offs and delivery hurdles.
- Experience: 6+ years in backend engineering with a significant focus on infrastructure.
- Programming Mastery: Proficient in Python or Go
, with a track record of delivering production-level software. - K8s Expertise: Extensive hands‑on experience building and maintaining clusters, not merely utilizing them.
- Systems Design: Demonstrated ability to architect and support distributed production environments.
- Cloud Proficiency: Skilled in AWS/GCP services, including IAM, networking, and storage.
- Modern Tooling: Competency with Terraform (IaC) and modern CI/CD workflows.
- AI Integration: Familiarity with AI gateways, frameworks (like LiteLLM), and building services that interface with agentic AI tools.
- Implementation of observability stacks (Prometheus, Grafana, or Open Telemetry).
- Knowledge of API/AI gateways and authorization frameworks like OPA or ABAC.
- Experience with service mesh or designing internal Platform-as-a-Service (PaaS) solutions.
- Exposure to multi-cloud or hybrid infrastructure environments.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×