CloudOps/SRE Engineer Austin
Listed on 2026-02-07
IT/Tech
Cloud Computing, SRE/Site Reliability
Autonomize AI is revolutionizing healthcare by streamlining knowledge workflows with AI. We reduce administrative burdens and elevate outcomes, empowering professionals to focus on what truly matters — improving lives. We're growing fast and looking for bold, driven teammates to join us.
About the Role
We’re looking for a Cloud Ops / Site Reliability Engineer to lead the charge in building a fully automated, secure, and scalable multi-cloud infrastructure for our AI-powered healthcare platform. Your mission: keep our deployments lightning-fast, reliable, and invisible. You’ll own the orchestration of services across AWS, Azure, and GCP, automating everything from infrastructure provisioning to rollbacks, with security and uptime built in.
This is a builder role — ideal for someone who can go deep into CI/CD, lives for IaC, and thinks deployment velocity is just as important as resiliency.
Key Responsibilities
- Multi-Cloud Infra Management: Design and manage highly available, scalable, and secure infrastructure across AWS, Azure, and GCP
- End-to-End Automation: Build deployment workflows using Terraform, Ansible, Helm, ArgoCD, GitHub Actions, or equivalent
- CI/CD at Scale: Own automated delivery pipelines for infrastructure and applications across staging and production
- Reliability Engineering: Define and uphold SLAs/SLOs; own incident management, blameless postmortems, and error budgets
- Security & Compliance: Implement and continuously harden controls for HIPAA, SOC2, and zero-trust environments
- Monitoring & Observability: Deploy and maintain logs, metrics, and alerting systems using Prometheus, Grafana, Datadog, etc.
- Documentation & Process: Create robust runbooks, architectural diagrams, and continuous improvement loops
- Installation and configuration of AI Platform and Solutions at customer deployments
- Support IT and InfoSec discussions and reviews with customers
- Guide the offshore team as necessary and help with automation of deployments
- 5+ years in SRE/Cloud Ops roles with production-grade infrastructure experience
- Expertise in AWS, and solid hands-on experience in Azure and GCP
- Proven track record with Infrastructure as Code (Terraform preferred) and modern deployment frameworks
- Deep CI/CD experience including automated rollbacks, blue/green or canary deployments
- Skilled in Kubernetes, Docker, and container orchestration
- Experience with secure cloud architectures, RBAC, IAM, and secrets management
- Bias for automation — scripting in Python, Bash, or Go
- Culture fit: you take full ownership, run toward complexity, and operate in the final mile
- Prior experience supporting healthtech, life sciences, or other regulated domains
- Implemented policy-as-code tools like OPA/Gatekeeper
- Experience running GPU workloads, ML pipelines, or scalable microservices
- Contributions to open-source DevOps/SRE communities
- Opportunity to make a tangible impact in the healthcare industry
- Ownership, Autonomy & Mastery: the ability to chart your own trajectory in a fast-growing company
- Competitive salary and bonus pay structure
- Comprehensive medical, dental, and vision insurance, premium-free for employees
- Retirement Plans: 401(k)
- Professional Development: Budget for learning and development, particularly in AI or productivity tools
Please submit your resume and a brief cover letter to careers explaining why you are the ideal candidate for this role. We are excited to meet someone who is eager to bring their skills, enthusiasm, and creativity to our team!