AI Site Reliability Engineer
Listed on 2025-12-20
-
IT/Tech
Systems Engineer, AI Engineer, Cloud Computing, Cybersecurity
Job Description
- Security Vulnerability & Findings Automation
- Deploy AI agents to identify ownership and prioritize remediation, eliminating manual triage and saving 200+ hours per SE team annually. - Gauntlet Platform Enablement
- Expand adoption of the Gauntlet agentic AI platform to enable low-code/no-code agent development across technical teams. - Fin Ops Optimization Across Clouds
- Implement AI agents to monitor cloud usage and spending, delivering real-time cost‑saving recommendations with a target of $9M in FY26 savings. - SRE Incident Response Agents
- Deploy autonomous agents to correlate observability signals, perform initial triage, and take automated action on low‑risk incidents to reduce MTTI and cognitive load. - Tech Debt Remediation
- Use AI to autonomously refactor legacy codebases, addressing up to 80% of flagged tech debt and reducing QA/infrastructure overhead by 35% in pilot environments.
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances.
If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy:
- Bachelor's Degree
- Agentic AI Development:
Experience building and deploying intelligent agents using platforms like Gauntlet, N8N, or similar low-code/no-code frameworks. - Systems Engineering & Dev Ops Expertise:
Deep understanding of modern infrastructure, CI/CD pipelines, observability, and automation tooling. - Security Automation:
Familiarity with vulnerability management workflows and AI-driven prioritization and ownership assignment. - Cloud Fin Ops Acumen:
Ability to analyze multi-cloud environments and implement AI-based cost optimization strategies. - Incident Response & SRE Practices:
Knowledge of signal correlation, automated triage, and incident mitigation using AI agents. - Tech Debt Remediation:
Proficiency in codebase analysis, refactoring, and optimization—especially in dataflow-heavy environments like Google Dataflow. - Cross-Functional Collaboration:
Strong communication skills to work across engineering, security, and finance teams. - Innovation Mindset:
Demonstrated ability to operate autonomously, experiment rapidly, and deliver high-impact outcomes in ambiguous, future-focused environments.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).