Senior DevOps Engineer
Listed on 2026-05-17
-
IT/Tech
Cloud Computing, Systems Engineer, SRE/Site Reliability
Who We Are
Angel is the home of stories that amplify light. Through its platform, thousands of “Angel” investors choose which titles will be created, funded, and distributed. Angel allows creators and audiences to form passionate communities around their creative projects, making the story behind the story as important as the final project itself. Some of the studio's key projects— The Sound of Freedom and Dry Bar Comedy—have earned billions of views around the world.
Learn more at
We’re looking for passionate team members who want to build world-class products that will reshape media over the coming decades. Join us and be part of stories that amplify light.
Summary / ObjectiveAngel Studios is seeking a skilled and experienced Senior Dev Ops Engineer to join the Platform team. This role is integral to ensuring our systems are robust, scalable, and reliable. You will play a key role in maintaining the reliability, performance, and scalability of our systems. You will work closely with our development teams to automate processes, deploy infrastructure, and manage cloud environments using AWS.
Key Responsibilities- Dev Ops Practices:
Automate infrastructure provisioning, configuration, and deployment processes using tools such as Terraform, or similar. Collaborate with development teams to integrate CI/CD pipelines and streamline code deployment. Leverage AI-powered tools to improve deployment workflows, incident analysis, and operational efficiency. - AWS Cloud Management & Optimization:
Design, deploy, and manage scalable and secure AWS infrastructure. Optimize AWS resource usage and cost through effective monitoring and management. Implement security best practices for cloud-based environments. Evaluate and implement AI-driven cloud optimization or observability tooling where appropriate. - System Reliability & Performance:
Implement and maintain monitoring, logging, and alerting systems to ensure high availability and performance of applications. Identify and resolve performance bottlenecks and reliability issues in production environments. Contribute to and execute incident response and disaster recovery plans. Identify opportunities to automate repetitive operational tasks using scripting or AI agents. - Collaboration & Communication:
Work closely with software engineers to ensure seamless integration of new features and services. Participate in on-call rotations and provide support for incident management. Document processes, configurations, and procedures to ensure knowledge sharing and continuity.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).