Applied
AI is a pioneering AI technology company headquartered in Abu Dhabi, committed to innovation and excellence in artificial intelligence solutions in regulated industries such as healthcare, insurance, government, and financial services.
Opus is the world's first Knowledge Work AI platform
. Built by Applied
AI to pioneer Supervised Automation
, a human-in-the-loop model where AI handles repetitive, structured tasks while human experts provide crucial oversight at defined intervals.
The platform uses its proprietary Large Work Model to generate and orchestrate outcome‑based workflows, enabling a dramatic reduction in the cost of knowledge work and allowing human talent to focus on high‑value, creative, and judgement‑intensive activities.
Position OverviewAs an Infrastructure Engineer, you will play a key role in delivering and maintaining the Opus Infrastructure component of our platform. This includes contributing to feature design, development, deployment, and testing.
You’ll work hands‑on with technologies like AWS, Azure, Crossplane, Terraform, Kubernetes, Git Ops practices, IAM, and microservices, in an environment that values scalability, performance, and collaboration.
The ideal candidate brings solid engineering fundamentals, startup energy, and thrives in a fast‑paced, growth‑stage tech company with ambitious goals and tight feedback loops.
Key Responsibilities Infrastructure Deployment & Automation- Design, implement, and maintain cloud infrastructure using Terraform, Helm, Kustomize, and FluxCD (Git Ops).
- Manage and optimize multi‑cloud environments (AWS, Azure) and cross‑account deployments.
- Build and maintain CI/CD pipelines ensuring smooth and automated deployments across environments.
- Migrate and manage Docker images from public registries to internal/private repositories.
- Deploy and manage microservices using Kubernetes (EKS/AKS) clusters.
- Optimize cluster performance, resource utilization, and workload distribution for scalability and cost efficiency.
- Harden networks, load balancers, and API gateways to ensure secure communication between services.
- Manage IAM roles, policies, and service permissions to ensure least‑privilege access.
- Implement and maintain VPC, security groups, and network policies for cross‑environment isolation.
- Establish and maintain monitoring, logging, and tracing systems using tools like Prometheus, Grafana, Cloud Watch, and ELK.
- Proactively identify performance bottlenecks, network issues, and reliability risks.
- Drive continuous optimization to improve system uptime, stability, and resilience.
- Work closely with backend and infrastructure teams to support feature rollouts and operational readiness.
- Contribute to documentation, runbooks, and incident response processes.
- Champion Dev Ops best practices to improve release cycles, infrastructure as code, and automation coverage.
- Bachelor’s degree in Computer Science, Artificial Intelligence, or a related technical field.
- 4–6 years of experience in a relevant engineering or development role, ideally within a tech company or startup.
- Proven experience with AWS (EKS, IAM, VPC), Terraform, Kubernetes, and microservice‑oriented architecture.
- Strong problem‑solving and debugging skills
- Proficiency in Git Ops workflows (FluxCD, ArgoCD)
- Ensure reliability of CI/CD pipelines
- Familiarity with network optimization and security hardening
- Ability to work cross‑functionally in a distributed team
- Comfort operating in fast‑paced, agile environments
- Excellent communication and collaboration skills
- Passion for innovation and learning new technologies
- Opportunity to work with a leading AI technology company.
- Collaborative and innovative work environment.
- Growing, entrepreneurial and forward‑thinking culture.
- Career growth and professional development opportunities.
- Exposure to a thriving ecosystem working from our Abu Dhabi HQ.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).