Senior DevOps Engineer
Listed on 2025-12-24
-
IT/Tech
Cloud Computing, Systems Engineer
Apiphani is a technology-enabled managed services company dedicated to redefining what it means to support mission-critical enterprise workloads. We’re a small but rapidly growing company, which means there’s lots of room for growth and learning opportunities abound!
Apiphani is dedicated to creating a diverse and inclusive work environment for all as a fundamental component of our business. Diversity and inclusion are the bedrock of creativity and innovation. Without diversity of experience and thought, we would fail to progress as a company and as a team. We embrace the unique experiences, perspective, and cultural background, which only you can bring to the table.
SeniorDev Ops Engineer — Agentic AI Platform
Location: Remote | Full-time | Competitive Compensation
At Apiphani, we’re building the future of infrastructure automation through agentic AI. As a Senior Dev Ops Engineer, you’ll design, deploy, and manage the core infrastructure that powers our platform — ensuring reliability, scalability, and observability across environments.
You’ll work closely with software engineers, QA, and product teams to build a secure, automated, and cloud-native foundation. This role combines hands-on cloud engineering with infrastructure-as-code, CI/CD automation, and AI system enablement.
What You’ll Do- Design, provision, and maintain AWS-based infrastructure for production and development environments
- Build and optimize CI/CD pipelines using Git Hub Actions to support multi-language deployments
- Manage container orchestration and deployments using Kubernetes and Docker
- Implement Infrastructure as Code using Terraform and Terragrunt, with strong versioning and reusability practices
- Configure observability and monitoring for AWS services using Cloud Watch, Cloud Trail, and related tools
- Set up and manage secret management systems (AWS Secrets Manager, Parameter Store, or Vault)
- Oversee database infrastructure, with a focus on Postgre
SQL and SQL migration workflows - Create and maintain infrastructure diagrams and documentation for visibility and auditing
- Implement cost and billing management strategies to optimize AWS spend and efficiency
- Develop automation scripts and utilities using Bash, Python, or Node.js
- Deploy and support Lambda-based serverless workloads
- Collaborate with AI and application teams to ensure infrastructure readiness for AI workloads and data pipelines
- Establish best practices for infrastructure reliability, scalability, and security
- 5+ years of experience in Dev Ops, Cloud Engineering, or Infrastructure roles
- Deep expertise with AWS services (EC2, S3, RDS, Lambda, Cloud Formation, Cloud Watch, etc.)
- Strong background in Kubernetes administration and containerized deployments
- Proven experience writing and managing Infrastructure as Code using Terraform and Terragrunt
- Hands-on experience managing CI/CD workflows with Git Hub Actions
- Proficiency in scripting with Bash, Python, and/or Node.js for automation tasks
- Understanding of Postgre
SQL administration, SQL schema management, and migrations - Experience with observability and monitoring in AWS environments
- Familiarity with cost optimization and billing management for AWS accounts
- Knowledge of secret management solutions (AWS Secrets Manager, Vault, or equivalent)
- Ability to create clear infrastructure documentation and architecture diagrams
- Exposure to AI/ML stacks or interest in supporting agentic AI applications
- Experience with container security and policy enforcement tools
- Familiarity with AI-related compute infrastructure (e.g., GPU instances, Sage Maker, Bedrock)
- Understanding of networking concepts including VPC design, subnets, and load balancing
- Contributions to open-source IaC or Dev Ops tooling
- AWS certification or equivalent cloud credential
You’ll be part of a highly technical, fast-moving team building the foundation for a new generation of agentic AI systems. This is an opportunity to own our infrastructure roadmap — from design and automation to observability and AI enablement. Everything you build will directly power the next phase of intelligent enterprise operations.
$150,000 - $180,000 USD
Company…(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).