DevOps Engineer/Cloud & Monitoring Specialist
Listed on 2026-06-10
-
IT/Tech
Cloud Computing, Systems Engineer
About Apexon:
Apexon is a digital-first technology services firm specializing in accelerating business transformation and delivering human-centric digital experiences. We have been meeting customers wherever they are in the digital lifecycle and helping them outperform their competition through speed and innovation.
Apexon brings together distinct core competencies in AI, analytics, app development, cloud, commerce, CX, data, Dev Ops, IoT, mobile, quality engineering and UX, and our deep expertise in BFSI, healthcare, and life sciences to help businesses capitalize on the unlimited opportunities digital offers. Our reputation is built on a comprehensive suite of engineering services, a dedication to solving clients toughest technology problems, and a commitment to continuous improvement.
Backed by Goldman Sachs Asset Management and Everstone Capital, Apexon now has a global presence of 15 offices (and 10 delivery centers) across four continents.
We enable #HumanFirstDIGITAL
We are seeking a highly skilled Dev Ops Engineer with strong expertise in cloud technologies, monitoring tools, and CI/CD pipeline implementation. The ideal candidate will have hands-on experience with AWS and Google Cloud Platform environments, infrastructure automation using Terraform, containerization using Docker, and orchestration with Kubernetes. This role requires a proactive individual capable of managing deployments, ensuring system reliability, and collaborating with cross-functional teams in an Agile environment.
Key Responsibilities
1. Requirements & Agile Collaboration
Collaborate with product owners and business analysts to understand and refine business requirements.
Convert requirements into well-defined user stories using Gherkin format.
Manage and maintain the product backlog and update it after sprints and production releases.
Participate in sprint planning, backlog grooming, and Agile ceremonies.
Identify risks, blockers, and dependencies, and communicate them to stakeholders.
2. CI/CD & Deployment Management
Design, implement, and maintain CI/CD pipelines using Jenkins, Git Hub Actions, and Git Lab CI.
Build multi-branch pipelines using DSL and deploy applications to various environments based on parameter inputs.
Automate infrastructure provisioning using Terraform and Packer templates.
Containerize applications using Docker and deploy them on Kubernetes clusters (EKS/GKE).
Configure and manage Jenkins runners and deployment workflows.
Perform release management and ensure smooth deployment of critical applications.
3. Cloud & Infrastructure Management
Provision, configure, and manage cloud infrastructure across AWS and Google Cloud Platform.
Work with AWS services including EC2, EKS, ELB, VPC, RDS, IAM, S3, Cloud Front, Lambda, Route
53, SNS, SQS, Cloud Watch, and more.
Work with Google Cloud Platform services such as Compute Engine, Kubernetes Engine, App Engine, Cloud Storage, Cloud Functions, and VPC.
Manage networking, security, and system configurations across environments.
Build and maintain Terraform scripts for staging and production environments.
4. Monitoring & Observability
Monitor applications and infrastructure using tools such as:
Nagios, Prometheus, Grafana
Loki Stack & Promtail
Datadog, Dynatrace
Amazon Cloud Watch
ELK/EFK Stack
Server Density
Analyze logs, troubleshoot issues, and ensure high availability of systems.
Set up alerts and dashboards for proactive monitoring.
5. Build, Release & System Administration
Handle build and release processes using Maven, Jenkins, and Linux/Ubuntu systems.
Manage web/application servers such as Nginx and Apache Tomcat.
Configure messaging systems like Rabbit
MQ and Redis.
Perform infrastructure orchestration and migration activities.
6. Code Quality & Reviews
Conduct peer and self-reviews for Infrastructure-as-Code (IaC).
Ensure adherence to coding and deployment standards.
Use Sonar Qube for code quality checks and ensure no critical vulnerabilities.
Manage version control, branching, and merging strategies using Git tools.
7. Documentation
Prepare detailed deployment and technical documentation.
Maintain runbooks, SOPs, and best practice guidelines.
Assist team members in creating standard operational documentation.
Tools & Technologies
Cloud Platforms
AWS, Google Cloud Platform
Infrastructure & Automation
Terraform, Terragrunt, Packer, Boto3, Chef, Ansible, Rundeck
Containerization & Orchestration
Docker, Kubernetes, Rancher, Helm, Kustomize, Istio
CI/CD & Dev Ops Tools
Jenkins, Git Hub Actions, Git Lab CI, ArgoCD, Spinnaker, FluxCD (Git Ops)
Monitoring & Logging
Nagios, Prometheus, Grafana, Loki Stack, ELK/EFK
Datadog, Dynatrace, Cloud Watch, Server Density
Databases
MySQL, Dynamo
DB, Aurora, Mongo
DB, Percona
Programming & Scripting
Python (basic), Shell/Bash
Other Technologies
Redis, Rabbit
MQ, Sonar Qube, Ops Genie, SSO, Databricks
Skill Requirements
Must-Have Skills
Strong experience with monitoring tools (Nagios, Prometheus, Grafana, ELK/EFK, Datadog, Dynatrace, Cloud Watch).
Hands-on experience with Terraform, Kubernetes,…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).