Senior CloudOps Engineer - Evinova
Publicado en 2026-02-22
-
TI/Tecnología
Ingeniero de sistemas, Cloud, Ingeniería de confiabilidad del sitio/Confiabilidad del sitio, Seguridad cibernética
Role based in Barcelona - 3 days at office / 2 days at home
As a Senior Cloud Ops Engineer, you will serve as a technical leader within our Platform Operations group—driving operational excellence across our global SaaS platform. You will own complex cloud operations, strengthen CI/CD reliability, lead cost‑optimization initiatives, and provide deep expertise in AWS, Kubernetes (EKS), and database operations (Mongo
DB and/or RDS Aurora). This role requires a strong operations mindset, the ability to diagnose and resolve complex issues, and the capability to influence engineering practices across teams. You will mentor junior engineers, lead automation efforts, and help shape the standards that ensure the platform is secure, reliable, and cost‑efficient at scale.
- Lead day‑to‑day operational support across AWS accounts (EC2, EKS, RDS, S3, VPC, IAM).
- Own lifecycle management activities, including patching, scaling, performance tuning, and backup strategy.
- Serve as an escalation point for major incidents; lead root‑cause analysis and post‑incident reviews.
- Improve operational resilience through automation, standardization, and proactive monitoring.
- Architect improvements to CI/CD pipelines (ArgoCD, Git Hub Actions, Jenkins) to increase reliability and reduce deployment friction.
- Drive automation of repetitive operational tasks using Python, Bash, or similar.
- Partner with engineering teams to streamline environment provisioning and deployments.
- Mentor Cloud Ops engineers on CI/CD patterns and operational guardrails.
- Lead cloud cost‑efficiency initiatives across compute, storage, data services, and networking.
- Analyze usage trends and guide right‑sizing, modernization, and storage tiering strategies.
- Partner with engineering and Fin Ops teams to forecast cloud spend and identify optimization opportunities.
- Drive accountability by defining and tracking cost KPIs across services.
- Lead operational ownership of Mongo
DB and/or RDS Aurora (performance tuning, scaling, failover, backup/restore validation). - Diagnose and resolve complex database issues impacting availability or performance.
- Guide application teams on efficient usage patterns for Mongo
DB/RDS Aurora. - Contribute to improvements in backup strategy, disaster recovery, and data resiliency.
- Design and enhance monitoring, alerting, and dashboards (Cloud Watch, Grafana, Splunk, Open Telemetry). Build and maintain runbooks, SLOs, and operational playbooks.
- Improve alert quality, signal‑to‑noise ratio, and overall response processes.
- Champion observability adoption across teams to reduce MTTR and improve uptime.
- Enforce strong IAM governance, least‑privilege access, and secure operational patterns.
- Lead patching, vulnerability remediation, and evidence collection for compliance (SOC 2, ISO 27001, HIPAA).
- Partner with Security to maintain audit readiness and reduce operational risk.
- Identify control gaps and contribute to continuous compliance automation.
- 5–8+ years in Cloud Ops, Dev Ops, SRE, or infrastructure engineering roles.
- Deep, hands‑on experience operating AWS environments at scale (EC2, RDS, EKS, S3, Cloud Watch).
- Production‑level expertise with Mongo
DB and/or RDS Aurora (operations, optimization, DR). - Strong automation and scripting capability (Python, Bash, or similar).
- Hands‑on experience supporting CI/CD pipelines and deployment platforms.
- Strong troubleshooting and performance‑tuning skills across cloud, networking, and application layers.
- Experience with observability tooling (Cloud Watch, Grafana, Splunk, Open Telemetry).
- Ability to mentor junior engineers and lead operational improvements.
- Operations leader: thrives in environments where reliability, cost‑efficiency, and stability matter most.
- Proactive problem solver: anticipates issues and drives long‑term technical improvements.
- Data‑driven: uses metrics, logs, and cost data to guide decisions.
- Collaborative…
Para buscar, ver y solicitar empleos que acepten solicitudes de su ubicación o país, toque aquí para realizar una búsqueda: