DevOps/Platform Engineer
Published on 2026-02-21
IT/Technology
Cloud, Systems Engineer, Site Reliability Engineering/Site Reliability, IT Project Manager
Take the reins of a mission‑critical web platform at the heart of a modern, cloud‑driven organization. In this role, you will own the end‑to‑end infrastructure powering medium‑scale web environments, from AWS and Infrastructure as Code to CI/CD pipelines, monitoring, and security. Working hand‑in‑hand with developers, you will shape a stable, scalable, and future‑ready platform, driving automation, performance, and reliability while guiding the journey toward containerization and DevOps excellence.
Here's a little taste of your challenge:
Full ownership of the web development platform and infrastructure across multiple environments, providing a reliable playground for the web development organization;
Installation, configuration, and administration of web and database servers on AWS (EC2, S3, database services, load balancers), ensuring stability and scalability;
Management and provisioning of data center and cloud resources using Infrastructure as Code (Puppet/Ansible);
Contribute to and support the gradual transition towards our internal developer platform (IDP) based on Kubernetes and a shared observability stack, following GitOps principles;
The platform teams provide and maintain core infrastructure components such as CI/CD and progressive delivery tooling (Jenkins, ArgoCD), cluster lifecycle, DNS, AWS network, identity and credential/secret management, monitoring, logging and various controllers and operators (e.g. for Ingress, load balancing, DNS, Certificate management and more).
Your role will be focused on migrating, operating, modernising and maturing the web services running on top of this platform;
Ownership of the automated build and deployment pipeline (Jenkins);
Integration with the organization’s observability platform (powered by Prometheus, Grafana, Loki, Opsgenie, Pingdom);
Ensuring systems are patched and aligned with IT security standards, including evaluation of updates for compatibility and functional impact;
Configuration of application‑level routing and SSL/TLS settings using the platform’s ingress and certificate‑management tooling (e.g., Traefik/Istio, cert-manager, external-dns);
Incident investigation and root‑cause analysis, driving stability and long‑term improvements;
Definition and maintenance of backup and disaster‑recovery strategies for critical services;
Planning and execution of load and performance testing;
Close collaboration with both developers and platform teams to ensure alignment with platform‑supported patterns and tooling, and to maintain and improve service reliability, automation maturity, and operational best practices;
Creation and maintenance of clear documentation and knowledge sharing;
3+ years of professional experience designing, implementing, and operating multi‑tier environments with automated deployments, load balancing, and monitoring;
Advanced Linux knowledge: OS installation and administration, shell scripting, RPM packaging, and automated operations;
Solid experience hosting, administering, and monitoring cloud environments (AWS and/or Azure), including cost optimization;
Experience hosting and operating web frameworks and CMS (e.g. Cocoon, WordPress);
Strong troubleshooting skills and hands‑on experience with incident analysis;
Strong communication, collaboration, and documentation skills;
Ownership: you take responsibility for outcomes;
Curiosity: you enjoy learning, improving, and sharing knowledge;
Quality mindset: you build with reliability, performance, and scalability in mind;
Proactive, preventive thinking;
Fluent in English;
Proficiency with Apache HTTP Server, Jetty, Varnish Cache, and Solr in high‑performance web environments;
CI experience with Jenkins and exposure to automated testing;
Experience installing and maintaining DBMS (Postgres, MariaDB/MySQL);
Experience with caching strategies (CDN, Varnish) and performance optimization;
Infrastructure as Code using Terraform / Terraspace, Ansible, and Puppet (user‑level knowledge);
Proper understanding of DNS, request routing, and dependencies in web architectures;
Deep understanding of SSL/TLS, OpenSSL, and Java SSL tooling;
Experience with scripting…