Senior DevOps Engineer
Listed on 2026-06-26
-
IT/Tech
SRE/Site Reliability, AWS, Cloud Computing: Infrastructure & Operations, IT Infrastructure
About Us Kyriba is a global fintech leader empowering CFOs and finance teams with cloud‑based treasury, payments, risk management and working capital solutions. We serve 3,000+ customers worldwide, managing $15 trillion in payments annually and helping businesses optimize liquidity performance across the enterprise.
Dream Big. Go Beyond. Be Unstoppable. We are on a mission to become the most sought‑after cloud technology company globally. We think big, innovate relentlessly, and challenge the status quo every day. If you are a problem‑solver ready to push boundaries and achieve more than you thought possible, you’ll find an exceptional career within an extraordinary business.
AboutThe Role
We are looking for a passionate Senior Dev Ops Engineer with strong experience in Storage to join our Platform Engineering team. As a core member of the team, you will actively contribute to the full platform ecosystem: infrastructure automation, continuous delivery pipelines, self‑service provisioning capabilities, production operations runbooks, and L2 support.
You will bring a broader Dev Ops culture and mindset, with a solid command of AWS infrastructure, tooling, and modern platform practices including AIOps and advanced Kubernetes. In addition to your full platform engineering contributions, you will serve as a team's storage domain expert, particularly around Net App on AWS. This specialization will be especially prominent in the early stages of your onboarding, but it does not define the boundaries of your role.
You will investigate and resolve storage‑related incidents at L2 level, while also continuing to contribute across all dimensions of the platform.
This is a hands‑on, automation‑first role where you are expected to codify everything, document what you build, and make it available as a self‑service to the engineering teams around you.
Essential Duties And Responsibilities Platform Engineering (Full Scope)- Contribute to infrastructure automation across the AWS platform using Terraform Enterprise, Ansible, Harness, Kubernetes (including operators, CRDs, and cluster lifecycle management), and Git Ops practices
- Support and evolve continuous delivery pipelines, ensuring reliable and repeatable deployments across environments (PRE, SBX, PRD)
- Build and maintain self‑service capabilities so that developers and engineering teams can autonomously claim and consume infrastructure resources (storage, compute, databases, messaging) through Kubernetes‑native APIs and Git Ops workflows, without manual intervention
- Leverage AIOps practices to improve platform reliability: intelligent alerting, anomaly detection, AI‑assisted incident triage, and automated remediation using tools like AWS Dev Ops Agent, Datadog and Git Hub Copilot
- Write, maintain, and improve production runbooks to ensure operational procedures are automated, documented, and accessible to the team
- Provide L2 support on automation systems and pipelines already in place, troubleshooting issues across the platform ecosystem
- Participate in on‑call rotations and contribute to incident response and post‑mortem processes
- Collaborate actively with Production Ops, DBA, and application engineering teams on platform improvements and migrations
- Contribute to Fin Ops by identifying and implementing AWS cost optimization opportunities
- Document architecture decisions, operational procedures, and contribute to team knowledge sharing
- Own, automate and improve Kyriba's storage platform (Net App, S3, EBS), ensuring reliability, performance, and DR readiness
- Design and implement storage architectures on AWS with high availability and fault tolerance
- Provide L2 support on storage incidents, leading investigations on performance degradation, availability issues, and data integrity events
- Integrate storage with Kubernetes workloads (persistent volumes, storage classes, CSI drivers) and backup solutions (AWS Backup, Netapp B&R)
- Ensure compliance with RPO/RTO objectives and contribute to DR testing and validation
- Administer and continuously improve the Active Directory and DNS environment, with a focus on performance optimization, security…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).