Site Reliability Engineer III
Listed on 2026-06-04
-
IT/Tech
Cloud Computing, Systems Engineer
Veeam is the Data and AI Trust Company, specializing in helping organizations ensure their data and AI are fully understood, secured, and resilient to enable the acceleration of safe AI the market leader in both data resilience and data security posture management, Veeam is built for the convergence of identity, data, security, and AI risk. Headquartered in Seattle with offices in more than 30 countries, Veeam protects over 550,000 customers worldwide, who trust Veeam to keep their businesses running.
Join us as we go fearlessly forward together, growing, learning, and making a real impact for some of the world’s biggest brands.
The Role
We are looking for an experienced Senior Site Reliability Engineer to join the Veeam Data Cloud (VDC) engineering team. You will be working with a global team to build the world’s next modern data protection platform for Veeam. This is an excellent opportunity for someone with SaaS experience to work with a cutting-edge technology stack based on containers, serverless infrastructure, Golang, public cloud services in the SaaS domain.
WhatYou'll Do
- Design, implementation and maintenance of scalable and reliable infrastructure solutions on Microsoft Azure and additional cloud platforms in the future
- Automation of the deployments, maintenance of a resilient, secure, and efficient SaaS application platform to meet established service levels
- Upkeep and support of delivery and release pipelines
- Continuous evaluation and improvement of the reliability, performance, and scalability of our systems
- Development of comprehensive monitoring and alerting solutions
- Incident response for distributed applications in production environments, including a mandatory participation in on-call rotations
- Proactively meet standards for information security and compliance, such as ISO (International Standards Organization), SOX (Sarbanes Oxley), SSAE (Standards for Attestation Engagements) 16, etc.
- Shepherd the definition, documentation, and improvement of our internal standards for style and maintainability
- 24/7 Support
- Microsoft TFS, Azure Dev Ops, Git, Bit Bucket
- Azure (Entra , API Management, Cosmos DB, Storage services, Azure Functions, static website hosting, Azure security, etc.)
- IaC tools (Azure ARM templates, AWS Cloud Formation, Terraform, the Serverless Framework, etc.)
- Observability (Azure Monitor, App Insights, Elastic Stack)
- 3+ years of experience in 24x7 production operations for a SaaS (Software as a Service) or cloud service provider
- Experience with implementation and maintenance of leading infrastructure and application monitoring tools (Azure Monitor, App Insights, Elastic Cloud)
- Experience managing Azure IaaS (Infrastructure as a Service) and PaaS (Platform as a Service) solutions
- Strong problem-solving skills and the ability to troubleshoot complex issues in a distributed, multi-tenant environments
- Experience with container orchestration and management platforms
- Possess system programming skills in Python, Power Shell, Bash, Go, etc.
- Experience with implementation, maintenance, and support of CI/CD practices and tools (Azure Dev Ops or similar)
- You are experienced with distributed, event-based messaging architectures (Azure Event Hub, Azure Service Bus, Kafka, etc.)
- English proficiency level sufficient to communicate with international teams
- Industry-recognized certifications in the relevant field (e.g., AZ-400, AWS Certified Dev Ops Engineer, DCA)
- Experience with migrating and adapting on-premises products to cloud infrastructure
- Experience with AWS (ECS, RDS, Dynamo
DB, VPCs, Step Functions, Lambda, IAM, EC2, S3, etc.) - Experience with C# and .NET
- Unlimited paid time off, 12 paid holidays, plus 4 extra global Veea Me Days for self-care and 24 paid volunteer hours annually through Veeam Cares
- Paid parental leave: 8 weeks for all parents, 16 weeks for birthing parents
- Medical, dental, and vision coverage starting on your first day
- Mental health support, therapy sessions, and digital wellness tools via our Employee Assistance Program
- 401(k) retirement plan with company matching contributions
- Fertility, adoption, and…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).