Senior Site Reliability Engineer
Listed on 2025-12-25
-
IT/Tech
Cloud Computing, Systems Engineer, Cybersecurity
Saviynt's AI-powered identity platform manages and governs human and non-human access to all of an organization's applications, data, and business processes. Customers trust Saviynt to safeguard their digital assets, drive operational efficiency, and reduce compliance costs. Built for the AI age, Saviynt is today helping organizations safely accelerate their deployment and usage of AI. Saviynt is recognized as the leader in identity security, with solutions that protect and empower the world’s leading brands, Fortune 500 companies and government institutions.
For more information, please visit
We are seeking a highly skilled and experienced Senior Dev Ops Engineer with a strong focus on observability to join our team. You will play a key role in ensuring the reliability, performance, and scalability of our cloud-based infrastructure by leveraging monitoring, logging, and tracing tools. Your insights will empower engineering teams to build resilient services, proactively resolve issues, and drive continuous improvement.
WHAT YOU WILL DO:- Design, implement, and maintain robust CI/CD pipelines to support reliable and efficient software delivery, including proactive monitoring and troubleshooting.
- Collaborate with development, QA, and operations teams to improve service development, testing, and deployment workflows.
- Automate infrastructure provisioning and management using Infrastructure as Code (IaC) tools (e.g., Terraform, Cloud Formation).
- Manage and optimize cloud infrastructure across platforms such as AWS, Azure, or Google Cloud to ensure high availability, performance, and scalability.
- Ensure compliance with security, performance, and quality standards throughout the CI/CD and deployment lifecycle.
- Install and configure Saviynt products in accordance with defined procedures and organizational best practices.
- Automate customer deployment, migration, and upgrade processes by reducing manual tasks during all deployment phases (pre-, in-, and post-deployment).
- Troubleshoot and resolve cloud infrastructure and deployment-related incidents in collaboration with engineering and IT teams to minimize downtime and maintain service quality.
- Develop and maintain technical documentation for infrastructure, automation tools, and deployment procedures.
- Create and deploy automation scripts to streamline system tasks, reduce manual intervention, and eliminate human error in cloud environments.
Required Qualifications (Senior-Level Role)
- Master’s degree in Engineering, or Bachelor’s degree with professional software engineering experience or equivalent experience.
- 2+ years of Sr. Engineer level hands‑on experience in cloud infrastructure, Dev Ops, and automation within AWS and Azure environments. Experience designing, deploying, and maintaining scalable infrastructure for SaaS applications in AWS and Azure.
- Strong expertise in Infrastructure as Code (IaC) using tools such as Terraform, Cloud Formation, ARM, Ansible, and Puppet.
- Solid knowledge of cloud networking concepts, including BGP, routing, and REST APIs.
- Experience working with cloud-native networking services for secure and scalable integrations.
- Experience in CI/CD and Dev Ops environments using Jenkins, Git Lab CI, and similar tools.
- Container orchestration experience using Docker and Kubernetes for microservices-based architectures.
- Extensive scripting experience in Python, Java, and Bash to automate system tasks and reduce manual intervention.
- Proficiency in managing Linux environments (e.g., Red Hat, CentOS).
- Experience building and maintaining observability stacks (monitoring, logging, tracing) for cloud-native systems.
- Experience with tools such as Prometheus, Grafana, ELK Stack, Splunk, Datadog, New Relic, and Open Telemetry.
- Experience working with REST/SOAP APIs and tools like Postman.
- Proficient in MySQL and database operations.
- Strong understanding of Identity and Access Management (IAM) and general cybersecurity best practices.
- Proficient in Git and version control workflows.
- Certifications in AWS, Azure, GCP, Kubernetes, or Terraform.
- Experience with Open Telemetry SDKs (Go, Java, Node.js).
- Familiarity with SLOs, error budgets, and observability-as-code practices.
$125,000 - $150,000 a year We offer you a competitive total rewards package, learning and tremendous opportunities to grow and advance in your career. At Saviynt, it is not typical for an individual to be hired at or near the top of the range for their role and final compensation decisions are dependent on many factors including but are not limited to location;
skill sets; experience and training; licensure and certifications; and other relevant business and organizational needs. A reasonable estimate of the current range is $125,000 - $150,000 annually.
You may also be eligible to participate in a Saviynt discretionary bonus plan, subject to the rules governing the program, whereby an award, if any, depends on various factors, including, without limitation,…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).