×
Register Here to Apply for Jobs or Post Jobs. X

Software Engineer - Site Reliability

Job in New York, New York County, New York, 10261, USA
Listing for: PowerToFly
Full Time position
Listed on 2025-12-20
Job specializations:
  • IT/Tech
    Cloud Computing, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 194000 - 260000 USD Yearly USD 194000.00 260000.00 YEAR
Job Description & How to Apply Below
Position: Staff Software Engineer - Site Reliability
Location: New York

Overview

We're Celonis, the global leader in Process Mining technology and one of the world's fastest-growing SaaS firms. We believe there is a massive opportunity to unlock productivity by placing data and intelligence at the core of business processes - and for that, we need you to join us.

Celonis is the global leader in Process Mining technology and one of the fastest-growing SaaS companies worldwide. We are on a mission to unlock unprecedented productivity by embedding data and intelligence at the core of every business process—and we’re looking for passionate individuals to help us realize that vision.

The Team

As a member of our Reliability Engineering team, you will play a critical role in ensuring the health, performance, and resilience of our platform. The team applies advanced software engineering and Site Reliability Engineering (SRE) principles to drive system reliability, scalability, and operational excellence across the organization.

The Role
  • Join a highly technical, collaborative, and innovation-driven team that blends Site Reliability Engineering with modern Software Engineering practices to build resilient and scalable systems.
  • Lead reliability efforts for a fleet of 80+ FedRAMP-compliant microservices running on Kubernetes, applying SRE principles to drive observability, automation, and incident prevention.
  • Own high-priority application incident escalations, performing deep technical analysis and restoration within defined SLOs, while continuously improving detection and response mechanisms.
  • Engineer solutions to enhance the availability, latency, and performance of production services—automating manual processes to eliminate toil and scale operational efficiency.
  • Collaborate closely with platform and application engineering teams to conduct post-incident reviews, extract insights, and implement systemic changes that improve overall reliability.
  • Document operational knowledge and runbooks, embedding SRE best practices into onboarding, incident response, and platform architecture standards.
Qualifications
  • Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related technical field (or equivalent hands-on experience).
  • Minimum of 5 years of experience building and maintaining cloud-based software applications with at least one public cloud platform (AWS, Azure, or GCP).
  • Proficiency in Java, the Spring framework, and Python (or a similar scripting language) in a Linux environment.
  • Prior experience contributing to Site Reliability Engineering initiatives or similar operational roles.
  • Knowledge of SRE principles, including SLI/SLO design, error budgets, and toil reduction strategies.
  • Proven expertise in developing and operating production-grade, scalable services using Kubernetes and elastic cloud architectures.
  • Strong problem-solving and troubleshooting abilities in complex, distributed systems.
  • Excellent written and verbal communication skills in English.
  • Please note :
    This position is not eligible for immigration visa sponsorship, now OR in the future.
Nice to Have
  • Familiarity with observability and monitoring tools (e.g., Datadog, etc.).
  • Experience with CI/CD pipelines and tools such as ArgoCD, Git Hub Actions, or similar.
  • Experience with Infrastructure as Code (IaC) tools such as Terraform and Kustomize.
  • Exposure to incident management practices, on-call rotations, and postmortem culture.
Compensation

The base salary range below is for the role in the specified location, based on a Full Time Schedule.

Total compensation package will include base salary + bonus/commission + equity + benefits (health, dental, life, 401k, and paid time off). Please note that the base salary range is a guideline, and that the actual total compensation offer will be determined based on various factors, including, but not limited to, applicant's qualifications, skills, experiences, and location.

The base salary range below is for the role in New York, based on a Full Time Schedule.

$194,000—$260,000 USD

What Celonis Can Offer You
  • Pioneer Innovation: Work with the leading, award-winning process mining technology, shaping the future of business.
  • Accelerate Your Growth: Benefit from clear career paths,…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary