Senior Site Reliability Engineer, Atlas
Listed on 2026-06-02
IT/Tech
Cloud Computing, SRE/Site Reliability
The Team
This role can be based in our NYC HQ on a hybrid basis, or it can be fully remote from a location in the Eastern or Central time zone. We are looking for an experienced Senior Engineer for our SRE Atlas team to support, maintain, and grow the Atlas platform. As a senior SRE, you will be expected to design and build complex systems, operate with autonomy, and act as owner of everything you do.
The SRE Atlas team works alongside the various Atlas software engineering teams to provide expertise about running systems at scale, build new tooling and automation and perform essential maintenance of the Atlas fleet.
This is an SRE team, which means you can expect a highly hands-on approach, tackling the technical challenges of implementing large-scale solutions that can impact our customers’ most crucial workloads.
Role Overview
We are seeking a talented Site Reliability Engineer (SRE) with a strong infrastructure background. This role requires engineers to have a customer-first mindset to ensure that everything we do results in a stronger product and a better experience for all Atlas customers.
The ideal candidate should:
- Have 5+ years of experience running critical systems at scale
- Value efficiency in processes and operations, and display a preference for automation over manual processes (“allergic to ops work”)
- Be familiar with a major cloud provider (AWS, Azure, or GCP) and possess the ability to build and operate systems in a multi-cloud environment
- Have a strong understanding of how to run a large-scale Linux environment, including low-level fundamentals
- Have a firm grasp of at least one modern programming language, beyond basic scripting (Go, Ruby, Python)
- Have a solid understanding of web and network protocols and standards (HTTP, TLS, DNS, etc.)
- Participate in the development of a reliable and resilient multi-cloud platform that hosts business critical applications for a wide & varied range of customer applications
- Collaborate with service-owning teams to provide internal support, solve technical challenges and adapt or build tooling to solve novel use cases in a generic fashion
- Participate in a 24/7 on-call rotation to swiftly resolve issues related to any disruption of our customer facing Atlas fleet, ensuring minimal disruption and high availability
MongoDB is built for change, empowering our customers and our people to innovate at the speed of the market. We have redefined the database for the AI era, enabling innovators to create, transform, and disrupt industries with software. MongoDB’s unified database platform, the most widely available, globally distributed database on the market, helps organizations modernize legacy workloads, embrace innovation, and unleash AI. Our cloud-native platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available across AWS, Google Cloud, and Microsoft Azure.
With offices worldwide and nearly 60,000 customers (including 75% of the Fortune 100 and AI-native startups) relying on MongoDB for their most important applications, we’re powering the next era of software.
Our compass at MongoDB guides how and why we make decisions, show up for each other, and win. It’s what makes us MongoDB.