Platform - Site Reliability Engineer II; Networking
Listed on 2026-02-12
-
IT/Tech
Systems Engineer, Cloud Computing, SRE/Site Reliability, Network Engineer
Location: Dimondale
Platform - Site Reliability Engineer II (Networking)
Join to apply for the Platform - Site Reliability Engineer II (Networking) role at Elastic
Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale — unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to enable everyone to accelerate the results that matter.
By taking advantage of all structured and unstructured data, securing and protecting private information more effectively, Elastic’s complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI.
The Role
As part of the Platform Engineering department, the Traffic team is crafting, building, and improving the multi‑cloud platform at scale for Elastic Cloud Hosted and Serverless
. We grow and mature our distributed network services and solutions for multiple cloud service provider platforms. We are built on Kubernetes, Go/Scala, and custom orchestration architectures. In your daily life with us, you will participate in coding, innovating technical designs, crafting solutions, improving resilience, and prioritizing security, bug fixes, and features. For example, Debugging Azure Networking for Elastic Cloud Serverless is part of our efforts, and we want your experience to contribute to a truly exceptional customer experience.
You Will Be Doing
- Taking an engineering approach in leading technical initiatives for automating network engineering efforts to guarantee the reliability of the global Elastic infrastructure.
- Growing our global Platform infrastructure to meet the increasing scaling demands by developing and maintaining software, tooling and automations.
- Collaborating in an environment with an inclusive approach, focusing on operational excellence, and uplifting others.
- Responding to and preventing repeated customer impact in response to major incidents and prioritized problem management. Our on‑call rotation uses a follow‑the‑sun model where everyone participates in it during their working hours.
- Success and lessons from striving for "progress not perfection" in the name of Platform reliability. We want to hear about your customer‑first approach in solving operational problems with an SRE perspective.
- A background in software engineering to collaborate with engineers to identify, implement, and deliver solutions. Experience in public cloud and managed Kubernetes services is advantageous.
- Passion for developing solutions that involve inclusive communication methods to grow and strengthen partner and team relationships. Experience working in distributed teams or remotely is desirable.
- Operated a SaaS product in a public cloud ideally built using Infrastructure‑as‑Code tooling such as Crossplane or Terraform.
- Built or operated a Kubernetes‑at‑scale infrastructure across multiple cloud providers, and the vital automation to support it.
- Written non‑trivial programs in Golang or other programming languages.
- Worked with containerized services such as Docker.
- Proven experience in leading and improving alerting and major incident management standard processes, metrics, and systems (e.g. Elastic Stack, Graphite, Prometheus, Influx) to diagnose issues and quantify impacts for varied organizational levels.
- Experience in system administration with professional skills in Linux on distributed systems at scale.
- Diagnosed or designed, implemented, and created solutions with the Elastic Stack.
- Experienced in thriving in a self‑organizing and sharing globally distributed team environment.
- Strengthened team members by uplifting others with coaching and mentoring.
As a distributed company, diversity drives our identity. Whether you’re looking to launch a new career or grow an existing one, Elastic is the type of company where you can balance great work with great life. Your age is only a number. It doesn’t matter if you’re just out of college or your children are; we need…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).