Platform - Site Reliability Engineer II; Networking
Listed on 2026-02-12
IT/Tech
Systems Engineer, Cloud Computing, SRE/Site Reliability
Platform - Site Reliability Engineer II (Networking)
Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data at scale, unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to accelerate the results that matter. By securing and protecting private information more effectively, Elastic’s complete, cloud‑based solutions for search, security, and observability help organizations deliver on the promise of AI.
What is the Role
As part of the Platform Engineering department, the Traffic team is crafting, building, and improving the multi‑cloud platform at scale for Elastic Cloud Hosted and Serverless. We grow and mature our distributed network services and solutions for multiple cloud service provider platforms. We are built on Kubernetes, Go/Scala, and custom orchestration architectures. In your daily life with us, you will code, innovate on technical designs, craft solutions, improve resilience, and prioritize security, bug fixes, and features. For example, debugging Azure Networking for Elastic Cloud Serverless is part of our efforts, and we want your experience to contribute to a truly exceptional customer experience.
What You Will Be Doing
- Lead technical initiatives for automating network engineering efforts to guarantee the reliability of the global Elastic infrastructure.
- Grow our global Platform infrastructure to meet the increasing scaling demands by developing and maintaining software, tooling, and automations.
- Collaborate in an inclusive environment focused on operational excellence and uplifting others.
- Respond to major incidents and prevent repeated customer impact through prioritised problem management. Our on‑call rotation uses a follow‑the‑sun model where everyone participates during their working hours.
What You Bring
- Success and lessons from striving for "progress not perfection" in the name of Platform reliability. We want to hear about your customer‑first approach to solving operational problems with an SRE perspective.
- A background in software engineering to collaborate with engineers to identify, implement, and deliver solutions. Experience in public cloud and managed Kubernetes services is advantageous.
- Passion for developing solutions that involve inclusive communication methods to strengthen partner and team relationships. Experience working in distributed teams or remotely is desirable.
- Operated a SaaS product in a public cloud using Infrastructure‑as‑Code tooling such as Crossplane or Terraform.
- Built or operated a Kubernetes‑at‑scale infrastructure across multiple cloud providers, and the vital automation to support it.
- Written non‑trivial programs in Go or other programming languages.
- Worked with containerised services such as Docker.
- Proven experience in leading and improving alerting, major incident management processes, and metrics systems (e.g. Elastic Stack, Graphite, Prometheus, Influx).
- Experience in system administration with professional Linux skills on distributed systems at scale.
- Diagnosed or designed, implemented, and created solutions with the Elastic Stack.
- Experience thriving in a self‑organising, globally distributed team environment built on sharing.
- Strengthened team members by uplifting others through coaching and mentoring.
As a distributed company, diversity drives our identity. Whether you’re launching a new career or growing an existing one, Elastic balances great work with great life. Your age is only a number, and we value what you can do.
- Competitive pay based on the work you do.
- Health coverage for you and your family in many locations.
- Flexibility to craft your calendar with flexible locations and schedules for many roles.
- Generous vacation days each year.
- Matching up to $2,000 for financial donations and service.
- Up to 40 hours each year to use toward volunteer projects.
- Minimum 16 weeks of parental leave.
Elastic is an equal‑opportunity employer and is committed to creating an inclusive culture that celebrates different perspectives, experiences, and backgrounds. Qualified applicants will receive consideration for employment without regard to race, ethnicity, color, religion, sex, pregnancy, sexual orientation, gender identity or expression, national origin, age, marital status, protected veteran status, disability status, or any other basis protected by federal, state, or local law, ordinance, or regulation.
We welcome individuals with disabilities and strive to create an accessible and inclusive experience for all. To request an accommodation during the application or recruiting process, please email candid We will reply within 24 business hours.