Site Reliability Engineer
Listed on 2026-02-16
-
IT/Tech
Cloud Computing, Systems Engineer
About Us
We love going to work and think you should too. Our team is dedicated to trust, customer obsession, agility, and striving to be better everyday. These values serve as the foundation of our culture, guiding our actions and driving us towards excellence. We foster a culture of performance and recognition, allowing us to transform growth as we enable our employees to do the best work of their careers.
This role is open to candidates based in or near Austin, TX. Our Austin office is based in the vibrant San Jacinto Center downtown with breathtaking views of Lady Bird Lake. At Logic Monitor, we hire within our Centers of Energy—vibrant locations where our teams connect, collaborate, and innovate. To learn more about life at Logic Monitor, check out our Careers Page.
WhatYou'll Do
Logic Monitor® is the AI-first hybrid observability platform powering the next generation of digital infrastructure. Logic Monitor delivers complete visibility and actionable intelligence across on-premises, cloud, and edge environments. By anticipating issues before they strike, optimizing resources in real time, and enabling faster, smarter decisions, Logic Monitor helps IT and business leaders protect margins, accelerate innovation, and deliver exceptional digital experiences without compromise.
Our customers love Logic Monitor's ability to bring cloud and traditional IT together into one view, as seen in minimal churn rates, expansion business, and exciting new customer references. Logic Monitor has received the highest Net Promoter Score of any IT Infrastructure Management provider. Logic Monitor also boasts high employee satisfaction. We have been certified as a Great Place To Work®, and named one of Built In s Best Places to Work for the seventh year in a row!
We are seeking a talented and experienced Site Reliability Engineer (SRE) to help ensure the uptime and reliability of our mission-critical systems. In this high-impact role, you’ll automate and streamline operational tasks, continuously looking for ways to improve performance, efficiency, and scalability.
You’ll work closely with developers to provide infrastructure-focused feedback that enhances product performance within the LM environment. This is a unique opportunity to sharpen your SRE skill set and become an invaluable member for the core LM Operations team.
Here s a closer look at this key role:
Infrastructure Reliability & Uptime- Maintain uptime of Logic Monitor’s SaaS-based platform and implement technical and process improvements to enhance system reliability.
- Ensure the security and stability of the production environment through proactive monitoring and risk mitigation strategies.
- Design, deploy, and manage scalable infrastructure and system integrations to support business growth and technical innovation.
- Write code to automate infrastructure maintenance, deployments, and routine operational tasks to increase efficiency and reduce manual effort.
- Partner closely with development teams to support and influence operational architecture and design changes.
- Lead cross-functional, technically complex projects, driving execution and alignment across teams.
- Act as a strategic technical resource across the organization, developing and delivering presentations for internal teams, customers, and external conferences.
- Mentor junior team members, fostering growth, knowledge sharing, and operational excellence.
- Set a high standard for documentation and runbook quality, leading by example to promote clarity, consistency, and operational readiness.
- 3+ years of experience in a Linux engineering role, preferably in a SaaS-based company.
- Solid understanding of Linux system administration in distributed environments.
- Experience with configuration management tools such as Chef, Puppet, or Ansible.
- Experience with virtualization and container technologies (e.g., Docker, Kubernetes).
- Programming/scripting experience (Python, Shell, Go).
- Knowledge of security as it relates to Linux systems, applications, and networking.
- High-level understanding of networking…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).