Senior Site Reliability Engineer
Listed on 2026-06-13
IT/Tech
Cloud Computing: Infrastructure & Operations, SRE/Site Reliability, Systems Engineer
QGenda is redefining healthcare workforce management everywhere care is delivered. We're on a mission to empower the healthcare industry to better onboard, deploy, and manage its workforce. Over 4,500 healthcare organizations have trusted us to help them make strategic workforce decisions through our unified software platform. With more than 800 employees across the US, we are united in our vision and culture to make a difference for our customers, while enjoying the day-to-day.
At QGenda, we value our employees and their contributions toward the success of the business. We strive to create a dynamic work environment that fosters growth, innovation, and collaboration, where employees can be proud of the work they do and the impact it has on the healthcare industry.
As a Senior Site Reliability Engineer, you will work with our Infrastructure and Product Development Teams to increase the scalability, reliability, and performance of our systems and services. You will build and extend automation for the configuration and monitoring of our AWS-hosted applications. You will also have the opportunity to evaluate new AWS services and tools to determine whether they could be used in our environments.
You’ll bring a focus to platform health and monitoring to allow us to deliver the best possible experience for our customers. This is an excellent opportunity to have a significant impact on the stability of our systems and contribute to the evolution of our technology stack.
NOTE:
This role is hybrid with one required day in our Buckhead (Atlanta, Georgia) or our Uniontown, Ohio office depending on your current location.
System Reliability & Performance:
- Design, implement, and manage scalable systems that ensure high availability, fault tolerance, and optimal performance.
- Continuously monitor and enhance system health and performance through data analysis and metrics.
Automation & Tooling:
- Develop and advocate for automation tools to eliminate repetitive manual processes and improve efficiency.
- Build and enhance CI/CD pipelines to streamline software delivery and deployments.
- Participate in on‑call rotation to respond to incidents, troubleshoot problems, and minimize downtime.
- Conduct root cause analyses and implement permanent solutions to recurring issues.
Infrastructure Management:
- Manage our cloud‑based infrastructure environment in AWS.
- Optimize costs and resources while maintaining robust and scalable systems.
- Serve as a technical advisor to engineering teams on infrastructure and operations best practices.
- Actively contribute to fostering an SRE culture within the organization by promoting observability, retrospectives, and continuous improvement.
- Curiosity-driven mindset with a desire to continuously learn and improve systems.
- Strong sense of ownership: you see problems through to resolution, not just escalation.
- Comfortable navigating ambiguity and making pragmatic tradeoffs under pressure.
- Availability for off‑hours deployment and upgrades of production systems during release and maintenance windows.
- Strong problem‑solving skills and ability to work effectively under pressure.
- Excellent communication skills for cross‑functional collaboration as well as documentation creation.
- B.S. in Computer Science, Computer Information Systems, or Computer Engineering from a major U.S. university or equivalent industry experience.
- 7+ years of experience as a Dev…