Senior Software Engineer, SRE
Listed on 2025-12-23
Software Development
Software Engineer, Cloud Engineer - Software, DevOps
About Abridge
Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most—their patients.
Our enterprise-grade technology transforms patient-clinician conversations into structured clinical notes in real time, with deep EMR integrations. Powered by Linked Evidence and our purpose-built, auditable AI, we are the only company that maps AI-generated summaries to ground truth, helping providers quickly trust and verify the output. As pioneers in generative AI for healthcare, we are setting the industry standards for the responsible deployment of AI across health systems.
We are a growing team of practicing MDs, AI scientists, PhDs, creatives, technologists, and engineers working together to empower people and make care make more sense. We have offices located in the Mission District in San Francisco, the SoHo neighborhood of New York, and East Liberty in Pittsburgh.
The Role
Abridge’s services and engineering team are in hyperscale mode. We are looking for experienced SREs to join our team and help improve the performance, stability, and scalability of our software by multiples. This is a distributed-systems-oriented role and is approximately 80% software focused and 20% cloud infrastructure focused.
You will help us build load testing and chaos engineering into our CI pipelines, leverage observability and profiling tools to identify performance bottlenecks and resolve them, work with diverse teams to help rehome their applications onto more scalable infrastructure, and ensure a smooth ride as we hyperscale our application adoption in the healthcare space. You may be embedded with other teams for weeks or months.
The platform we are building needs to maximize both engineering velocity and security, will operate at tremendous scale, and presents many opportunities to leverage creativity, autonomy, and leadership to take things 0 to 1. This is a unique opportunity to grow your career quickly at a rapidly growing company that leverages the best of emerging technologies.
What You'll Do
Leverage load testing, chaos engineering, and other test practices to identify performance and latency bottlenecks across all of our systems, and make changes to application code to resolve them.
Drive software changes that can rehome applications at the code level onto new infrastructure (run times, event driven infrastructure, databases, and more) in order to dramatically improve scalability as well as enable multi-tenant deployments.
Identify and implement software configuration changes and performance tuning parameters that will dramatically improve performance and scalability.
Build developer tools and software modules that help engineers across the entire engineering organization write code faster and more effectively.
Work with the Platform team to develop, and application teams to adopt, emerging elements of our internal developer platform, such as service templates and self-serve infrastructure.
Work with application teams to establish and adopt SLOs and error budgets, and drive better metrics for application health that can drive automated canary releases, improved health monitoring, and better engineering practices.
Uplevel our ability to respond to incidents by improving observability, runbooks, and incident response muscle across the organization.
Evangelize, document, and train the engineering team on the solutions being built and uplevel them on cloud native design strategies and tools.
Be a public evangelist for Abridge in the global platform engineering community, including conferences, open source, and research, as we pioneer new AI-first, cloud-native-first, security-first implementations at scale.
8+ years of software engineering experience focused on distributed systems or tooling, with an interest in engineering enablement and software scaling.
At least 2 years of experience as a back-end engineer focused on system performance and scalability.
Experience reducing software latency by multiples using observability and profiling tools, and deriving great pleasure from doing so.
Experience building on Kubernetes and scaling compute services on Kubernetes; experience with related cloud native technologies including ArgoCD, Argo Rollouts, Istio, etc.
Comfortable implementing and securing services in Google Cloud Platform with Infrastructure as Code, including GCP Projects, VPC Networks, Google Kubernetes Engine, and IAM Roles, Groups and policies. Candidates without GCP experience but who have experience with Kubernetes are encouraged to apply.
Experience building software with backend languages (e.g. Python, Go, Node.js, and Rust).
Experience monitoring distributed systems with Prometheus, OpenTelemetry Collector, and Grafana (or something similar), including metrics collection,…