
SRE Principal Engineer

Job in Atlanta, Fulton County, Georgia, 30383, USA
Listing for: Waystar, Inc.
Full Time position
Listed on 2025-11-25
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly
Job Description & How to Apply Below

ABOUT THIS POSITION

SRE Principal Engineer

We are seeking a highly skilled SRE Principal Engineer with Site Reliability Engineering (SRE) expertise to design, build, scale, and optimize our cloud platform and infrastructure. This role demands deep hands‑on experience with AWS cloud services across compute, storage, databases, networking, and security, combined with strong cost optimization strategies. The SRE Principal Engineer will help define the cloud roadmap and strategy, design scalable solutions, and ensure the reliability, security, and cost‑efficiency of the platform and infrastructure.

The role will be responsible for the scalability of the platform and infrastructure, ensuring it can support business growth while maintaining high availability and performance. Additional responsibilities will include mentoring junior members on the SRE team, reviewing and approving infrastructure code, and participating in key architectural discussions with product engineering and security teams to ensure new and existing services follow best practices and meet operational excellence standards.

If you are an experienced SRE Principal Engineer with a strong SRE mindset, passionate about high availability, security, automation, and cost efficiency, we would love to hear from you.

WHAT YOU'LL DO
Key Responsibilities

Cloud Strategy & Roadmap

  • Help define and implement the cloud roadmap and strategy to drive scalability, reliability, security, and cost efficiency.

  • Lead and contribute to cloud adoption initiatives, ensuring alignment with business objectives.

  • Provide technical leadership and expertise on cloud governance, architectural best practices, and modernization strategies.

Incident Response & Operational Excellence

  • Participate in and help refine incident management processes for the SRE team, ensuring minimal downtime and fast recovery.

  • Collaborate with Engineering and other teams to define SLOs, SLIs, and error budgets to drive system reliability.

  • Participate in post‑mortems and root cause analysis to prevent recurring issues.

Engineering Leadership & Code Review

  • Approve merge and pull requests, ensuring high‑quality, scalable, and secure infrastructure code.

  • Mentor and upskill the junior members of the SRE team, fostering a culture of continuous learning.

  • Participate in architecture discussions with product engineering teams for onboarding new services, ensuring they are scalable, cost‑optimized, and aligned with best engineering practices.

  • Collaborate with software developers to optimize application performance and cloud‑native designs.

Automation & Reliability Engineering

  • Develop Infrastructure as Code (IaC) using Terraform, CloudFormation, or AWS CDK for fully automated provisioning and deployment.

  • Implement self‑healing, fault‑tolerant architectures that can automatically recover from failures.

  • Optimize infrastructure monitoring and observability using Prometheus, Grafana, Loki, Tempo, Mimir, AWS CloudWatch, AWS CloudTrail, and New Relic.

Security, Compliance, and Best Practices

  • Ensure cloud security best practices are embedded into all solutions, including IAM policies, VPC security, encryption, and compliance with industry standards (such as SOC 2, HIPAA).

  • Implement least privilege access, network segmentation, and automated security controls across AWS services.

  • Collaborate with InfoSec teams to enforce threat detection, logging, and security monitoring using tools such as AWS GuardDuty, Security Hub, CloudTrail, ReliaQuest GreyMatter, and Google Chronicle.

AWS Cost Optimization & FinOps

  • Continuously monitor and optimize AWS infrastructure costs using AWS Cost Explorer, Trusted Advisor, and Savings Plans/Reserved Instances.

  • Drive FinOps culture, ensuring teams design and deploy cost‑efficient cloud solutions.

  • Implement auto‑scaling, rightsizing strategies, and storage lifecycle policies to reduce costs.

Solution Architecture & Infrastructure Design

  • Design and build highly available, scalable, and fault‑tolerant AWS architecture using AWS services such as EC2, S3, RDS, DocumentDB, Lambda, EKS, Secrets Manager, SSM, API Gateway, and CloudFront, and other related technologies such as HashiCorp Terraform, Vault and Consul…
