Lead DevOps/SRE Engineer
Listed on 2026-05-17
-
IT/Tech
SRE/Site Reliability
Overview
Launch Potato is a profitable digital media company that reaches over 30M+ monthly visitors through brands such as Finance Buzz, All About Cookies, and Only In Your State .
Why Join UsAt Launch Potato, you’ll accelerate your career by owning outcomes, moving fast, and driving impact with a global team of high-performers.
Base Salary$160,000 to $190,000 per year, paid semi‑monthly.
Must Haves- 5+ years of production AWS infrastructure experience with deep Terraform expertise.
- Hands‑on experience building the SRE function from scratch with complete ownership.
- Experience with a multi‑site company where PaaS or microservices are required.
- CI/CD pipeline ownership in one or more previous roles.
- Pager Duty experience and standing up an on‑call rotation.
5+ years hands‑on with AWS, Terraform, CI/CD pipeline ownership, and SRE tooling (Open Telemetry, Grafana, Pager Duty or equivalent) in a production environment.
Your RoleOwn and evolve Launch Potato’s cloud infrastructure, CI/CD platform, and compliance posture. Build the SRE function from the ground up so product teams can ship faster without compromising reliability, security, or cost control.
Outcomes- Stand up the SRE practice from scratch: on‑call rotation, Pager Duty configuration, SLA/SLO definitions for core infrastructure services, runbook library, and observability dashboards that tie site performance to business metrics.
- Complete the AWS multi‑account migration: move production workloads to an isolated account with zero unplanned downtime.
- Deliver SOC 2 Type I audit‑ready infrastructure evidence package: own the technical controls implementation end‑to‑end.
- Version and publish the Terraform module library (30+ modules) to a private registry to eliminate ad hoc git consumption by product teams.
- Implement automated deployment rollback for ECS and Lambda: gate production on integration test passage.
- Stand up monthly cost reporting to leadership: budget anomaly detection, savings plan recommendations, spend by service/team/environment.
- Ownership orientation:
You don’t wait to be assigned a problem. If something is broken, undocumented, or a risk, you flag it and fix it. - Documentation discipline:
You write runbooks, decision rationales, architecture patterns, incident post‑mortems so the next person can understand your work without asking you. - Cost consciousness:
You think about the business impact of infrastructure decisions and can explain spending anomalies to a CFO in plain language. - Calm under pressure:
Production incidents happen. You triage clearly, communicate proactively with technical and non‑technical stakeholders, and run a tight post‑mortem without blame. - Cross‑functional communication:
You can work with product engineers, legal/compliance, and executive leadership in the same week without switching communication modes awkwardly. - Proactive reliability: A good SRE reacts to outages. A great SRE catches degradation before it becomes an outage by building alerting against patterns, not just failures.
Base salary is set according to market rates for the nearest major metro and varies based on Launch Potato’s Levels Framework. Your compensation package includes a base salary, profit‑sharing bonus, and competitive benefits. Launch Potato is a performance‑driven company, which means once you are hired, future increases will be based on company and personal performance, not annual cost‑of‑living adjustments.
EEO StatementWe are proud to be an Equal Employment Opportunity company. We value diversity, equity, and inclusion. We do not discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).