Senior Site Reliability Engineer
Oregon, Lucas County, Ohio, 43616, USA
Listed on 2026-02-16
-
Software Development
At Shippo, our vision is bold and clear:
we are the shipping layer of the internet. Our mission is to make every merchant successful through excellent shipping, delivering world-class logistics technology and infrastructure. We’re building the backbone of global e-commerce — connecting merchants to carriers worldwide through a single API and intuitive dashboard.
As a remote-first and globally distributed team
, we believe flexibility fuels trust, autonomy, and performance. Our diverse perspectives — across continents, cultures, and time zones — drive our innovation and enable us to build solutions used by businesses everywhere. We invest in modern, scalable technology so our teams can build, ship, and iterate with confidence.
Your impact starts here: every person at Shippo plays a direct role in shaping the infrastructure that powers global commerce and makes shipping simpler for businesses around the world.
How we will deliver success togetherAs a Senior Site Reliability Engineer (SRE) on our team, you will leverage platform engineering principles to ensure that Shippo's services are reliable, scalable, and performant. You will be a hybrid software development and operations engineer, responsible for designing, building, and maintaining the infrastructure that supports our applications. Your work will directly impact our ability to meet and exceed SLAs, and you will collaborate closely with other engineering teams to create services that are automatable, measurable, and resilient to failure.
- Design, scale, and secure infrastructure to stay ahead of business needs through fault-tolerant architecture design, performance testing, profiling, and tuning, and capacity planning
- Design, build, deploy, and maintain automation, monitoring, and alerting systems, as well as design, implement, and test disaster recovery solutions
- Ensure scalability and maintainability through microservices adoption, decoupling of concerns and data model, queuing of jobs and application layering
- Enhance and maintain our CI/CD pipeline for smooth and safe production releases via automated testing and verification
- Verify and ensure performance and correctness of systems in response time and throughput
- Participate in peer reviews and testing and contribute to automated test suites and in design reviews for new features, products, and systems
- Participate in an on-call rotation
- Experience developing, managing and troubleshooting highly available distributed systems, including operational experience with Kubernetes in a production environment
- Extensive expertise with at least one public cloud provider (AWS, GCP, Azure)
- Exceptional verbal, written, and interpersonal communication skills
- Interest in and understanding of best-in-class security practices, and automation and testing methods
- Familiarity with configuration and maintenance of common infrastructure components such as Redis, Elasticsearch, and Hadoop
- Deep understanding of customer needs and passion for customer success
- BS or MS degree in Computer Science or equivalent experience
- Advanced knowledge of managing and optimizing Postgresql server configuration
- 3+ years of experience in software development
- E xperience with:
- Defining and monitoring Service-Level Objectives (SLOs) and Service-Level Agreements (SLAs) to ensure that systems meet reliability and performance targets;
- Monitoring Tools like New Relic, Prometheus, Grafana and/or Datadog
- Open Telemetry knowledge for distributed tracing and metrics collection and experience on using it in production environments
- Managing Python and Golang applications in production
- Dev Ops tooling such as Docker, Terraform, ArgoCD, Argo Workflows, Circle
CI, Github Actions, New Relic, Pager Duty, etc - AWS/Cloud services such as EKS, EC2, S3, Lambda, Route 53, Cloud Front, Cloudflare, IAM, etc.
- Healthcare coverage for medical, dental, and vision (90% covered by the company, incl. dependents). Pets coverage is also available!
- Take-as-much-as-you-need vacation policy & flexible working hours
- One week-long company wide winter slow down
- WFH stipend to set up your home office
- Charity donation match up to $100
- Dedic…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).