Senior Site Reliability Engineer - Data Infrastructure
Listed on 2025-12-27
-
IT/Tech
Cloud Computing, Systems Engineer, SRE/Site Reliability
Senior Site Reliability Engineer - Data Infrastructure
Location:
Seattle
Team:
Technology
Employment Type:
Regular
Job Code: A32035
ResponsibilitiesTeam Introduction:
Our Site Reliability Engineering (SRE) team blends software and systems engineering to build and operate large‑scale data infrastructure with high reliability and efficiency. We provide a dependable cloud environment that powers our global business. In this role, you will leverage your expertise in data center architecture, data infrastructure services, and systems and tools development to solve complex scaling and reliability challenges.
We’re looking for a Sr SRE who can provide deep technical leadership, drive architectural improvements, and collaborate effectively across multiple organizations. You’ll partner with engineering, product, data, and infrastructure teams to deliver resilient, scalable platforms. This is a highly technical, hands‑on role that requires strong problem‑solving ability, clear communication, and the ability to influence without formal authority.
- Strong hands‑on skills in the design, development, and operation of large‑scale cloud infrastructure and distributed systems.
- Collaborate with cross‑functional teams (e.g., Advertising, Machine Learning, E‑commerce, and Core Infra) to drive system reliability, performance, and scalability.
- Lead initiatives to automate operations, eliminate toil, and improve overall system efficiency.
- Troubleshoot complex production issues, perform root‑cause analysis, and drive long‑term reliability improvements.
- Promote best practices in system design, observability, performance optimization, and cost efficiency.
- Communicate complex technical concepts effectively to both technical and non‑technical stakeholders.
Minimum Qualifications:
- 5+ years of experience in Site Reliability Engineering, Software Development, or related fields, with a strong focus on designing, building, scaling, and operating cloud‑based systems.
- Deep hands‑on expertise in at least one of the following areas:
Databases (SQL/No
SQL), Kubernetes or container orchestration, Big Data processing and storage systems (streaming and batch), strong knowledge of system architecture, distributed systems, and performance bottlenecks. - Excellent communication and collaboration skills, with experience working across engineering, product, and data science teams.
Preferred Qualifications:
- Proven track record of driving automation, tooling, and process improvements that enhance reliability and efficiency.
- Experience in cost optimization and performance tuning at scale, backed by data‑driven decision making.
- Thought leadership in adopting new technologies, improving operational practices, and influencing system design.
The base salary range for this position in the selected city is $177,688 - $341,734 annually.
Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.
BenefitsBenefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short‑term and long‑term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).
The Company reserves the right to modify or change these benefits programs at any time, with or without notice.
For Los Angeles County (unincorporated) Candidates:Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:
- Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;
- Appropriately handling and managing confidential information including proprietary and trade secret information;
- Exercising sound judgment.
Byte Dance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).