Site Reliability Engineer, Infrastructure and Assurance Services - USDS
Listed on 2026-04-23
IT/Tech
Cloud Computing, SRE/Site Reliability, Systems Engineer
The Infra SRE‑Infrastructure‑Assurance team extends TikTok infrastructure's operability, observability, visibility, and automation. We aim to provide holistic insights and solutions for TikTok infrastructure with minimal manual intervention, and we value transparency, collaboration, hard work, and innovation across a fast‑growing team. Our focus is on long‑term strategies rather than short‑term gains, tackling large‑scale, complex issues with fearless curiosity.
Our organization follows a hybrid work schedule that requires employees to work in the office three days a week, or as directed by their manager or department. We regularly review this model and the specific requirements may change.
Responsibilities
- Perform SRE duties and operations on supported services in production, including on‑call rotations, maintenance, change management, monitoring, incident response, capacity planning, and disaster recovery.
- Maximize system uptime, availability and stability to meet functional and performance SLAs.
- Contribute to existing documentation and build effective runbooks, SOPs, SLAs/SLOs.
- Initiate and lead scripting, tooling, and automation to streamline processes and minimize manual effort.
- Work cross‑functionally and regionally with SRE, Dev, QA, and PM teams to handle incidents and improve processes.
- Manage and prioritize tasks and projects for high productivity and precise deliveries.
Qualifications
- Bachelor's degree in Computer Science, a related field, or equivalent practical experience.
- Demonstrated experience in software development with one or more programming languages.
- Experience with Linux operating systems, networking, database concepts, monitoring, and shell scripting.
- Superb analytical ability, problem‑solving, and critical thinking skills.
- Excellent communicator, team player, self‑starter, and fast learner.
Preferred Qualifications
- Master's degree in Computer Science, Engineering, or a related field.
- Proficiency in Python, Go, or C++.
- Expertise in SRE philosophy, AIOps, APM, or disaster recovery.
- Experience with Kubernetes, Elasticsearch, ClickHouse, message queues, OpenTSDB, and service mesh.
U.S. Data Security (“USDS”) is a subsidiary of TikTok focused on protecting sensitive data and ensuring compliance with security‑first governance. Our work centers on oversight and protection of the TikTok platform and U.S. user data, enabling millions of Americans to engage safely and confidently.
Data Security Statement
This role requires the ability to work with and support systems designed to protect sensitive data and information. As such, this role will be subject to strict national security‑related screening.
USDS Reasonable Accommodation
USDS is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs, or other protected reasons. If you need assistance or a reasonable accommodation, please reach out to us at
Job Information
Compensation:
The base salary range for this position in the selected city is $112,725 – $177,840 annually. Compensation may vary outside this range depending on a candidate’s qualifications, skills, and experience. Base pay is one part of the total package, which may include discretionary bonuses, incentives, and restricted stock units.
Benefits:
Employees have access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short‑term and long‑term disability coverage, life insurance, wellbeing benefits, 10 paid holidays, 10 paid sick days, and 17 days of paid personal time (prorated on hire with increasing accruals by tenure).
Interview tip:
Referrals double your chances of interviewing at TikTok.