Senior Software Engineer - Site Reliability
Listed on 2026-02-18
-
Software Development
Software Engineer
Senior Software Engineer - Site Reliability at Roblox
San, California, United States - Full Time
Start Date
Immediate
Expiry Date
06 Mar, 26
289460.0
Posted On
06 Dec, 25
Experience
2 year(s) or above
Remote Job
Yes
Telecommute
Yes
Sponsor Visa
No
Skills
Site Reliability Engineering, Software Development, Performance Testing, Chaos Experimentation, Infrastructure Resiliency, Performance Monitoring, Observability Services, Problem Solving, Project Planning, Programming Languages, Go, C#, Java, Fault-Tolerance, Resilience, Collaboration
Software Development
DescriptionEvery day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators. At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device.
We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there. A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone. Are you a seasoned engineer with a passion for reliability and scalability?
We’re looking for exceptional Software Engineers to join the Reliability team this pivotal role, you will drive the evolution of our systems, ensuring they meet the highest standards of performance, reliability, and efficiency. You’ll collaborate with cross-functional teams to build robust infrastructure that supports our growth. If you have a track record of solving complex technical challenges, we want to hear from you.
Join us in shaping the future of our platform and delivering unparalleled value to our users. At Roblox, our vision is to achieve 1 billion daily active users. We believe this engineer will be instrumental in driving us towards that ambitious goal.
Create software and libraries that promote fault-tolerance and resilience.
Design and develop frameworks and tools to support performance testing, chaos experimentation, and improve infrastructure resiliency.
Develop and implement performance monitoring and observability services to proactively identify and understand infrastructure issues and platform degradations.
Experience:
you have a BS degree (or equivalent professional experience) in Computer Science or related engineering field with at least 3-4 years of experience with added advantage working in the Site Reliability space in SRE or Software Engineering Passion for systems:
You have experience and good habits around building software and tools and getting them adopted. Your system's focus informs a view of code needing to be deeply reliable.
Prior experience developing, deploying and maintaining LLM-based agents or RAG systems in production is a plus.
A Partner:
You know that the best tools integrate broadly with the tooling ecosystem. You approach partners and processes with curiosity and seek to understand a problem deeply before you start coding.
A Coder: you have experience writing common programming languages ( Go, C#, Java…).Self-organized: you're excited about getting in front of complex problems, organizing your work by any means possible; overcome emergent issues and contributing to long-running projects as a part of the team.
Problem Solver: you ask the right questions to solve issues within your expertise and you use data to test your theories.
Planner - You have experience in large project life cycles. You have experienced working in sprints, breaking down complex tasks into milestones, and reporting status to keep project scheduling accurate.
For roles that are based at our headquarters in San Mateo, CA:
The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).