Senior Lead Reliability Engineer
Listed on 2026-05-29
Software Development
Role Summary
We are evolving our Reliability Engineering team to move beyond support and operations. As a Senior Engineer in Site Reliability, you will be part of a diverse and inclusive organization that has full ownership of the availability, performance, and scalability of one of the most critical shared services at LSEG.
You will maintain Service Level Objectives for the systems you own, constantly measuring and improving availability, latency, and overall system health. You will write automation to scale systems sustainably, prevent service issues, and quickly restore service when issues occur. You will also partner with development teams to improve system reliability, observability, and release velocity.
You will participate in on‑call rotations, incident response, post‑mortems, and root cause analysis and resolution, and advocate strong engineering practices that allow us to build, deploy, and run scalable, reliable, and performant services.
Responsibilities
- Lead project priorities, deadlines, and outcomes.
- Utilize deep knowledge of site reliability, software engineering, programming languages, tooling, frameworks, infrastructure and systems for each task.
- Lead designs of software components, systems, and features to improve the availability, scalability, latency, and efficiency of LSEG's services.
- Lead sustainable incident response and production improvements.
- Provide mentorship and advice to team members on availability and performance of critical services, build automation to prevent problem recurrence, and build automated responses for non‑exceptional service conditions.
- Mentor and train team members on design techniques and coding standards, cultivating innovation and collaboration.
- Write and review highly optimised and accurate code for LSEG products and solutions, and provide feedback and suggested improvements to team members.
- Partner with architects to decompose solutions for technology systems and products.
- Proactively build and apply relevant domain knowledge that may relate to workflows, data pipelines, business policies, configurations and constraints.
- Support essential processes while ensuring high quality standards are met.
Qualifications
- A Bachelor's degree in computer science, a related technical field involving software/systems engineering, or equivalent practical experience.
- Experience with object‑oriented programming languages such as Java, C#, Python, or Go.
- Experience with Unix/Linux and Windows operating systems.
- Hands‑on experience with one of the following cloud platforms: Azure, AWS, or GCP.
- Understanding of DevOps concepts and working style.
- Experience with algorithms and data structures.
- Observability practices with logging, metrics, tracing, and alerting.
- Infrastructure as Code.
- Understanding of identity and access management, and application security.
We use Datadog and BigPanda for our observability stack, Terraform for our cloud infrastructure, and Entra ID as our IAM solution, but we are open to incorporating your experience with other tools.
Does this sound like a challenge you'd be interested in taking on?
LSEG is an equal opportunities employer and does not discriminate on the basis of race, religion, colour, national origin, gender, gender identity, sexual orientation, marital status, age, disability or any other protected class. We can accommodate reasonable religious practices and physical ability requirements as required by applicable law.