SENIOR SUSTAINING ENGINEER
Team: Engineering
Location: Ontario, Canada
Employment status: Full-time, Permanent
Reporting to: Client Programme Director
Working week: Monday-Friday
Hyperlayer is a B2B product company, building a platform to revolutionise the world of payments. Hyperlayer transforms digital wallets into smart wallets that change the way people, and businesses, spend money.
Wallets powered by Hyperlayer let consumers withdraw money from multiple accounts with a single tap, for a specific purpose.
Our platform connects specific accounts with targeted brand spending and gives merchants and customers access to cutting‑edge rewards. Since spending is social, Hyperlayer also supports group saving, sharing and spending. All in one platform.
Hyperlayer is working with large retailers and financial institutions. Our product introduces innovation to these industries to create solutions: helping consumers, families, and groups improve their financial wellbeing by making their money go further.
About the role
As a small, agile company, we thrive on collaboration, innovation, and accountability.
As a Senior Sustaining Engineer on our Production Support team, you’ll be an essential part of maintaining the stability and reliability of our fintech platform's B2B, B2C, and SaaS production environments. You’ll lead the technical response to complex incidents, work closely with cross‑functional and global teams to resolve recurring issues, and contribute to the development of customer‑requested features.
With a focus on improving software maintainability, system resilience, and meeting contractual SLAs, you’ll mentor engineers across the team and drive continuous improvement in our ITILv4‑aligned processes and codebase. This role supports our follow‑the‑sun 24x7 model, with on‑call rotations across UK, US/Canada, and NZ/India teams.
This position operates during standard business hours (Monday–Friday). The role also includes participation in an out‑of‑hours on‑call rotation, with additional compensation provided in accordance with company policy. While we collaborate closely with global teams (UK, US/Canada, NZ/India) in a follow‑the‑sun model, primary work hours remain aligned to the weekday schedule.
Key Responsibilities
- Act as a primary responder in a 24x7 on‑call rotation for high‑priority incidents, ensuring fast acknowledgment (MTTA targets) and resolution to minimize customer impact in our event‑driven fintech platform.
- Conduct root‑cause analysis (RCA) for complex issues, collaborating closely with development teams to implement robust solutions and deliver RCAs within 5 business days for Sev1/Sev2 incidents.
- Lead the development and deployment of small, customer‑facing features and improvements, ensuring alignment with business needs and system requirements while maintaining a change success rate of ≥99%.
- Work with mid‑ and junior‑level engineers, providing guidance in incident response, troubleshooting best practices, and coding standards within a global rota, including handovers and knowledge sharing via tools like Rootly.
- Take ownership of software maintainability initiatives, identifying and implementing optimizations that improve system performance and help achieve availability of ≥99.99% (four nines).
- Participate in regular post‑incident reviews (blameless retros), documenting lessons learned and suggesting improvements to incident response processes and runbooks for our technology stack.
- Collaborate with the infrastructure team to monitor system health and proactively identify areas for improvement in stability and efficiency using tools like Datadog, Rootly, and CloudWatch/AppDynamics.
- Bachelor's degree in Computer Science, Engineering, or a related field.
- Minimum of 5 years of experience in sustaining engineering, DevOps, or software engineering with a focus on incident response and system reliability in fintech or regulated environments.
- Advanced troubleshooting skills and experience with Golang (preferred), Java, or similar languages, plus familiarity with event‑driven architectures (e.g., NATS/JetStream, Redis clustering).
- Strong…