Senior DevOps Engineer/Site Reliability Engineer Job Vancouver area,BC Canada,IT/Tech

Position: Senior DevOps Engineer / Site Reliability Engineer

Join our innovative Vancouver-based company as we reimagine the future of online poker through a revolutionary Poker-as-a-Service platform. Our mission is to help iGaming operators attract and retain customers through poker gameplay. We are seeking a highly skilled Senior Dev Ops Engineer / Site Reliability Engineer to take ownership of our production cloud infrastructure, ensuring our platform is secure, scalable, and highly available.

This role offers the opportunity to work closely with back-end engineers, security, and product leadership to build and operate the systems that power real-time poker gameplay ’ll inherit an existing production environment and elevate it through automation, observability, operational maturity, and reliability best practices.

Responsibilities

Own and evolve our production environment, building on existing infrastructure and processes to improve stability, scalability, and security.
Design and maintain cloud infrastructure (networking, compute, storage, IAM) to support a real-time, highly available gaming platform.
Operate and improve our Kubernetes/container platform, including upgrades, scaling, cluster reliability, and operational tooling.
Build and maintain robust CI/CD pipelines to support safe, frequent deployments (rollback strategies, progressive delivery where appropriate).
Implement and expand Infrastructure as Code (Terraform/Pulumi, Helm, Git Ops workflows) to ensure reproducible, auditable environments.
Establish strong observability (metrics, logging, tracing, alerting, dashboards) and drive SLO/SLI practices to keep signal high and noise low.
Lead incident response and post-incident learning: runbooks, postmortems, root-cause analysis, and preventative remediation.
Drive performance and capacity planning, including load testing strategies, autoscaling policies, cost optimization, and tuning.
Strengthen security posture: secrets management, least-privilege IAM, vulnerability management, network segmentation, and secure SDLC practices.
Own and test backup, disaster recovery, and business continuity plans (RPO/RTO definition, rehearsal, automation).
Mentor engineers on operational excellence and best practices; help raise the team’s production readiness and reliability culture.

Requirements

6+ years of Dev Ops / SRE / Platform Engineering experience with hands-on production ownership.
Strong experience designing and operating cloud infrastructure (AWS, GCP, Azure), including networking and IAM.
Strong experience running Kubernetes (or equivalent container orchestration) in production environments.
Proficiency with Infrastructure as Code (Terraform and/or Pulumi), plus modern deployment tooling (Helm, Git Ops, etc.).
Experience building and improving CI/CD pipelines, release processes, and deployment strategies.
Strong Linux fundamentals; proficiency scripting in Bash/Python/Go (or similar) for automation and tooling.
Proven ability to troubleshoot production issues across distributed systems (latency, scaling, networking, dependency failures).
Experience implementing monitoring/alerting and operational practices (on-call readiness, postmortems, runbooks).
Excellent communication and collaboration skills; able to work cross-functionally and lead operational initiatives.

Nice to Haves

Experience operating real-time / multiplayer / gaming systems.
Experience with multi-region or high-availability architecture patterns.
Familiarity with streaming/messaging systems (e.g., NATS, Kafka, Rabbit

MQ).
Hands-on experience with Google Cloud Platform (GCP).
Hands-on experience with AWS (in addition to or beyond primary cloud experience).
Experience working in compliance-heavy environments (SOC 2, ISO 27001, regulated gaming).

Why Join Us?

Be part of a fast-paced, creative environment with the chance to reimagine the future of online poker.
Own the systems that keep real-time poker gameplay reliable, secure, and scalable—where your impact is immediate and measurable.
Collaborate with a talented team building the next generation of poker experiences.
Flexible work environment, with opportunities for personal and professional growth.

#J-18808-Ljbffr


Increase/decrease your Search Radius (miles)



Job Posting Language

Senior DevOps Engineer​/Site Reliability Engineer

Senior DevOps Engineer/Site Reliability Engineer