×
Register Here to Apply for Jobs or Post Jobs. X

Senior DevOps Engineer​/Site Reliability Engineer

Job in Vancouver, BC, Canada
Listing for: PokerLab
Full Time position
Listed on 2026-02-14
Job specializations:
  • IT/Tech
    SRE/Site Reliability, Cloud Computing
Salary/Wage Range or Industry Benchmark: 80000 - 100000 CAD Yearly CAD 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Position: Senior DevOps Engineer / Site Reliability Engineer

Join our innovative Vancouver-based company as we reimagine the future of online poker through a revolutionary Poker-as-a-Service platform. Our mission is to help iGaming operators attract and retain customers through poker gameplay. We are seeking a highly skilled Senior Dev Ops Engineer / Site Reliability Engineer to take ownership of our production cloud infrastructure, ensuring our platform is secure, scalable, and highly available.

This role offers the opportunity to work closely with back-end engineers, security, and product leadership to build and operate the systems that power real-time poker gameplay ’ll inherit an existing production environment and elevate it through automation, observability, operational maturity, and reliability best practices.

Responsibilities
  • Own and evolve our production environment, building on existing infrastructure and processes to improve stability, scalability, and security.
  • Design and maintain cloud infrastructure (networking, compute, storage, IAM) to support a real-time, highly available gaming platform.
  • Operate and improve our Kubernetes/container platform, including upgrades, scaling, cluster reliability, and operational tooling.
  • Build and maintain robust CI/CD pipelines to support safe, frequent deployments (rollback strategies, progressive delivery where appropriate).
  • Implement and expand Infrastructure as Code (Terraform/Pulumi, Helm, Git Ops workflows) to ensure reproducible, auditable environments.
  • Establish strong observability (metrics, logging, tracing, alerting, dashboards) and drive SLO/SLI practices to keep signal high and noise low.
  • Lead incident response and post-incident learning: runbooks, postmortems, root-cause analysis, and preventative remediation.
  • Drive performance and capacity planning, including load testing strategies, autoscaling policies, cost optimization, and tuning.
  • Strengthen security posture: secrets management, least-privilege IAM, vulnerability management, network segmentation, and secure SDLC practices.
  • Own and test backup, disaster recovery, and business continuity plans (RPO/RTO definition, rehearsal, automation).
  • Mentor engineers on operational excellence and best practices; help raise the team’s production readiness and reliability culture.
Requirements
  • 6+ years of Dev Ops / SRE / Platform Engineering experience with hands-on production ownership.
  • Strong experience designing and operating cloud infrastructure (AWS, GCP, Azure), including networking and IAM.
  • Strong experience running Kubernetes (or equivalent container orchestration) in production environments.
  • Proficiency with Infrastructure as Code (Terraform and/or Pulumi), plus modern deployment tooling (Helm, Git Ops, etc.).
  • Experience building and improving CI/CD pipelines, release processes, and deployment strategies.
  • Strong Linux fundamentals; proficiency scripting in Bash/Python/Go (or similar) for automation and tooling.
  • Proven ability to troubleshoot production issues across distributed systems (latency, scaling, networking, dependency failures).
  • Experience implementing monitoring/alerting and operational practices (on-call readiness, postmortems, runbooks).
  • Excellent communication and collaboration skills; able to work cross-functionally and lead operational initiatives.
Nice to Haves
  • Experience operating real-time / multiplayer / gaming systems.
  • Experience with multi-region or high-availability architecture patterns.
  • Familiarity with streaming/messaging systems (e.g., NATS, Kafka, Rabbit

    MQ).
  • Hands-on experience with Google Cloud Platform (GCP).
  • Hands-on experience with AWS (in addition to or beyond primary cloud experience).
  • Experience working in compliance-heavy environments (SOC 2, ISO 27001, regulated gaming).
Why Join Us?
  • Be part of a fast-paced, creative environment with the chance to reimagine the future of online poker.
  • Own the systems that keep real-time poker gameplay reliable, secure, and scalable—where your impact is immediate and measurable.
  • Collaborate with a talented team building the next generation of poker experiences.
  • Flexible work environment, with opportunities for personal and professional growth.
#J-18808-Ljbffr
Position Requirements
10+ Years work experience
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary