×
Register Here to Apply for Jobs or Post Jobs. X

AI Platform Engineer - Inference & Agentic Systems

Job in Toronto, Ontario, M5A, Canada
Listing for: Paytm
Full Time position
Listed on 2026-06-04
Job specializations:
  • Software Development
    AI Engineer, Machine Learning/ ML Engineer
Job Description & How to Apply Below
Position: Staff AI Platform Engineer - Inference & Agentic Systems
About the Role
We are a small team of AI builders in Paytm Labs.

As a Staff AI Platform Engineer, you will work across inference and agentic systems. You willcontribute to Paytm's AI inference platform (Pi), serving internal teams and enterprise customers- running our own coding and domain-specific models (voice, vision, risk, fintech workflows) aswell as third-party models. You will also architect and build the platform that enablesautonomous AI agents to operate safely and reliably in production - the runtime, orchestration,and developer tooling for agents to reason, plan, use tools, and execute complex multi-stepworkflows, automating both software development and business processes.

You will work at the intersection of LLMs, distributed systems, and production fintechinfrastructure, helping define how inference and agentic AI are built and deployed acrosspayments, risk, fraud, collections, support, and developer experience.

What You'll Do

  • Inference & Model Serving
  • Build and operate multi-model serving across modalities (text, voice, code, vision) on shared infrastructure
  • Own the model lifecycle: download, deploy, serve, monitor, update, swap
  • Drive inference optimization: latency, throughput, cost - including quantization, batching, caching, and routing strategies
  • Ensure inference is fast and reliable for the agents and systems that depend on it
  • Agentic Systems
  • Architect and build the Agentic AI Platform - runtime infrastructure, orchestration systems, and developer tooling for autonomous agents
  • Design multi-agent coordination systems enabling agents to collaborate and solve complex workflows
  • Build robust tool-use infrastructure that allows agents to interact with APIs, databases, and services safely
  • Implement workflow automation: agents that execute multi-step business and engineering tasks with appropriate guardrails
  • Build safety and guardrail systems including permissioning, sandboxing, and human-in-the-loop workflows
  • Develop evaluation and observability frameworks to measure agent behaviour, detect regressions, and debug failures
  • Develop SDKs and APIs that allow internal teams to build and deploy agents quickly and safely
  • Platform & Technical Leadership
  • Define technical direction and architecture for agentic systems across the organization
  • Build patterns and standards for agent design, tool calling, and evaluation
  • Partner closely with ML, product, and security teams to deliver production-grade agent systems
  • Mentor engineers and contribute to best practices for agent system design
  • What You'll Bring

  • 8+ years of software engineering experience, with 3+ years in AI systems or LLM applications
  • Strong understanding of LLM-based agent architectures (ReAct, RAG, tool use, multi-agent systems)
  • Experience building highly reliable distributed systems
  • Proficiency in Python and experience working with modern LLM APIs or open-source models
  • Experience with or strong interest in model serving (vLLM, Tensor

    RT-LLM, Triton)
  • Understanding of distributed systems: task queues, event-driven architectures, state management
  • Experience with cloud platforms (AWS, GCP) and containerized deployments
  • Strong understanding of security risks in agentic systems (prompt injection, privilege escalation, data leakage)
  • Demonstrated experience leading complex technical initiatives
  • Strong written and verbal communication skills
  • Nice to Have

  • Experience building agentic systems in regulated industries (fintech, healthcare, enterprise)
  • Familiarity with Model Context Protocol (MCP) or agent communication standards
  • Experience with model fine-tuning, quantization, or LoRA
  • Experience building CI/CD automation and developer tooling
  • Experience adapting workflow orchestration systems (Temporal, Airflow, Prefect) for AI workloads
  • Experience with voice models, multimodal models, or edge inference
  • Experience designing human-in-the-loop or oversight systems
  • Interest in testing and verification for non-deterministic AI systems
  • Go Big or Go Home!Paytm Labs believes in diversity and equal opportunity and we will not tolerate any forms of discrimination or harassment. Our people are critical to our success and we know the more inclusive we are, the better our work will be. We thank all applicants, however, only those selected for an interview will be contacted. Paytm Labs is committed to meeting the accessibility needs of all individuals in accordance with the Accessibility for Ontarians with Disabilities Act (AODA) and the Ontario Human Rights Code (OHRC).

    Should you require accommodations during the recruitment and selection process, please let us know. We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
    Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
    To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
     
     
     
    Search for further Jobs Here:
    (Try combinations for better Results! Or enter less keywords for broader Results)
    Location
    Increase/decrease your Search Radius (miles)
    0
    200
    Filters
    Education Level
    Experience Level (years)
    Posted in last:
    Salary