×
Register Here to Apply for Jobs or Post Jobs. X

AI Engineer​/Cloud Engineer

Job in Guelph, Ontario, Canada
Listing for: SageBeans RPO
Full Time position
Listed on 2026-06-17
Job specializations:
  • IT/Tech
    AI Engineer (Applied/Software)
Salary/Wage Range or Industry Benchmark: 80000 - 100000 CAD Yearly CAD 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Position: AI Engineer / Cloud Engineer

AI Engineer / Cloud Engineer

Core Stack: AWS Bedrock Agent Core
• Azure AI Foundry
• MCP Governance

Function: Foundations / Infrastructure & Operations

Level: Senior / Staff

Type: Full Time

Location: Remote / Hybrid

Reports to: Director, AI Architecture

About the Role

The Foundations team serves as the enterprise AI governance control plane for Infrastructure & Operations (I&O), responsible for the infrastructure, observability, security, and policy layer across a multi‑cloud AI agent ecosystem.

We are seeking a senior AI/Cloud Engineer to design, build, and operate production‑grade AI agent infrastructure across Amazon Web Services Bedrock Agent Core and Microsoft AI Foundry, with deep integration across MCP (Model Context Protocol) connectors, LLM gateways, and enterprise data systems.

This role sits at the intersection of AI platform engineering, cloud infrastructure, governance, and enterprise security. You will partner closely with the I&O Architecture Engineering team to ensure AI agents are observable, measurable, secure, and fully governed across the enterprise estate.

Key Responsibilities
1. Agent Infrastructure & Platform Engineering
  • Design and deploy production AI agent workloads on AWS Bedrock Agent Core, including runtime configuration, memory stores, and Data Dog observability instrumentation.
  • Build and maintain Azure AI Foundry agent pipelines with Application Insights telemetry, Azure APIM‑based token attribution, and Azure Monitor integration for safety and red‑teaming signals.
  • Architect MCP connector infrastructure, including tool‑call routing, RBAC enforcement, OAuth2 / Entra , and end‑to‑end audit logging.
  • Maintain and evolve the enterprise LLM gateway as the centralized routing, policy enforcement, and instrumentation layer across Bedrock, Azure OpenAI, and Claude‑based endpoints.
2. Governance, Security & Observability
  • Instrument agent systems to capture Tier 1 audit KPIs such as tool‑call completeness, policy violations, RBAC coverage, and authentication failure rates aligned with compliance requirements.
  • Define per‑connector security policies ensuring Finance, HR, Legal, and Client data systems remain fully governed and least‑privileged.
  • Build unified observability dashboards across Cloud Watch, Azure Monitor, and Data Dog for AI system health and executive reporting.
  • Participate in CDR (Critical Design Review) processes for all AI agent deployments, ensuring adherence to reliability, security, and observability standards.
  • Design SOC 2‑aligned audit log pipelines for all agent tool‑calls to support compliance and forensic traceability.
3. Integration & Platform Interoperability
  • Integrate AI agent systems with enterprise platforms such as Service Now, Data Dog, Apptio/Cloudability, and internal data platforms via MCP connectors and REST APIs.
  • Support CI/CD automation for AI agents using Git Hub Actions, including environment promotion, rollback strategies, and pipeline replay mechanisms.
4. AI Quality & Evaluation
  • Define and instrument agent KPIs including task completion rate, hallucinated tool‑call detection, escalation rate, and context efficiency metrics.
  • Leverage AWS Agent Core evaluation frameworks and Azure AI Foundry evaluation tooling to assess groundedness, safety, tool accuracy, and reliability.
  • Build golden dataset regression suites to detect performance degradation across model updates, prompt changes, and connector modifications.
Required Qualifications Platform Experience (Must Have)
  • AWS Bedrock Agent Core (production workloads)
  • Azure AI Foundry (agent pipelines & evaluation systems)
  • AWS Lambda / EKS
  • Azure API Management (APIM)
  • Cloud Watch (metrics, logs, traces)
  • Azure Monitor + Application Insights
  • LLM Gateway architecture experience
  • MCP (Model Context Protocol) servers
  • OAuth2 / Entra  / IAM‑based credential scoping
Engineering Skills
  • 5+ years software/platform engineering experience; 2+ years in AI/ML infrastructure or LLM‑based systems
  • Strong proficiency in Python;
    Type Script/Node.js preferred for orchestration layers
  • Experience with REST APIs, async processing, and event‑driven architectures
  • Hands‑on CI/CD experience (Git Hub Actions preferred)
  • Infrastructure‑as‑Code and container‑based…
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary