Engineer, AI/ML
Listed on 2026-04-21
-
IT/Tech
AI Engineer, Machine Learning/ ML Engineer
Corporation's corporate functions. Working in close partnership with the Data Platform & Analytics Engineer, this role takes curated, AI-ready data and transforms it into production-grade intelligent systems. The ideal candidate brings hands‑on experience in large language model orchestration, agentic workflow design, and the full lifecycle of deploying and maintaining AI solutions in cloud environments. This is a highly technical individual contributor role at the intersection of applied AI and enterprise automation.
EssentialFunctions
- AI Agent Design & Development
- Design and build AI agents and multi‑agent systems using Snowflake Cortex AI, Snowflake ML, Amazon Bedrock, Lang Graph, Lang Chain, Auto Gen, and CrewAI.
- Develop agentic workflows that automate business processes across corporate functions.
- Translate business requirements into robust, maintainable agent architectures.
- Implement prompt engineering strategies, tool use, and memory patterns for production‑grade agents.
- Model Development, Fine‑Tuning & Evaluation
- Develop, fine‑tune, and evaluate ML models using Snowflake ML, AWS Sage Maker, and Bedrock foundation models.
- Design evaluation frameworks to measure model accuracy, reliability, and alignment with business objectives.
- Select appropriate foundation models and orchestration strategies based on use‑case requirements.
- Manage model versioning, experimentation tracking, and performance benchmarking.
- Production Deployment & Infrastructure
- Deploy AI and agent workloads to AWS infrastructure including ECS and S3.
- Build CI/CD pipelines for model and agent deployment, ensuring reliable and repeatable release processes.
- Manage containerized AI workloads and ensure high availability and scalability in production.
- Leverage Snowflake AI tooling and Cortex capabilities to power data‑driven AI features.
- Monitoring, Maintenance & Reliability
- Monitor production AI systems for drift, degradation, and anomalous behavior.
- Own incident response and root cause analysis for AI and agent failures in production.
- Implement logging, observability, and alerting frameworks across all deployed AI solutions.
- Continuously improve agent performance based on production feedback and stakeholder input.
- Data Collaboration & AI Readiness
- Partner closely with a Data Platform & Analytics Engineer to ensure curated data layers meet AI consumption requirements.
- Define feature requirements, data contracts, and schema standards needed for agent and model development.
- Provide inputs on data architecture decisions that impact AI workload performance.
- Governance, Security & Responsible AI
- Ensure all AI solutions adhere to Carnival Corporation’s AI governance standards and responsible AI principles.
- Apply data privacy controls and access management within AI pipelines.
- Document agent architectures, model cards, and deployment runbooks to support audit and compliance requirements.
The AI / ML Engineer operates with a high degree of independence in the design and execution of AI and agentic solutions. The role holds decision‑making authority over agent architecture, model selection, prompt engineering strategy, and the deployment patterns used to bring AI capabilities into production. Day‑to‑day responsibilities span the full AI development lifecycle: from requirements analysis and prototype development through to production deployment, monitoring, and ongoing maintenance.
The role works closely with the AI Product Owner to translate business use cases into engineering deliverables, and partners with the Data Platform & Analytics Engineer to ensure data readiness for AI workloads. Core capabilities include complex problem‑solving at the intersection of data, AI, and business operations; designing reliable multi‑agent systems; ensuring AI outputs meet governance and compliance standards; diagnosing production AI issues;
applying systematic experimentation and evaluation; and balancing trade‑offs between speed, robustness, scalability, and maintainability. The individual should influence outcomes through technical leadership, provide clear guidance to stakeholders, advocate for responsible AI, and lead incident response &…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).