ML Ops Engineer
Listed on 2026-02-12
-
IT/Tech
AI Engineer, Machine Learning/ ML Engineer, Data Engineer, Data Scientist
A Little About Us
Zelis is modernizing the healthcare financial experience across payers, providers, and healthcare consumers. We serve more than 750 payers, including the top five national health plans, regional health plans, TPAs and millions of healthcare providers and consumers across our platform of solutions. Zelis sees across the system to identify, optimize, and solve problems holistically with technology built by healthcare experts – driving real, measurable results for clients.
ALittle About You
You bring a unique blend of personality and professional expertise to your work, inspiring others with your passion and dedication. Your career is a testament to your diverse experiences, community involvement, and the valuable lessons you've learned along the way. You are more than just your resume; you are a reflection of your achievements, the knowledge you've gained, and the personal interests that shape who you are.
Position OverviewThe MLOps Engineer will work closely with the Data Science, Analytics, and Data Engineering & Services teams. This position will lead efforts in supporting Generative AI, traditional ML, and Advanced Analytics initiatives. The role emphasizes automation, security, and scalability across the ML lifecycle, including modern containerization and orchestration practices.
Essential Duties & Functions- Build and maintain monitoring infrastructure for conventional machine learning models, with capabilities for performance tracking, drift detection, and alerting.
- Research, evaluate, and implement monitoring strategies and tools for Generative AI systems, including LLMs and Agentic AI architectures.
- Collaborate with ML Engineers, Data Scientists, and Dev Ops teams to deploy, manage, and monitor models in production.
- Develop and support scalable, secure, and automated data pipelines using Snowflake, SQL, and Python for training, serving, and monitoring ML and GenAI models.
- Leverage AutoML tools and frameworks (e.g., MLflow, Kubeflow, Sage Maker Autopilot) to streamline experimentation and deployment.
- Design dashboards and reporting systems to visualize model health metrics and surface key operational insights.
- Ensure auditability, reproducibility, and compliance for model performance and data flow in production environments, with consideration for regulatory standards like GDPR and HIPAA.
- Maintain CI/CD workflows and version-controlled codebases (e.g., Git) for ML infrastructure and pipelines.
- Utilize containerization and orchestration technologies (e.g., Docker) to manage scalable ML infrastructure.
- Leverage tools such as Streamlit and Python visualization libraries to present insights from model and data monitoring.
- Perform root cause analyses on model degradation or data quality issues, and proactively implement improvements.
- Stay current on industry developments related to ML observability, model governance, responsible GenAI practices, and AI security.
- Contribute to analytics projects and data engineering initiatives as needed.
- Provide off-hours support for critical deployments or urgent data/model issues.
- 2–5 years of experience in ML Ops, ML Engineering, or a related role with a focus on production-level model monitoring, automation, and deployment.
- Strong experience with ML observability tools or custom-built monitoring systems.
- Experience with monitoring LLMs and Generative AI models, including prompt evaluation, hallucination tracking, and agent behavior auditing.
- Experience in deploying and managing ML workloads using containerization and orchestration platforms such as Docker, Kubernetes, Kubeflow, or Tensor Flow Extended.
- Familiarity with AutoML pipelines and workflow management tools (e.g., MLflow, Sage Maker Autopilot).
- Experience working in cloud environments, preferably AWS (e.g., Sage Maker, S3, Lambda, ECS/EKS).
- Understanding of ML lifecycle tools (e.g., MLflow, Sage Maker Pipelines) and CI/CD practices.
- Strong security and compliance awareness, particularly related to model/data governance (e.g., HIPAA, GDPR).
- Proficiency in Python and key data libraries (Pandas, Numpy, Matplotlib, etc.).
- Advanced SQL skills and experience…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).