×
Register Here to Apply for Jobs or Post Jobs. X

Machine Learning Operations

Job in Bengaluru, 560001, Bangalore, Karnataka, India
Listing for: IBU
Seasonal/Temporary position
Listed on 2026-07-03
Job specializations:
  • IT/Tech
    Cloud Computing: Infrastructure & Operations, Data Engineering, AI Engineer (Applied/Software), Machine Learning/ ML Engineer
Job Description & How to Apply Below
Location: Bengaluru

JOB DETAILS :
Role: ML Ops Engineer

Location:

Bangalore, India (Hybrid)
Type of Hiring:
Permanent

Job Description:

We are seeking a visionary and highly experienced MLOps Eng. to lead the design, development, and implementation of our enterprise-grade MLOps platform on Kubernetes. This pivotal role requires an individual with deep expertise across the entire ML lifecycle, from data ingestion and feature engineering to model training, deployment, monitoring, and governance. The MLOps Lead will be responsible for architecting a scalable, secure, multi-tenant, and compliant platform that empowers our data scientists and becomes a core product offering for our customers.

This individual will act as the technical lead, guiding cross-functional teams (Dev Ops, Data Science, Security) and setting the strategic direction for our MLOps ecosystem.

Responsibilities:
Strategic Platform Architecture:
Lead the architectural vision, design, and continuous evolution of the Platform, ensuring alignment with business objectives, security standards, and scalability requirements.
Drive the adoption and integration of open-source MLOps tools (Kubeflow, MLflow, Feast, KServe, Alibi-Detect, Evidently AI, Spark, etc.) into a cohesive, production-ready enterprise solution.
Define platform standards, best practices, and architectural patterns for MLOps development and operations.

Technical Leadership & Implementation Oversight :
Act as the primary technical authority and lead for the MLOps initiative, guiding both Dev Ops/Platform and MLOps/Data Science teams through the phased development plan.
Oversee the implementation of core platform components, ensuring robust integration, performance, and adherence to architectural blueprints.
Provide expert guidance on Kubernetes-native MLOps practices, distributed computing for ML (Spark, Kubeflow Training Operators), and model serving strategies (KServe).

Enterprise Security, Governance & Multi-Tenancy:

Architect and oversee the implementation of enterprise-grade security features including SSO (Keycloak), secrets management (Hashi Corp Vault), and fine-grained access control (Kubernetes RBAC, OPA Gatekeeper) for data and platform resources.
Design and enforce multi-tenancy models that provide strong isolation, resource governance, and secure data access for internal teams and external customers.
Ensure the platform meets stringent compliance requirements through comprehensive audit logging, tracing (Fluentd, ELK/Open Search, Prometheus/Grafana), and data lineage considerations.

ML Lifecycle & Data Management Expertise:
Architect and integrate a robust Feature Store (tool like Feast) for consistent feature engineering, management, and serving across training and inference.
Lead the integration of MLflow for experiment tracking, model versioning, and a centralized model registry.
Design and implement comprehensive model monitoring solutions, including data drift and model quality detection (Alibi-Detect/Evidently AI), with integrated alerting.

Developer Experience & Customization:
Champion the developer experience for data scientists, ensuring ease of use, self-service capabilities, and efficient workflows (e.g., automated namespace provisioning, notebook environment management).
Provide architectural guidance for building a custom, branded UI layer on top of the open-source components, enhancing usability and aligning with product offerings.

Collaboration & Mentorship:
Collaborate extensively with Data Science, Dev Ops, Security, Product Management, and Business stakeholders to gather requirements, communicate technical vision, and drive platform adoption.
Mentor and upskill engineering teams in MLOps best practices, cloud-native development, and advanced ML techniques.

Required Skills & Expertise:
5 to 8 yrs years of progressive experience in software engineering, data engineering, or MLOps, with at least 5 years in a lead or architect role focused on building and managing production of large-scale ML platforms.
Expert-level proficiency with Kubernetes and its ecosystem (operators, CRDs, Helm, networking, storage).
Experience in building/managing ML platform tools such as MLflow , Kubeflow, Airflow, Sage Maker, Vertex AI, or Azure Machine Learning.
Deep hands-on experience with Kubeflow (Pipelines, Notebooks, Training Operators, KServe) in production environments.
Extensive experience with MLflow for experiment tracking, model registry, and model lifecycle management.
Proven expertise in designing and implementing Feature Stores (e.g., Feast) for both online and offline serving.
Strong background in distributed data processing technologies like Apache Spark/PySpark, especially on Kubernetes.
Architectural experience with enterprise security solutions including SSO (Keycloak, OAuth/OIDC), secrets management (Hashi Corp Vault), and policy enforcement (Kubernetes RBAC, OPA Gatekeeper).
Demonstrated ability to implement comprehensive monitoring and observability stacks (Prometheus, Grafana, ELK/Open Search, Fluentd, Jaeger) for…
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary