Job Description & How to Apply Below
Skill:
Parallel File Systems + Object Storage + AI Infrastructure
Location:
Noida/Bangalore
Skill Requirement
Strong hands-on expertise in parallel file systems, object storage platforms, and AI infrastructure environments
Responsible for deployment, administration, troubleshooting, and performance optimization of storage systems supporting AI/ML, HPC, and Kubernetes workloads
Ensures data availability, performance, and reliability for large-scale AI training and inference pipelines
Storage Platforms (L3)
Certifications (Preferred):
NetApp Certified Data Administrator (NCDA) / NetApp Certified Technology Associate (NCTA)
Dell EMC Proven Professional (PowerScale / VAST Data / WekaIO / ECS / PowerStore)
Ceph Certification (Red Hat Ceph Storage / SUSE Storage)
Vendor certifications in:
Cohesity / Commvault / Veeam / Rubrik
Experience:
5–10 years in enterprise storage / backup / data platform operations
Hands-on experience working with:
Parallel file systems (Lustre, BeeGFS, IBM Spectrum Scale – awareness or hands-on)
Network File System (NFS over RDMA)
Object storage platforms (Ceph, MinIO, NetApp StorageGRID, Dell ECS)
Software-defined storage (SDS) platforms
Exposure to:
AI / HPC environments where high-throughput storage is critical
Multi-tenant storage environments and large-scale datasets
Skill Depth:
Strong in:
Storage troubleshooting, performance tuning, and RCA
Ability to manage and optimize:
High-throughput data pipelines for AI workloads
Storage performance (IOPS, throughput, latency)
Hands-on experience with:
File system tuning, metadata performance, striping, caching
Object storage operations (bucket management, lifecycle policies)
Working knowledge of:
Cloud storage platforms (AWS S3, Azure Blob, Google Cloud Storage)
Backup and recovery tools (snapshot, replication, DR operations)
Understanding of:
Storage integration with:
Kubernetes (PVC, CSI drivers)
AI/ML workflows (data ingestion, model training pipelines)
Support:
Data lifecycle management, backup scheduling, and restore operations
AI Infrastructure & Data Layer (L3)
Skill Depth:
Understanding of:
Role of high-performance storage in AI infrastructure (feeding GPU workloads efficiently)
Experience supporting:
AI/ML workloads requiring:
Large-scale dataset ingestion and processing
High read/write throughput systems
Exposure to:
Data layer in AI pipelines:
Training datasets
Model artifacts
Inference data pipelines
Cloud Storage & Backup (L3)
Certifications (Preferred):
AWS Certified Solutions Architect – Associate
Azure Fundamentals / Azure Administrator
GCP Associate Cloud Engineer
Experience:
Experience working with:
Cloud-native storage services (S3, Azure Blob, GCS)
Hybrid storage architectures (on-prem + cloud integration)
Skill Depth:
Manage:
Object lifecycle policies, versioning, replication
Support:
Backup, restore, and disaster recovery operations
Understanding of:
Cost optimization basics for cloud storage
Data movement between on-prem and cloud
Role Expectation (L3)
Acts as independent storage SME
Owns:
Complex storage incidents, RCA, and performance optimization
Supports:
Production AI infrastructure environments
Works closely with:
L4 architects for capacity planning and design improvements
Contributes to:
Standardization, documentation, and runbook development
Qualifications & Experience
Bachelor’s degree in Computer Science / IT / Engineering
8–12 years overall infrastructure experience
5–8 years in storage / data platform / backup domain
Hands-on experience in:
Enterprise storage platforms (NAS/SAN/Object storage)
Parallel file systems and distributed storage
AI/ML infrastructure (preferred)
Experience in:
Kubernetes environments (basic storage integration)
Backup and data protection platforms
Certifications Required (L3)
At least one major Storage OEM Certification:
NetApp / Dell EMC / Cohesity / Commvault / Rubrik
Cloud (at least one):
AWS / Azure / GCP (Associate level)
NVIDIA Certified Professional: AI Infrastructure & Operations
NVIDIA DLI – Building AI Infrastructure with NVIDIA Technologies