Senior Linux & Cloud Infra Engineer; OCI/Azure
JOB DESCRIPTION
Own day-to-day operations and continuous improvement of enterprise Linux platforms and storage services across on-premises data centers and cloud environments (OCI and Microsoft Azure). Ensure platforms are stable, secure, and highly available, and provide expert-level troubleshooting and guidance to cross-functional teams.
Position Overview
JOB DESCRIPTION
Own day-to-day operations and continuous improvement of enterprise Linux platforms and storage services across on-premises data centers and cloud environments (OCI and Microsoft Azure). Ensure platforms are stable, secure, and highly available, and provide expert-level troubleshooting and guidance to cross-functional teams.
Job Summary
We are seeking a highly skilled Senior Linux Administrator with strong expertise in on-premises infrastructure, Oracle Cloud Infrastructure (OCI), and Microsoft Azure Cloud. The ideal candidate will be responsible for managing, maintaining, and optimizing enterprise Linux environments across hybrid infrastructures, ensuring high availability, performance, and security. Knowledge of Oracle Database is mandatory and the role will partner closely with database teams to support Oracle Database platforms on Linux.
Responsibilities
Key Responsibilities
Linux System Administration
- Install, configure, and maintain Linux servers (RHEL, Oracle Linux, CentOS, Alma Linux).
- Perform system upgrades, patching, and security hardening using enterprise patching solutions (e.g., Red Hat Satellite) and standard change management practices.
- Monitor system performance, troubleshoot issues, and ensure high availability.
- Manage user access, permissions, and system security.
- Deploy, configure, and manage Linux workloads on OCI and Azure platforms.
- Design and maintain scalable, secure cloud infrastructure.
- Work with OCI services including Compute, VCN, Block Volumes, File Storage, Load Balancer, IAM, and monitoring/logging capabilities.
- Manage Azure services including Virtual Machines, Virtual Networks, Azure Storage (Managed Disks, Files), Azure Backup (Recovery Services Vault), monitoring, and Microsoft Entra (Azure AD).
- Implement backup, disaster recovery, and high availability solutions in cloud environments using OCI/Azure-native capabilities (e.g., Azure Backup via Recovery Services Vault, Azure Site Recovery (ASR), and OCI backup features) and/or enterprise backup tools as applicable.
- Apply cloud governance and operational best practices (tagging, least privilege, patching/maintenance windows) including patch orchestration with Azure Update Manager for Azure VMs; optimize performance and cost for Linux workloads.
- Manage physical and virtualized environments (VMware) including provisioning, lifecycle management, and performance tuning.
- Administer enterprise storage platforms and services including SAN/NAS, RAID concepts, LUN provisioning, zoning/masking (as applicable), and NFS/SMB integrations; experience with storage arrays such as Net App (ONTAP), HPE (3
PAR), and IBM storage is highly desirable; familiarity with SAN fabrics (Brocade and/or Cisco MDS) is a plus. - Configure and troubleshoot Linux storage stack components such as multipathing, LVM, udev rules, and file systems (ext4, XFS); perform capacity management and growth activities.
- Ensure seamless integration between on-prem and cloud (hybrid connectivity, routing/DNS, identity, and operational processes) and perform ongoing infrastructure optimization.
- Implement and validate backup/restore and disaster recovery procedures; hands-on experience with enterprise backup platforms such as Commvault, Veritas Net Backup, and Cohesity; leverage storage snapshots and replication (vendor-native replication, Net App, replication where applicable) and participate in DR drills and recovery testing.
- Develop automation scripts using Bash, Python, or Shell scripting.
- Use configuration management tools like Ansible.
- Manage Infrastructure as Code (IaC) tools (Terraform preferred).
- Implement and manage monitoring tools.
- Analyze system logs and proactively resolve issues.
- Ensure SLA compliance and system uptime.
- Implement security best practices and vulnerability management.
- Ensure compliance with organizational and regulatory standards.
- Manage firewall rules, SELinux policies, and access controls.
- Work closely with Dev Ops, application, and database teams.
- Provide L2/L3 support and participate in incident management.
- Prepare technical documentation and SOPs.
Required
Skills & Qualifications
- Bachelor’s degree in Computer Science, Information Technology, or equivalent practical experience.
- 8+ years of hands-on Linux system administration experience in enterprise environments (production operations and on-call support).
- Strong expertise in:
- Enterprise Linux (RHEL/Oracle Linux; systemd, networking, troubleshooting, performance tuning)
- Oracle Cloud…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).