Lead Associate Principal, Cloud Engineering
Listed on 2026-02-16
IT/Tech
Systems Engineer, Cloud Computing
What You’ll Do
This role will perform a range of activities required to both maintain and continuously automate a large, complex Kubernetes-based cloud computing environment. In addition, you will provide technical guidance to the team and, when called upon, serve as a technical liaison between internal departments. This will involve utilizing best practices for the management, architecture, configuration, high availability, disaster recovery, administration, and automation of Kubernetes clusters and containerized workloads with cloud-native technologies.
These activities will drive the creation of new infrastructure and environments, which will be critical to the continued growth and adoption of Kubernetes and container orchestration across the business. The ideal candidate is passionate about using cloud-native technologies and Kubernetes ecosystem tools to accomplish complex project initiatives and implement mission-critical systems. They keep current with trends in the Kubernetes and CNCF spaces to identify areas for improvement, with a steady eye on the extensive regulatory and compliance demands on our company (e.g., CIS, NIST).
Duties and Responsibilities
To perform this job successfully, an individual must be able to perform each primary duty satisfactorily.
- Reports to the Director of Platform Automation and Cloud Engineering
- Design, configure, implement and manage Kubernetes clusters and maintain a fully automated workflow for provisioning and managing a complex, highly available container orchestration environment using infrastructure as code
- Develop and maintain Kubernetes operators, controllers, and custom resources to extend cluster functionality and automate application lifecycle management
- Manage DevOps development activities and complex development tasks that involve working with tools such as Docker, Kafka, container runtimes, and Kubernetes ecosystem tools
- Lead and participate in Kubernetes cluster build-outs, upgrades, software installation, maintenance, and support, including but not limited to patches, security fixes, end-of-life preparation, and version upgrades
- Implement and manage Kubernetes networking solutions, service mesh architectures, runtime security policies, and RBAC configurations to ensure secure and efficient cluster operations
- Ensure the reliability of the Kubernetes platforms and containerized services that your area of responsibility provides, and manage them to both specific and implied SLAs, to help the organization meet internal and external quality standards for the cloud platform
- Assess and plan for capacity needs within Kubernetes clusters and the underlying cloud platform and forecast accordingly
- Implement and manage initiatives within your assigned area of responsibility with accountability for results and compliance with all controls and security requirements
- Lead in the development of technology roadmaps and end-of-life technology plans for Kubernetes versions, container runtimes, and related cloud-native technologies
- Write and maintain documentation of relevant Kubernetes architectures, systems, procedures and processes
- Effectively communicate project and operational service issues to senior management promptly with observations, decisions, and recommendations for corrective measures
- Manage and participate in the implementation of production changes during defined maintenance windows and support the on-call rotation
- Maintain appropriate work/personal balance within your team
- Serve as a point of escalation within the team for Kubernetes and containerization support issues
- Implement and manage rotational support schedules for after-hours and weekend work within your area of responsibility
- Foster an atmosphere of trust, respect, and high performance while displaying strong ethics and integrity
- Manage project and daily task planning and prioritization, meeting project deadlines while maintaining a high quality of work
- Institute corrective actions to address audit and other regulatory or compliance findings
- Operate within budget
- Establish and assure adherence to schedules, work plans, and performance requirements
- Other duties as assigned
The requirements listed are representative of the knowledge, skill, and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the primary functions.
- [Required] Strong consultative, communication, teamwork, and analytical skills, as you will regularly interact with teams distributed across the US
- [Required] Working knowledge of Kubernetes architecture, container orchestration, and cloud-native infrastructure design and components, such as etcd, networking, storage, and container runtimes
- [Required] Extensive hands-on experience with Kubernetes cluster creation, maintenance, support, and administration in production environments
- [Required] Deep understanding and practical implementation experience with…