Lead Machine Learning Infrastructure Engineer
Listed on 2025-12-27
-
IT/Tech
Cloud Computing, Systems Engineer, Data Engineer, AI Engineer
About Convene Inc.
Convene, Inc. is a Tampa based, award-winning technology services organization with offices and resources throughout the US, Mexico, and India. We have successful, referenceable customers, competitive benefits, and high-growth opportunities.
Lead ML Infrastructure Engineer Mountain View, CA/ Dallas, TX / Chicago, IL / NYC, NY About the RoleWe are seeking a highly skilled Lead ML Infrastructure Engineer to spearhead the development, deployment, and scaling of machine learning infrastructure. This pivotal role involves collaborating closely with data scientists, ML engineers, and operations teams to build robust, scalable, and efficient machine learning pipelines. The ideal candidate will be passionate about pushing the boundaries of ML infrastructure, and possess a deep understanding of cloud platforms, containerization, and big data technologies.
Responsibilities- Lead the design, implementation, and maintenance of scalable ML infrastructure solutions.
- Collaborate with data science and ML teams to optimize model deployment workflows.
- Develop and manage CI/CD pipelines to automate deployment processes.
- Architect and implement containerized environments using Docker and Kubernetes.
- Ensure infrastructure security, reliability, and compliance across cloud platforms.
- Optimize resource utilization and cost-efficiency in cloud environments.
- Drive best practices in Infrastructure as Code (IaC) with tools like Terraform.
- Stay current with the latest advancements in ML frameworks, cloud services, and infrastructure tooling.
- Mentor junior team members and promote a culture of continuous improvement.
- Proven experience as a Machine Learning Engineer or Infrastructure Engineer, with a focus on ML infrastructure.
- Strong expertise in programming languages Python and Java.
- Hands-on experience working with cloud platforms, with a strong preference for Google Cloud Platform; AWS and Azure experience are also valuable.
- Familiarity with popular machine learning frameworks such as Tensor Flow and PyTorch, along with libraries like scikit-learn.
- Solid understanding of Dev Ops principles and experience with CI/CD pipelines.
- Experience with Infrastructure as Code tools, especially Terraform.
- Proficiency in containerization technologies including Docker and Kubernetes.
- Knowledge of big data processing tools like Apache Spark and Hadoop is highly preferred.
- Excellent problem-solving abilities combined with effective communication skills.
- Ability to work collaboratively in a fast-paced, dynamic environment.
- Master’s or Bachelor’s or Master’s in Computer Science, Data Science, or related field.
- Certifications in cloud platforms (e.g., Google Cloud Platform Professional Cloud Architect, AWS Certified Solutions Architect).
- Demonstrated experience leading a team or managing complex infrastructure projects.
- Contributions to open-source ML or Dev Ops projects.
- Experience with monitoring and logging tools such as Prometheus, Grafana, ELK stack.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).