Software Developer, Cloud Computing, Machine Learning/ ML Engineer
Listed on 2026-03-15
-
IT/Tech
Cloud Computing, Machine Learning/ ML Engineer
Posting Description
OverviewSOFTWARE DEVELOPER 2, Chemical Engineering (ChemE) - Machine Learning for Pharmaceutical Discovery and Synthesis (MLPDS) Consortium, to support the development, deployment, and operation of machine learning–enabled software systems for research and engineering applications. Will work closely with faculty, researchers, and graduate students to translate scientific and engineering workflows into robust, scalable, and reproducible computational applications. Will focus on backend software development, cloud-based deployment, and the management of machine learning applications in containerized and Kubernetes-based environments.
Responsibilities include software and machine learning development of machine learning models using established frameworks such as PyTorch and PyTorch Lightning; cloud infrastructure and containerization using Docker and Kubernetes, with a primary focus on AWS Elastic Kubernetes Service (EKS); design and maintain continuous integration and continuous deployment (CI/CD) pipelines to automate testing, validation, and deployment of research software; and support reproducible build and deployment processes consistent with academic and research best practices.
Will implement and maintain application and infrastructure monitoring using tools such as Prometheus and Grafana; collect and analyze application usage and performance metrics using platforms such as Google Analytics or comparable tools; and assist in diagnosing and resolving software and infrastructure issues affecting research applications.
- Software and Machine Learning Development of ML models using PyTorch and PyTorch Lightning.
- Cloud infrastructure and containerization using Docker and Kubernetes, focused on AWS EKS.
- Design and maintain CI/CD pipelines to automate testing, validation, and deployment of research software.
- Support reproducible build and deployment processes aligned with academic and research best practices.
- Implement and maintain application and infrastructure monitoring with Prometheus and Grafana; collect and analyze usage and performance metrics; assist in diagnosing and resolving issues affecting research applications.
- REQUIRED
:
Bachelor’s degree in Computer Science, Chemical Engineering, Chemistry, or a closely related discipline; a minimum of five years of relevant programming experience; demonstrated experience programming in Python, JavaScript, and Bash. - Practical experience with Docker and Kubernetes, including deployment on AWS EKS; experience deploying and managing Kubernetes applications using Helm; proficiency in Kubernetes tooling, including Lens for app/cluster management; proficiency with Git and collaborative development workflows (Git Hub and/or Git Lab); experience developing and maintaining CI/CD pipelines; experience with one or more machine learning frameworks including PyTorch; experience developing backend software, including asynchronous or distributed processing systems;
and familiarity with monitoring tools such as Prometheus and Grafana. - PREFERRED
:
Familiarity with RDKit or other cheminformatics or molecular modeling libraries; experience supporting computational research or research software platforms; experience working in interdisciplinary academic or research environments; and familiarity with best practices for research data management and open-source development.
Location:
Massachusetts Institute of Technology
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).