Automation Engineer - Scientific Data,AI/ML Pipelines & Integration Dev Job Boston area,Massachusetts USA,IT/Tech

Location: Boston, Indianapolis, RTP North Carolina & Chicago

Zifo is a global specialist scientific and process informatics services company supporting life sciences, biotech, and pharmaceutical organizations. We enable digital transformation across R&D, manufacturing, and quality by delivering data-driven, scalable, and compliant software solutions.

Zifo is seeking a passionate Software Developer who can work at the intersection of science, data, and technology. The role requires strong expertise in Benchling, Python, SQL/No

SQL, AWS and FastAPI, along with the ability to work directly with scientists performing assay-based experiments. The successful candidate will translate experimental workflows into robust data components, scientific system integrations, AI-enabled insights, and next-generation data pipelines

Requirements

Collaborate with scientists, assay teams, and lab operations to capture end-to-end assay and experimental workflows, from sample onboarding and execution through data ingestion, validation, and downstream analytics
Translate scientific and operational requirements into well-defined functional, technical, and data requirements for laboratory platforms, system integrations, and next-generation data pipelines
Design, develop, and maintain Python-based backend services, APIs, microservices, and data pipelines on AWS using FastAPI and supporting frameworks such as Flask or Django, including integrations with scientific systems such as Benchling, Signals, LIMS, ELN, CDS, and SDMS.
Design and optimize SQL and No

SQL data models and build ETL/ELT and next-generation data pipelines to support structured, semi-structured, and high-volume scientific data, analytics, and AI/ML workloads, including dataset preparation, feature engineering, and model integration into pipelines and applications.
Implement and maintain CI/CD pipelines for automated build, testing and deployment
Ensure solutions meet performance, data integrity, security, and regulatory compliance requirements (e.g., GxP, 21 CFR Part 11)
Perform code reviews, debugging, and performance optimization
Coordinate across cross-functional and geographically distributed teams, managing dependencies and ensuring delivery alignment
Create ready to deliver technical documentation and track deliverables using JIRA and Confluence

Required Qualifications

Bachelor's or master's degree in computer science, Engineering, Life Sciences with 3-8 years of hands-on experience in Python development with FastAPI
Proficiency in SQL, including schema design, complex queries, and performance optimization
Relational databases such as Postgre

SQL, MySQL, Oracle, AWS RDS/Aurora, No

SQL databases such as Dynamo

DB, Mongo

DB, or equivalent
Experience with scientific data and laboratory informatics, including familiarity with Benchling or similar scientific data platforms ELN like Benchling, LIMS, ELN, SDMS, CDS,, within the life sciences or pharmaceutical industry. (Preferred)
AWS experience, including S3, EC2, Lambda, Step Functions, RDS / Aurora, IAM, monitoring, and logging
Proficiency with Git-based collaborative development, including branch management, pull requests, code reviews, and integration with CI/CD pipelines (Git Hub Actions, Git Lab CI, Jenkins, AWS Code Pipeline) to ensure reliable and traceable software delivery
Hands-on experience with Test-Driven Development and Python testing frameworks such as pytest, unittest, and mocking libraries
Working knowledge of AI/ML concepts, including data preparation, feature engineering, model integration, and inference workflows
Exposure to the data and ML libraries such as pandas, Num Py, and scikit-learn (exposure to Tensor Flow or PyTorch is a plus)
Ability to design data models aligned to scientific and assay workflows & integrating scientific or enterprise systems and working directly with scientists or lab users
Knowledge of containerization (Docker) and modern deployment best practices
Familiarity with Agile/Scrum & SDLC development methodologies & Solid understanding of REST APIs, microservices, and integration patterns
Strong communication, stakeholder engagement, and cross-team coordination skills

Additional…

Automation Engineer - Scientific Data, AI​/ML Pipelines & Integration Dev

Automation Engineer - Scientific Data, AI/ML Pipelines & Integration Dev