×
Register Here to Apply for Jobs or Post Jobs. X

Automation Engineer - Scientific Data, AI​/ML Pipelines & Integration Dev

Job in Boston, Suffolk County, Massachusetts, 02298, USA
Listing for: Zifo
Full Time position
Listed on 2026-06-05
Job specializations:
  • IT/Tech
    Data Engineer, Data Analyst, Data Science Manager, AI Engineer
Job Description & How to Apply Below
Location: Boston, Indianapolis, RTP North Carolina & Chicago

Zifo is a global specialist scientific and process informatics services company supporting life sciences, biotech, and pharmaceutical organizations. We enable digital transformation across R&D, manufacturing, and quality by delivering data-driven, scalable, and compliant software solutions.

Zifo is seeking a passionate Software Developer who can work at the intersection of science, data, and technology. The role requires strong expertise in Benchling, Python, SQL/No

SQL, AWS and FastAPI, along with the ability to work directly with scientists performing assay-based experiments. The successful candidate will translate experimental workflows into robust data components, scientific system integrations, AI-enabled insights, and next-generation data pipelines

Requirements
  • Collaborate with scientists, assay teams, and lab operations to capture end-to-end assay and experimental workflows, from sample onboarding and execution through data ingestion, validation, and downstream analytics
  • Translate scientific and operational requirements into well-defined functional, technical, and data requirements for laboratory platforms, system integrations, and next-generation data pipelines
  • Design, develop, and maintain Python-based backend services, APIs, microservices, and data pipelines on AWS using FastAPI and supporting frameworks such as Flask or Django, including integrations with scientific systems such as Benchling, Signals, LIMS, ELN, CDS, and SDMS.
  • Design and optimize SQL and No

    SQL data models and build ETL/ELT and next-generation data pipelines to support structured, semi-structured, and high-volume scientific data, analytics, and AI/ML workloads, including dataset preparation, feature engineering, and model integration into pipelines and applications.
  • Implement and maintain CI/CD pipelines for automated build, testing and deployment
  • Ensure solutions meet performance, data integrity, security, and regulatory compliance requirements (e.g., GxP, 21 CFR Part 11)
  • Perform code reviews, debugging, and performance optimization
  • Coordinate across cross-functional and geographically distributed teams, managing dependencies and ensuring delivery alignment
  • Create ready to deliver technical documentation and track deliverables using JIRA and Confluence
Required Qualifications
  • Bachelor's or master's degree in computer science, Engineering, Life Sciences with 3-8 years of hands-on experience in Python development with FastAPI
  • Proficiency in SQL, including schema design, complex queries, and performance optimization
  • Relational databases such as Postgre

    SQL, MySQL, Oracle, AWS RDS/Aurora, No

    SQL databases such as Dynamo

    DB, Mongo

    DB, or equivalent
  • Experience with scientific data and laboratory informatics, including familiarity with Benchling or similar scientific data platforms ELN like Benchling, LIMS, ELN, SDMS, CDS,, within the life sciences or pharmaceutical industry. (Preferred)
  • AWS experience, including S3, EC2, Lambda, Step Functions, RDS / Aurora, IAM, monitoring, and logging
  • Proficiency with Git-based collaborative development, including branch management, pull requests, code reviews, and integration with CI/CD pipelines (Git Hub Actions, Git Lab CI, Jenkins, AWS Code Pipeline) to ensure reliable and traceable software delivery
  • Hands-on experience with Test-Driven Development and Python testing frameworks such as pytest, unittest, and mocking libraries
  • Working knowledge of AI/ML concepts, including data preparation, feature engineering, model integration, and inference workflows
  • Exposure to the data and ML libraries such as pandas, Num Py, and scikit-learn (exposure to Tensor Flow or PyTorch is a plus)
  • Ability to design data models aligned to scientific and assay workflows & integrating scientific or enterprise systems and working directly with scientists or lab users
  • Knowledge of containerization (Docker) and modern deployment best practices
  • Familiarity with Agile/Scrum & SDLC development methodologies & Solid understanding of REST APIs, microservices, and integration patterns
  • Strong communication, stakeholder engagement, and cross-team coordination skills
Additional…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary