Senior Engineer II - Agent Harness & Workflow Engineering

Job in Lowell, Middlesex County, Massachusetts, 01856, USA
Listing for: IBM
Full Time position
Listed on 2026-05-16
Job specializations:
  • Software Development
    AI Engineer, Machine Learning/ML Engineer
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly
Job Description & How to Apply Below

Overview

At IBM Software, we transform client challenges into solutions, building the world’s leading AI-powered, cloud-native products that shape the future of business and society. Our legacy of innovation creates opportunities for IBMers to learn, grow, and make an impact on a global scale. You’ll work with diverse technologies, partners, and industries to design, develop, and deliver solutions that power digital transformation.

IBM Software values innovation, growth, and continuous learning, placing you at the heart of IBM’s product and technology landscape.

Your Role And Responsibilities

We are an agent-first engineering team building intelligent infrastructure automation. Our platform combines frontier and small language models with strong systems engineering to deliver production-grade automation backed by rigorous evaluation, telemetry, and quality gates. You’ll work side-by-side with both humans and AI agents, where writing code and orchestrating agents are equally core to the role.

What You’ll Do
  • Build and evolve type-safe, composable AI pipelines for intent analysis, planning, knowledge enrichment, and infrastructure generation
  • Extend pipeline routing and outputs across Z, Power, and Cloud platforms (Terraform, Ansible, Pulumi, native APIs)
  • Design context management, telemetry, and production observability for multi-step AI workflows
  • Own a markdown-based workflow engine supporting parallel execution, rollback, approvals, and cross-domain orchestration
  • Develop and scale a plugin-based tool provider system, integrating Power and Cloud APIs with secure session management
  • Advance a robust evaluation framework (LLM-as-judge, regression detection, eval-as-CI) to ensure AI-generated outputs are production-safe
  • Help build a cross-platform workflow catalog used by humans and AI agents for discovery, routing, and execution
  • Extend the agent-to-agent (A2A) interface for capability discovery and delegation as the platform scales

First 90 Days

Month 1:
Onboard onto the Go agent codebase. Run the full eval suite, understand the pipeline composition model, and execute scripted workflows against the infrastructure simulator. Trace a request end-to-end from the A2A interface through intent analysis, pipeline routing, tool dispatch, and infrastructure code output. Understand the eval case format, run cross-model comparisons, and review regression baselines.

Month 2:
Extend the eval corpus: add new cases covering under-tested pipelines (iterative planning, deep research, workflow composition). Begin workflow catalog design: metadata schema, platform tagging, versioning model. Prototype a Power tool provider interface based on the Power HMC API surface.

Month 3:
Ship the workflow catalog MVP with the existing Z workflows cataloged and tagged. Deliver the first Power or Cloud tool provider prototype. Implement eval-as-CI with quality gates. Begin cross-platform pipeline routing: intent analysis that distinguishes Z, Power, and Cloud prompts and routes each to the correct pipeline/tool provider combination.

Education And Experience

Preferred Education — Bachelor’s Degree

Required Technical And Professional Expertise
  • Strong Go engineering. The entire harness is Go — pipeline composition, workflow execution, tool dispatch, eval framework, agent protocol server. You need to be productive in Go immediately.
  • You’ve built execution systems. Workflow engines, pipeline frameworks, DAG executors, orchestration infrastructure — you understand step composition, error propagation, checkpoint/rollback, and what it takes to make multi-step execution reliable.
  • LLM integration experience. You’ve worked with LLM APIs in production — prompt construction, structured output parsing, function/tool calling, context window management. You understand the practical challenges of making LLM outputs reliable.
  • You care about quality measurement. You’ve built or maintained evaluation, testing, or quality systems. You understand regression detection, baseline management, and why “it works on my machine” isn’t good enough for infrastructure automation.

Preferred Technical And Professional Experience

You don’t need all of these coming in. The team will bring you up to speed:

  • IBM Z, Power, and Cloud infrastructure APIs and the operational domain knowledge encoded in our workflows
  • Terraform, Ansible, and infrastructure-as-code patterns — what correct generated infrastructure code looks like
  • Our pipeline composition model, scripted workflow engine, and tool provider plugin system

Position Requirements
10+ Years work experience