Applied Research PhD Intern
Listed on 2026-06-24
-
IT/Tech
AI Engineer (Applied/Software), AI Evaluation
This role can be based remotely in United States
Description and RequirementsBMC empowers nearly 80% of the Forbes Global 100 to accelerate business value, faster than humanly possible. Our industry-leading portfolio unlocks human and machine potential to drive business growth, innovation, and sustainable success. BMC does this in a simple and optimized way by connecting people, systems, and data that power the world’s largest organizations so they can seize a competitive advantage.
BMC Software runs the systems that the world’s largest enterprises depend on — mainframes, automation, and the control plane underneath them. Putting agentic AI into that environment raises the bar: every agent's action must be grounded, auditable, and reversible. The Office of the CTO is working on the AI Foundation that makes this possible across BMC's product lines, and the heart of it is an Enterprise Agent Gym — the evaluation harness and experimentation loop that turns “the prototype worked” into “the agent is safe to promote to production.”
Howyou will contribute to BMC’s and your own success
Work directly with members of Technical Staff in the Office of the CTO, on the evals and experimentation layer that BMC AI products are built on.
- Design evaluations that catch the failure modes of enterprise agents: hallucinated tool calls, policy violations, context collapse, regression under distribution shift, etc.
- Build the Agent Gym — task definitions, graders, reward signals, and trajectory capture — for multi-step agentic workflows.
- Run experimentation sweeps across prompts, models, and scaffolds; quantify trade-offs between accuracy, cost, and latency.
- Turn eval results into promotion gates and readiness reports that product teams can act on.
- Contribute to our Responsible AI tooling — grounding checks, policy enforcement, and human-in-the-loop escalation paths.
Your project will be part of the BMC AI Foundation’s active work streams and shaped as a focused PhD-level research internship:
Agent Gym evaluations, grader design, experimentation tooling, dataset curation, or trace / replay infrastructure. Exact scope is matched to your doctoral research strengths during onboarding, with your technical mentors, and is sized to produce a concrete research artifact, prototype, or evaluation result within 12 weeks.
- Are currently pursuing a PhD in Computer Science, Machine Learning, AI, or a closely related field, with active research in LLMs, agents, reinforcement learning, AI safety, or evaluation methodology.
- Have produced non-trivial research or systems that work on modern LLM and agent stacks — multi-step tool-using agents, RAG pipelines, evaluation harnesses, and post-training.
- Can turn an open research question into testable hypotheses, choose strong baselines and ablations, interpret learning curves or reward trajectories honestly, and communicate findings clearly.
- Treat evaluation as a first-class AI research and engineering problem, not just a reporting layer.
- Published, submitted, or in-progress PhD research on LLM evaluation, agent benchmarks, alignment, RL environments, or related systems.
- Hands-on research experience with RLHF / RLVR, reward modeling, synthetic data generation, red-teaming, or scalable evaluation design.
- Contributions to open-source eval harnesses, agent scaffolds, observability tooling, or reproducible research infrastructure.
- Clear thinking about AI safety, deployment risk, benchmark validity, and the gap between academic results and enterprise production use.
BMC is committed to equal opportunity employment regardless of race, age, sex, creed, color, religion, citizenship status, sexual orientation, gender, gender expression, gender identity, national origin, disability, marital status, pregnancy, disabled veteran or status as a protected veteran. If you need a reasonable accommodation for any part of the application and hiring process, visit the accommodation request page.
BMC Software maintains a strict policy of not requesting any form of payment in exchange for employment opportunities, upholding a fair and ethical hiring process.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).