More jobs:
Principal Product Manager, SeekrFlow Evaluations
Job in
Austin, Travis County, Texas, 78716, USA
Listed on 2026-06-19
Listing for:
Seekr
Full Time
position Listed on 2026-06-19
Job specializations:
-
IT/Tech
AI Evaluation
Job Description & How to Apply Below
Seekr Flow is an end-to-end AI development platform used to build, fine-tune, evaluate, and deploy AI systems in production environments. As AI adoption accelerates across enterprises and government agencies, the ability to rigorously evaluate AI models and applications, and to trust those evaluations, has become a critical capability. We're hiring a Principal Product Manager to own Seekr Flow's evaluations product area end to end.
This is a senior individual contributor role with broad scope. You will define strategy and drive execution across all evaluation surfaces inside Seekr Flow: base models, fine-tuned models, agents, distilled applications, and beyond. You'll work in close partnership with other product leaders, ensuring that evaluation capabilities inside Seekr Flow connect seamlessly with scoring, certification, and governance workflows across all Seekr products.
You'll operate with a high degree of autonomy, partnering directly with Engineering, Research, Design, and GTM to make AI evaluation rigorous, scalable, and customer ready.
What You'll Own
- Evaluations product strategy & roadmap: Define and own the multi-year strategy and roadmap for Seekr Flow's evaluation capabilities, covering the full spectrum of model types (base models, fine-tuned models, agents, and distilled applications) and ensuring alignment with platform and business priorities.
- End-to-end product ownership: Drive evaluation features from discovery and specification through delivery, launch, and iteration, working in close partnership with Engineering and Design. Own prioritization decisions and hold the line on quality and scope.
- Cross-surface evaluation coverage: Develop deep expertise in the unique evaluation requirements for each model type and application surface, translating those requirements into coherent product direction across Seekr Flow.
- Seekr Guard alignment: Partner closely with other leaders in the technology organization to ensure Seekr Flow evaluation outputs connect directly to Seekr Guard's risk scoring and evaluation workflows, creating a seamless end-to-end trust pipeline.
- Research & capability translation: Engage with Research and Engineering teams to identify emerging evaluation techniques and benchmarks, and determine how to turn experimental work into scalable, customer-facing product capabilities.
- Customer & market insight: Work directly with enterprise and government customers to understand evaluation requirements, workflow gaps, and trust needs. Use those insights to validate direction and sharpen the roadmap.
- Technical specification: Write clear, detailed product specifications that define evaluation workflows, metrics, API/SDK surfaces, UI behavior, and expected system behavior across SaaS and self-hosted deployments.
- Metrics & outcomes ownership: Define success metrics for evaluation capabilities and track adoption, performance, and customer outcomes to drive continuous improvement.
- 8-12+ years of product management experience, with demonstrated ownership of complex, technically deep platform or infrastructure products
- Strong understanding of AI/ML model evaluation concepts, including benchmarking, bias and reliability testing, task-specific evaluation frameworks, and application types (base models, fine-tuned models, agents)
- Experience productizing research-driven or experimental technology areas where requirements evolved through iteration and technical discovery
- Proven ability to operate autonomously as a senior IC: setting direction, writing specifications, and driving cross-functional execution
- Comfortable engaging deeply with engineers, designers, and researchers on technical tradeoffs, system design decisions, and emerging research
- Experience building for enterprise or government customers in B2B software environments, including multi-stakeholder requirements and compliance-sensitive contexts
- Strong product writing skills: able to produce structured, precise specifications that enable high-quality engineering execution
- Clear communicator capable of making complex evaluation concepts legible to both technical teams and executive or customer…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×