ML Engineer (Evaluation and Experimentation)

Job in Arlington, Arlington County, Virginia, 22201, USA
Listing for: Cynnovative
Full Time position
Listed on 2026-05-08
Job specializations:
  • IT/Tech
    Machine Learning/ ML Engineer, Data Scientist, Data Analyst, Data Science Manager
Salary/Wage Range or Industry Benchmark: 60,000 - 80,000 USD yearly
Job Description & How to Apply Below
Position: ML Engineer (Evaluation and Experimentation)

At Cynnovative, we leverage machine learning, computer science, and software engineering to address high-impact problems in the cyber domain, specifically those critical to U.S. national security. We primarily extend fundamental research to invent, design, develop, and deploy prototype solutions that address persistent problems in this domain.

Job Overview

As a Machine Learning Engineer (Evaluation & Experimentation) at Cynnovative, you will build and maintain systems that run large-scale experiments and evaluate LLM outputs. This role is crucial to rapid, experiment-driven iteration on LLM systems in support of U.S. national security efforts.

NOTE: This role requires an active TS/SCI security clearance and is located on-site in Northern Virginia.

Responsibilities (May Include)

Design and implement evaluation pipelines for LLM experimentation

  • Implement and apply metrics over model outputs at scale
  • Build automated evaluation workflows across large experiment sets
  • Execute statistical analysis and testing over experimental results
  • Ensure consistency and comparability of results across runs, configurations, and datasets

Develop experiment tracking and logging specifications

  • Define schemas for capturing prompts, perturbations, outputs, and configurations
  • Specify and validate logging of token-level probabilities, scores, and derived metrics
  • Ensure experiment data is structured, complete, and queryable for downstream analysis

Build and maintain datasets and evaluation inputs

  • Curate prompt sets, perturbation strategies, and test cases provided by the research team
  • Maintain versioned datasets and experiment inputs
  • Enable rapid iteration on experiment configurations and evaluation coverage

Collaborate cross-functionally

  • Work closely with ML systems engineers to ensure correct data capture at scale
  • Provide feedback on experiment execution, data quality, and metric behavior
  • Support interpretation of experimental results through reliable measurement

Requirements (Must Have)

  • B.S. in Computer Science, Data Science, or related field (M.S. or Ph.D. preferred)
  • Strong communication skills and ability to collaborate cross-functionally
  • Proficiency in Python and data processing
  • Experience building experiment, evaluation, or analytics pipelines
  • Familiarity with experiment tracking tools (MLflow or similar)
  • Experience working with large-scale or batch data processing workflows
  • Understanding of statistical methods
  • Experience working with structured and semi-structured data
  • Experience with version control systems, particularly Git
  • U.S. Citizenship and active TS/SCI security clearance

Desired Skills (Nice To Have)

  • Familiarity with prompt sensitivity, perturbation analysis, or robustness testing
  • Prior experience in a research-to-product environment
  • Understanding of A/B testing and large-scale experimentation
  • Familiarity with cyber-related data, tools, and techniques