
AI EVAL Engineer

Job in Bellevue, King County, Washington, 98009, USA
Listing for: Akkodis Group Nordics
Full Time position
Listed on 2026-02-16
Job specializations:
  • IT/Tech
    AI Engineer, Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 50 - 53 USD hourly
Job Description & How to Apply Below

Akkodis is seeking an AI EVAL Engineer for a contract position with a client in Bellevue, WA (remote). Candidates must have strong Python programming skills and hands-on experience with AI evaluation frameworks and metrics in a Linux environment.

Rate Range: $50/hour to $53/hour.
The rate may be negotiable based on experience, education, geographic location, and other factors.

Responsibilities
  • Design, implement, and automate evaluation test suites to measure LLM accuracy, relevance, safety, latency, and cost across zero-shot, few-shot, and system-prompt scenarios.
  • Define and apply robust evaluation metrics (e.g., precision/recall, BLEU/ROUGE, F1, hallucination rate, throughput, cost-per-output) and establish reproducible baselines for model comparison.
  • Build datasets, ground-truth references, and benchmarks, and maintain versioned test cases for consistent, repeatable scoring.
  • Develop batch evaluation pipelines in Python (and other languages as needed) with API integrations, using frameworks such as OpenAI Evals, Hugging Face evals, Promptfoo, Ragas, DeepEval, or LM Eval Harness.
  • Conduct performance benchmarking and analysis across Azure OpenAI (and other providers), reporting insights on speed, scalability, and resource efficiency.
  • Assess and mitigate AI safety, bias, and hallucination risks, while collaborating with product, research, and platform teams to improve prompts, guardrails, and overall model quality.
Required Qualifications
  • Bachelor’s or Master’s degree in Computer Science, Data Science, AI/ML, or a related field.
  • 3–5+ years of experience in AI/ML evaluation, benchmarking, or applied ML (including LLMs and generative AI).
  • Strong Python skills with hands-on experience in evaluation frameworks (e.g., OpenAI Evals, Hugging Face evals, Promptfoo, Ragas, DeepEval, LM Eval Harness) and in defining and applying metrics (precision/recall, BLEU/ROUGE, F1, hallucination rate, latency, cost).
  • Practical experience with Azure OpenAI (and/or OpenAI/Anthropic/Google AI), test automation pipelines, and benchmarking across zero-/few-shot prompts; familiarity with RAG evaluation and AI safety/bias testing is a plus.
Pay and Benefits

Pay Details: $50.00 to $53.00 per hour

Benefit offerings available for our associates include medical, dental, vision, life insurance, short-term disability, additional voluntary benefits, an EAP, commuter benefits, and a 401(k) plan. Our benefit offerings provide employees the flexibility to choose the type of coverage that meets their individual needs. In addition, our associates may be eligible for paid leave, including Paid Sick Leave or any other paid leave required by federal, state, or local law, as well as holiday pay where applicable.

Equal Opportunity and Privacy

Equal Opportunity Employer/Veterans/Disabled

Military connected talent encouraged to apply

To read our Candidate Privacy Information Statement, which explains how we will use your information, please navigate to

Additional Notes

The Company will consider qualified applicants with arrest and conviction records in accordance with federal, state, and local laws and/or security clearance requirements, including, as applicable:

  • The California Fair Chance Act
  • Los Angeles City Fair Chance Ordinance
  • Los Angeles County Fair Chance Ordinance for Employers
  • San Francisco Fair Chance Ordinance

Massachusetts Candidates Only: It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.
