Machine Learning Engineer, Prompt Safety and Agent Security

Job in Mountain View, Santa Clara County, California, 94039, USA
Listing for: The Mom Project
Full Time position
Listed on 2026-06-03
Job specializations:
  • IT/Tech
    AI Engineer, Machine Learning/ML Engineer
Salary/Wage Range or Industry Benchmark: 125,000 - 150,000 USD yearly
Job Description & How to Apply Below

Overview

Our Customer is a Silicon Valley-based company that researches emerging technologies. We are seeking a contract Machine Learning Engineer to support our Customer's business needs. This role is on-site in Mountain View, CA.

Responsibilities
  • Design and train prompt injection detection models and prompt safety classifiers for agentic AI systems.
  • Build safety models that evaluate both inputs and outputs across AI workflows.
  • Develop hybrid deployment pipelines that split safety inference between on-device and cloud environments.
  • Optimize safety inference systems for latency, privacy, and detection coverage.
  • Apply post-training techniques such as RLHF, reward modeling, DPO, RLAIF, and policy optimization to improve guardrail model performance.
  • Improve model calibration, stability, and robustness against adaptive adversarial attacks.
  • Curate and generate adversarial training data, including prompt injections, jailbreaks, tool-use exploits, and unsafe-output cases.
  • Leverage red-teaming outputs and production signals to improve training datasets.
  • Build evaluation harnesses to measure attack success rate, false positive rate, latency, and on-device footprint.
  • Evaluate model iterations across threat categories and deployment environments.
  • Partner with agent, device, and platform teams to integrate safety models into mobile agents, XR/AR assistants, and cloud agentic workflows.
  • Close the loop between production incidents, model evaluation, and training data improvements.
  • Collaborate cross-functionally with security researchers, modeling teams, and product engineers.
  • Document technical methods and contribute to patents, publications, or open-source work where appropriate.
Skills and Qualifications
  • M.S. or Ph.D. in Computer Science, Machine Learning, Electrical Engineering, or related field, or B.S. with equivalent industry experience.
  • 3+ years of industry experience in ML engineering or applied AI research with ownership of production ML systems.
  • 2+ years of industry experience in software engineering.
  • Strong proficiency in Python and in PyTorch, JAX, or TensorFlow.
  • Strong software engineering fundamentals, including version control, testing, and reproducible experimentation.
  • Hands-on experience post-training LLMs using RLHF, DPO, RLAIF, or reward modeling.
  • Experience with reward design, preference data curation, and training stability.
  • Hands-on experience training and deploying classifier or guardrail models for safety, content moderation, abuse detection, or adversarial robustness.
  • Familiarity with prompt injection, jailbreak detection, and agentic AI threat models.
  • Experience with distributed training frameworks such as DeepSpeed, FSDP, or Accelerate.
  • Strong experience in machine learning engineering, applied AI research, and software engineering.
  • Strong understanding of safety model deployment, classifier training, and guardrail model training.
  • Strong analytical, documentation, and cross-functional collaboration skills.
Preferred Qualifications
  • Experience building safety or moderation systems for agentic AI.
  • Experience with tool-use guardrails, indirect prompt injection defenses, or output filtering for autonomous agents.
  • Experience with red-teaming, adversarial data generation, or automated attack pipelines such as GCG, PAIR, or generator-critic frameworks.
  • Experience with on-device or edge ML deployment using ExecuTorch, Core ML, TFLite, MLC-LLM, or vendor NPU toolchains.
  • Experience with model compression techniques such as quantization, distillation, or pruning for safety models.
  • Experience with telemetry, logging, or user-facing data systems on mobile, XR/AR, or consumer platforms.
  • Experience with privacy-preserving user data handling, including anonymization, on-device processing, or federated approaches.
  • Publications at top-tier ML, NLP, or security venues.
  • Patents or open-source contributions in safety, alignment, or AI security.

An Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against on the basis of disability.

Contractor benefits are available through our third-party Employer of Record (available upon completion of a waiting period for eligible engagements).

Benefits include medical, dental, vision, and 401(k).
