Software Engineer - Data Engineering, Machine Learning (ML)

Job in Arlington, Arlington County, Virginia, 22201, USA
Listing for: AAMVA (American Association of Motor Vehicle Administrators)
Full Time position
Listed on 2026-06-03
Job specializations:
  • IT/Tech
    Data Engineer, Machine Learning/ML Engineer
Salary/Wage Range or Industry Benchmark: 100,000 - 125,000 USD yearly
Job Description & How to Apply Below
Position: Software Engineer-Data Engineering, Machine Learning (ML)

Position Summary

The IT Division is responsible for the development and operation of information systems for state and federal agencies whose business relates to, or uses information from, the administration of motor vehicles and driver licenses. The Machine Learning (ML) Data Engineer is responsible for the design, development, deployment, and operational support of machine learning solutions on cloud infrastructure. This covers the full model lifecycle, from data acquisition and dataset preparation through feature engineering, experimentation, model training, validation, production deployment, and ongoing monitoring.

Current applications include anomaly detection across high-volume messaging networks, but the scope encompasses any ML capability that strengthens system reliability, operational intelligence, and data-driven decision making across AAMVA systems.

Essential Duties and Responsibilities

We are seeking a talented Data Engineer with machine learning experience to join our team. You will design, build, and operationalize ML solutions on cloud infrastructure (Azure or AWS), working across the full model lifecycle: preparing datasets, engineering features, running experiments, deploying models to production, and operating them once deployed.

Key Responsibilities
  • Designing and building dataset preparation pipelines — acquiring, cleaning, transforming, and versioning data for ML training and evaluation
  • Engineering features that extract meaningful signals from structured and semi‑structured data sources (time‑series patterns, statistical profiles, categorical encodings)
  • Running structured experimentation — testing multiple algorithms against defined scenarios, measuring performance, and documenting findings
  • Training, evaluating, and tuning ML models including regression, classification, clustering, anomaly detection, and ensemble methods
  • Deploying models to production on cloud infrastructure and building the pipelines that keep them running (re‑training, scoring, threshold management)
  • Monitoring model performance in production — tracking drift, false positive rates, and detection efficacy over time
  • Building and maintaining batch and streaming data pipelines that feed ML systems, using Synapse, Fabric, Spark, and Event Hubs
  • Writing and optimizing analytical queries (SQL, KQL, PySpark) for data exploration, statistical profiling, and real‑time analysis
  • Creating validation frameworks — synthetic test data generation, backtesting against historical logs, and shadow‑mode evaluation
  • Building dashboards and visualizations that communicate model outputs to technical and non‑technical stakeholders
  • Collaborating with cross‑functional teams to identify ML opportunities and translate operational problems into data solutions
  • Communicating findings, trade‑offs, and model behavior clearly to technical and non‑technical audiences across IT, operations, and leadership

Direct Reports

None

Qualifications

Formal Education:

Bachelor's degree in computer science, data science, statistics, mathematics, or related quantitative field. Equivalent work experience may be substituted.

Key Knowledge, Skills, and Abilities
  • 3–5 years of hands‑on experience in data engineering, ML engineering, or applied analytics.
  • Hands‑on cloud platform experience (Azure or AWS) building and deploying data or ML solutions on managed cloud services; specific platform less important than depth of experience.
  • Working knowledge of statistical foundations: distributions, variance, standard deviation, trend vs. seasonality, hypothesis testing, and how to apply them to real operational data.
  • Experience with the ML experiment‑to‑production cycle: dataset preparation, feature engineering, model training, evaluation, and deployment.
  • Proficiency in Python for data processing, statistical analysis, and ML model development.
  • Strong SQL skills with understanding of relational database fundamentals: data modeling, query optimization, indexing strategies, and how SQL Server infrastructure supports production workloads (T‑SQL, stored procedures, Availability Groups).
  • Experience building data pipelines that handle batch and…