×
Register Here to Apply for Jobs or Post Jobs. X

Software Engineer, Data Infrastructure

Job in Washington, District of Columbia, 20022, USA
Listing for: TryApplyNow
Full Time position
Listed on 2026-06-25
Job specializations:
  • Software Development
    Software Engineer, AI Engineer (Applied/Software), Data Engineering, Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 186400 - 233000 USD Yearly USD 186400.00 233000.00 YEAR
Job Description & How to Apply Below
# Software Engineer, Data Infrastructure Scale AIFull Timemid Washington, District of Columbia, USPosted Yesterday##

Role Overview Scale AI is hiring a mid-level Software Engineer, Data Infrastructure. This is a full-time role in Washington. posted yesterday. Full responsibilities, required qualifications, and the apply link are listed in the description below.## Resume Keywords to Include Make sure these keywords appear in your resume to improve ATS scoring

Python Java Go Rust Spark Telemetry OR Compensation

Sign up free to auto-tailor your resume with all these keywords and get a higher ATS score##

Job Description Scale AI is seeking a highly skilled and motivated Mission Software Engineer to join our dynamic Federal Engineering team. As a part of this team, you will play a critical role in supporting Scale’s government customers by scoping and developing onsite solutions. Our scalable, high-performance platform is the foundation for these customer solutions, and your expertise will be instrumental in designing and implementing systems that can handle interactions with existing customer systems to help our products integrate into existing customer workflows.

The Role
* We are looking for an exceptional Senior Software Engineer to architect and build the foundational data infrastructure that will serve as the brain of a project ecosystem.
* We are not looking for someone to stitch together off-the-shelf data frameworks. You will be responsible for designing highly novel data models and processing pipelines capable of handling massive quantities of output data from complex simulations.
* At the core of this role is the challenge of building a foundational data ensemble—a unified architecture that seamlessly aggregates, structures, and stages diverse sources of simulation outputs and user inputs. Your systems will manage enormous batch throughput jobs with strict, minimal latency requirements, ensuring that downstream AI systems and language models have the exact context they need to action ably reason over complex, multi-dimensional scenarios.

Key Responsibilities
* Architect the Data Ensemble:
Design and implement the architecture to ensemble various sources of injected context (deeply structural simulation data, historical game states, and dynamic user inputs) into a unified, highly queryable format optimized for LLM consumption.
* Massive Batch

Infrastructure: Build highly scalable, resilient data architectures from scratch. You will optimize for moving, transforming, and processing massive quantities of simulation output data via enormous batch jobs, maintaining the minimal latency required for rapid wargame iterations.
* Complex Data Modeling:
Design sophisticated, highly relational data models that accurately represent massive, state-based simulation environments, making them easily interpretable by machine learning models.
* First-Principles Problem Solving:
Navigate highly ambiguous product requirements to design custom, ground-up systems where existing open-source or enterprise tools simply cannot handle the structural complexity or scale.
* Technical Leadership:
Set the technical standard for the data infrastructure team, driving rigorous code quality, system performance, and architectural clarity.

What We’re Looking For

* Experience:

5+ years of backend or data infrastructure experience, operating at a Senior, Staff, or Principal level.
* Engineering Excellence:
Deep, expert-level proficiency in systems languages (e.g., Rust, Go, C++, or highly optimized Python/Java, Spark) and a fundamental understanding of memory management, compute limits, and distributed systems architecture.
* High-Throughput / Low-Latency Data:
Proven track record of processing massive datasets. You understand how to optimize massive batch jobs and parallel processing across distributed simulation nodes without sacrificing speed.
* Information Retrieval & Context Surfacing:
You don't need a background in AI agents, but you must be an expert in surfacing the right needle from an ocean of hay to feed decision-making engines. We highly value engineers with backgrounds in:
* Search & Rec Sys:
Building complex information…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary