×
Register Here to Apply for Jobs or Post Jobs. X

Senior Machine Learning Engineer, GenAI Data

Job in San Mateo, San Mateo County, California, 94409, USA
Listing for: Roblox
Full Time position
Listed on 2026-06-06
Job specializations:
  • Software Development
    Data Engineer, AI Engineer, Data Scientist, Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators.

At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there.

A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.

As a Senior Software Engineer on the Foundation AI organization, you will sit at the epicenter of our foundation model efforts. While the research world is focused on architecture, you’ll be the architect of the data flywheel that makes Video Gen and 3DGen possible. You aren’t just building pipelines; you are building the infrastructure that defines how our models perceive and generate virtual worlds in three dimensions and across time.

In this role, you will partner directly with our AI researchers to advance beyond experimental datasets and into the realm of dynamic, high-fidelity data synthesis and evaluation. You will bridge the gap between research prototypes working locally to scaling for millions of users. You will design, implement, and scale robust, high-performance infrastructure to crawl, create, curate, store, and serve the massive datasets required for these models.

We are seeking accomplished software engineers with a passion for data, experience building large distributed systems, and a commitment to writing high-quality, well-tested code to solve complex data challenges r contributions will ensure that our foundation models receive the highest quality data, thereby supporting the next generation of creative AI.

You Will
  • High-Scale Data Orchestration:
    Architect and maintain automated pipelines for the ingestion, cleaning, and pre-processing of multi-modal datasets (video, 3D,) spanning petabytes of data
  • Synthetic Data Generation:
    Leverage image and video generation models to scale multi-modal synthetic datasets
  • Research-to-Production Bridge:
    Partner with research teams to create training data for research experiments – research and implement synthetic data creation pipelines
  • Scalable Evaluation Frameworks:
    Build and own evaluation—automating both heuristic-based metrics and human-in-the-loop interfaces to evaluate and benchmark training datasets and in-house foundation models
  • Model Deployment & API Architecture:
    Design and optimize high-throughput, low-latency inference APIs for internal and external consumer access
  • Autonomous SOTA Tracking:
    Actively participate in literature reviews and paper reading groups to identify and implement the latest optimizations in generative modeling
  • Resource Efficiency & Observability:
    Implement monitoring pipeline health, optimize data loading to ensure GPUs are used efficiently
You Have
  • 8+ years of experience as a research-focused data systems engineer (preferably working with 3D and video foundation models)
  • Expertise in building scalable ML data pipelines for both batch and real-time environments. Experience working with and processing very large datasets (petabytes or more).
  • Versatile:
    Generalist and comfortable with several languages and technologies; adaptable in any situation
  • Team-Player & Technical Leader:
    Collaborative team member who mentors peers, drives technical excellence, and takes ownership of leading and delivering key features and projects across team boundaries
  • Python Proficiency:
    Write high-quality Python code for automation, tooling, and infrastructure management
  • Experience with cloud data platforms and distributed processing technologies (e.g., Spark, Ray, Kubeflow, S3, etc.)
  • Passionate about the potential of generative AI, particularly in creative domains like 3D/4D content
  • A Bachelor's degree or equivalent experience in Computer Science, Computer Engineering, or a similar…
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary