Senior Language Engineer, Artificial General Intelligence - Data Services
Listed on 2026-01-09
-
IT/Tech
Data Scientist, AI Engineer, Data Analyst, Machine Learning/ ML Engineer
Description
Amazon Artificial General Intelligence (AGI) Data Services organization is responsible for developing diverse datasets to train and evaluate the Amazon AI models. We are looking for Senior Language Engineers to join our science and engineering team to support the development of complex, multimodal datasets, using a range of approaches including synthetic data generation, model-supported data generation, and human-in-the-loop data collections.
You will play a critical role in driving innovation and advancing the state-of-the-art in evaluating and training AI models. You will work closely with cross-functional teams, including product managers, engineers, and data scientists to ensure that our AI systems are best in class.
Key job responsibilities- Define and lead the organization's data creation strategies for our science partners
- Design and lead complex data collections with human participants in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
- Design and conduct complex data creation tasks using synthetic and model-based data generation methods, following state-of-the-art approaches
- Analyze and extract insights from large amounts of data
- Build tools or tool prototypes for data analysis or data creation, using Python or another scripting language
- Use modeling tools to bootstrap or test new AI functionalities
- Collaborate with scientists, software engineers, and other data creators to evaluate performance of AI models
Amazon strives to be the world's most customer-centric company, where customers can research and purchase anything they might want online or offline. We set big goals and are looking for people who can help us reach and exceed them. The AGI organization provides AI capabilities for a variety of Amazon products and searches. We provide secure, flexible, cost effective, and high-quality data development services to our customers, that enables them to build advanced ML models.
BasicQualifications
- Master's degree in a relevant field (Computational Linguistics or equivalent field with computational analysis)
- PhD in Computational Linguistics (or equivalent field with computational emphasis)
- 5+ years of experience creating AI datasets for complex and quickly evolving requirements using a range of approaches: model-based, human in the loop, synthetic/code-based, etc.
- 5+ years of experience working with speech, text, and multimodal data, including in multiple languages
- 5+ years of experience defining and leading cross-team data creation strategies for long‑term science customers
- 5+ years of experience with Machine Learning training and evaluations, specifically regarding the types of data needed for different training types
- Familiarity with technical concepts such as APIs, knowledge of version control and agile development procedures, familiarity with database queries and data analysis processes (SQL, R, Matlab, etc.)
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Los Angeles County applicants:
Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position.
These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company's reputation. Pursuant to the Los Angeles County Fair Chance…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).