More jobs:
Senior AI Tools Engineer, SRE Operations - GeForce
Job in
California, Moniteau County, Missouri, 65018, USA
Listed on 2026-06-03
Listing for:
NVIDIA Gruppe
Full Time
position Listed on 2026-06-03
Job specializations:
-
IT/Tech
AI Engineer, Data Engineer, Cloud Computing
Job Description & How to Apply Below
Location: California
Responsibilities
What you will be doing:
- Build and implement robust AI/ML tools capable of analyzing production data to identify root causes for complex incidents and identify future operational trends.
- Lead the development of brand-new LLM- and Agent-based systems to improve operational efficiency.
- Establish and maintain excellent data management practices, including building pipelines to transform and handle large-scale data sources vital for model development.
- Take charge of and enhance LLM-based pipelines while integrating a strong grasp of LLM progress into product development.
- Act as a resident authority on AI Frameworks, recommending the best platforms, toolsets, and architectural approaches to ensure the long-term technical sustainability of the product.
- B.S. in Computer Science, Statistics, or Engineering (or equivalent experience), and 5+ years of experience.
- Strong proficiency in Python; familiarity with Go or other systems languages is a plus.
- Practical experience building, optimizing, and deploying AI tools.
- Strong knowledge of the AI space and current developments, including understanding how LLM-based platforms are built, optimized, and which platforms work best.
- Hands‑on experience with container orchestration (Kubernetes) and cloud environments (AWS cloud).
- Active engagement with developments in the AI field and the ability to distinguish meaningful advances from noise when making technical decisions.
- Expertise in automation and handling large-scale data pipelines.
- Experience applying monitoring and visualization tools, such as Grafana, to interact with data.
- Excellent ability to handle data sources and pipelines to transform and manage data.
- Understanding of SRE principles and experience managing production environments.
- Strong in LLM improvement pipelines as well as a strong grasp of recent developments in LLM training.
- Someone with excellent knowledge of LLMs and AI Models who can reason and recommend an approach that sustains the team and product long term. This person helps prevent grave mistakes by avoiding the wrong platform choice.
- Understanding of SRE concepts and managing production environments as well as experience with Kubernetes, AWS, and other cloud technologies.
- Proficiency in automation.
Base salary range: $144,000 - $230,000 per year. Eligible for equity and benefits.
Equal Employment OpportunityNVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. We do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
#J-18808-LjbffrPosition Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×