Sr Data Scientist GenAI
Job in
Austin, Travis County, Texas, 73301, USA
Listed on 2026-05-30
Listing for:
Select Minds LLC
Full Time position
Job specializations:
- IT/Tech
- AI Engineer, Machine Learning/ML Engineer, Data Scientist, Data Analyst
Job Description & How to Apply Below
Overview
Sr Data Scientist (NLP / LLM / Generative AI)
Location: Dallas, TX
- Competitive salary
- Flexible schedule
- Opportunity for advancement
- Design, build, fine-tune, and deploy LLMs, transformer-based NLP models, and GenAI solutions for both batch and real-time/streaming contexts.
- Own all major components of ML pipelines: data ingestion, cleaning, pre-processing (structured & unstructured), embedding, search & retrieval, prompt engineering, RAG (Retrieval-Augmented Generation).
- Collaborate closely with ML engineers, MLOps, software engineering, product, compliance, legal, and other teams to move models from prototype to production, ensuring reliability, scalability, monitoring, and maintainability.
- Define and implement evaluation frameworks covering accuracy, bias, fairness, hallucination, consistency, and latency; run UAT, stress tests, and drift detection.
- Optimize models and pipelines for performance, cost, and efficiency.
- Ensure best practices in model development: version control, repeatability, documentation, governance, and ethical AI use.
- Mentor more junior data scientists; help build team skills in NLP, GenAI practices, prompt engineering, and fine-tuning.
- Identify new use cases; prototype innovations in GenAI/NLP; keep up with the latest research and open-source developments, and decide what to adopt.
- 10+ years of experience in data science / ML, with substantial work in NLP, LLMs, or Generative AI.
- Deep hands-on experience in Python, using frameworks such as PyTorch, TensorFlow, and Hugging Face.
- Proven track record building transformer, NLP, and LLM models; experience with fine-tuning and prompt engineering.
- Solid experience with information retrieval / search: keyword + semantic search, embeddings, vector databases.
- Experience working in production / deploying models (batch and streaming), working with MLOps practices.
- Strong algorithmic / statistical / mathematical fundamentals. Ability to reason about model behaviour, bias, uncertainty.
- Good communicator: able to translate complex technical detail to business / non-technical stakeholders.
- Master's in Computer Science, Computational Linguistics, Statistics, Machine Learning or related field.
- Experience with multimodal models (vision + text) or emerging LLMs and agent-based systems.
- Experience with open-source LLMs and toolkits; familiarity with LangChain or similar frameworks.
- Prior experience in regulated environments (finance, risk, legal, compliance) with strong governance and privacy requirements.
Work remotely temporarily due to COVID-19.