B4 Lead Data Scientist; GenAI
Listed on 2026-02-21
-
IT/Tech
AI Engineer, Machine Learning/ ML Engineer, Data Scientist, Data Analyst
Select how often (in days) to receive an alert:
Requisition 39619: B4 Lead Data Scientist (GenAI)
A resume helps you stand out to hiring managers and recruiters; your resume communicates your experience and your brand. While it is not required, we encourage you to include an up-to-date resume along with a completed job application to give you the best opportunity to be considered. A complete resume helps us to better understand your unique background, relevant experiences, and passions.
We look forward to learning about you.
Norfolk Southern offers a unique opportunity to be part of our proud legacy that spans nearly 200 years. We are a customer-centric, operations-driven team dedicated to advancing safety, serving communities, and driving innovation for tomorrow's rail. As part of Norfolk Southern, you’ll join a collaborative team where there are opportunities for growth across the organization. We are building a culture where everyone can thrive by owning and driving exceptional results, being humble and leading with trust, serving our customers with excellence, and collaborating and coaching to win.
Job DescriptionWho we are and what we do:
Norfolk Southern Corporation is seeking a Lead Data Scientist (GenAI) to join our enterprise AI team. This role focuses on developing, optimizing, and enhancing intelligent solutions using Generative AI technologies, including Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and Model Context Protocol (MCP) servers. The ideal candidate will partner with technical and business leaders across the enterprise to deliver scalable, secure, and high-performing AI capabilities that drive business value.
The position involves working with cloud-based platforms, modern AI/ML frameworks, and collaborating across business units to solve complex problems.
What kind of problems do we solve:
- Document Intelligence: Analyze and extract insights from thousands of corporate documents, reports, and operational data using advanced NLP techniques.
- Conversational AI: Build intelligent chat systems that help employees retrieve information and interact with enterprise data.
- GenAI Solutions: Creating and deploying Generative AI models for document processing, visual data analysis, and natural language understanding.
- Model Context Protocol (MCP): Enhance AI agent capabilities through custom tool integrations and context-aware servers.
- Lead design and architecture of enterprise GenAI solutions tailored to business use cases.
- Design modular, framework‑agnostic LLM pipelines using libraries such as Lang Chain, Llama Index, or similar.
- Define prompting strategies, agent patterns, and RAG architectures. Ensure approaches are scalable, reusable company‑wide and drive adoption across teams.
- Oversee lifecycle of usecases:
Development to Deployment to Production and MLops. - Ensure responsible AI practices, including explainability, governance, and compliance.
- Evaluate emerging AI trends and help guide the adoption of high value innovations.
- Collaborate with cross-functional teams to understand business needs and deliver configurable, scalable solutions.
- Bachelor’s in computer science, data science, machine learning, NLP, AI, linguistics, or related field required.
- 4-7+ years of experience in Data Science, Machine Learning, NLP or AI positions.
- Extensive background in building, fine tuning and operationalizing large language models (LLMs) in production environment.
- Proficiency in Python, including libraries such as Scikit-learn, PyTorch, Pandas, Num Py, spaCy, NLTK, and Matplotlib.
- Skilled in SQL, No
SQL, Milvus, Pinecone, PGVector databases.
- Advanced degree, e.g., Ph.D. or M.S. computer science, data science, machine learning, NLP, AI, linguistics, or related field preferred.
- Highly experienced in creating Agentic LLMs and knowledge of orchestration frameworks such as Lang Graph.
- NLP and AI/ML Frameworks:
Experience with training NLP models from scratch, working with Large Language Models, and using libraries such as spaCy, NLTK, and Transformers. - Highly experienced with cloud platforms, preferably AWS (ECS, S3, Lambda, Bedrock,…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).