×
Register Here to Apply for Jobs or Post Jobs. X
More jobs:

Senior AI Engineer

Job in Frisco, Collin County, Texas, 75034, USA
Listing for: SoFi
Full Time position
Listed on 2026-05-16
Job specializations:
  • Software Development
    AI Engineer
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below
Position: Senior Staff AI Engineer

Staff AI Engineer – So Fi

What you’ll do
  • Architecture and Strategy:
    Define the long‑term technical architecture and strategy for our next‑generation AI platform, focusing on robust, scalable agentic frameworks and LLM deployment patterns.
  • Advanced LLM Orchestration:
    Architect and standardize the use of graph‑based LLM orchestration, leveraging expert‑level mastery of Lang Graph to solve highly complex, multi‑stage reasoning problems at scale.
  • Distributed Agent Memory & State:
    Develop robust, persistent infrastructure for agentic state management, ensuring long‑running agent workflows maintain context and reliability across distributed nodes and regional failovers.
  • Deep Model Optimization:
    Pioneer advanced parameter‑efficient fine‑tuning (PEFT) and compression techniques to maximize model performance and minimize operational costs across the organization.
  • Model Serving Infrastructure:
    Support the development of a unified model serving platform designed to host internally fine‑tuned and custom‑trained models to ensure high‑throughput, low‑latency inference across diverse hardware footprints.
  • Operational Excellence:
    Define and enforce high standards for AI operationalization, requiring mastery in designing and deploying comprehensive AI observability solutions and advanced tracing/testing frameworks that guarantee production quality, compliance, and reliability.
  • Mentorship:
    Mentor senior and junior AI Engineers, elevating the overall engineering quality.
  • Cross‑Functional

    Collaboration:

    Coordinate with cross‑functional teams to distill specific requirements, project roadmaps, and ensure accurate and on‑time project deliveries.
  • AI Innovation:
    Stay up‑to‑date with the latest trends and advancements in GenAI, LLMs, and NLP, evaluating and experimenting with new techniques and tools to push the boundaries of AI innovation in the banking sector.
What you’ll need
  • Bachelor’s or Master’s degree in Computer Science, Data Science, AI, Machine Learning, or a related field. PhD is a plus.
  • 8+ years software development experience, with 3+ years of hands‑on experience in developing and successfully deploying production‑level AI applications used by real customers or internal stakeholders.
  • Expert‑level experience with Lang Graph to model and orchestrate complex, stateful multi‑step reasoning and control flow in LLM applications.
  • Expert‑level proficiency in developing sophisticated agentic solutions, with a portfolio demonstrating advanced use of planning, memory management, tool integration, and control flow.
  • Deep understanding of Large Language Model (LLM) architectures, prompt engineering, retrieval‑augmented generation (RAG), and advanced text generation techniques.
  • Proven experience implementing parameter‑efficient fine‑tuning (PEFT) techniques (e.g., LoRA) to customize and optimize pre‑trained models for specific tasks with minimal computational overhead.
  • Deep expertise in building or extending inference engines (e.g., vLLM, NVIDIA Triton, or TGI) and managing the underlying Kubernetes/GPU orchestration for custom model deployments.
  • Deep experience designing and institutionalizing AI observability solutions (e.g., Lang Smith, Arize, Deepchecks) and advanced tracing and testing methodologies for LLM and agentic systems.
  • Experience with cloud platforms (AWS, Azure, or GCP) and containerization technologies (Docker, Kubernetes).
  • Expert level Python is required.
  • React is strongly preferred.
  • Experience with large‑scale data handling, including unstructured and structured data pipelines, with a strong preference for Snowflake and Dynamo

    DB.
  • Experience developing and integrating AI‑powered APIs and microservices architecture into banking applications.
  • Experience with vector databases and retrieval‑augmented generation (RAG) techniques using systems like Elasticsearch, Pinecone, or FAISS for enhancing LLM performance.
  • Exceptional ability to communicate complex technical concepts, drive consensus among senior technical leaders, and influence organizational AI strategy.
  • Strong analytical and problem‑solving skills with attention to detail and an ability to work with complex, large‑scale systems.
  • Strong collaboration skills, with experience…
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary