AI Engineer; On-site
Listed on 2026-02-15
-
IT/Tech
AI Engineer, Data Scientist, Machine Learning/ ML Engineer, Data Analyst
Location: New York
Apply now
Xenoss is an AI engineering and integration services company, helping medium to large enterprises run AI transformation end-to-end, from situation analysis and goals framing to data discovery and preparation, pipeline building, model development, retraining pipeline design, solution deployment, and support.
We build a broad spectrum of AI solutions such as user behaviour prediction, content generation, NLP, audience segmentation, pathfinding solutions, AI assistants, edge computer vision, fraud detection, and others.
We work with prominent companies such as Microsoft, Toshiba, AstraZeneca, Activision Blizzard, Verve Group, Voodoo Games, and Telefonica, among others.
We’re included in the top 100 software companies on the Inc. 5000 list.
About the roleWe’re hiring a Staff AI Engineer to lead fine-tuning and domain adaptation of large language models on top of one of the most complex enterprise datasets you can work with: a multi-year archive of real customer conversations from a global financial institution.
This role sits at the intersection of speech, language, and operational decision systems. The objective is not generic chatbot improvement. The objective is to turn raw audio and transcripts into production-grade intelligence: classification, intent detection, risk signals, quality insights, and agent copilots.
You will own how these models are trained, evaluated, and ope rationalised.
What you will doYou’ll operate across the full fine-tuning lifecycle from dataset engineering to model deployment.
You’ll work with large volumes of audio and transcript data, transforming unstructured conversational artefacts into structured instruction datasets suitable for supervised and alignment training.
Core work includes:
- Designing fine-tuning strategies for conversational financial data
- Structuring transcript corpora into task-ready training formats
- Running LoRA / QLoRA training pipelines on open-weight LLMs
- Defining evaluation frameworks and quality benchmarks
- Leading structured error analysis and iteration cycles
- Optimising models for latency, cost, and deployment constraints
- Partnering with MLOps on serving and monitoring
You’re expected to be deeply hands‑on in training infrastructure and experimentation.
This is not an oversight‑only Staff role.
Technology landscapeYou’ll operate within the modern open-model fine-tuning ecosystem, including, but not limited to:
- Open-weight LLMs (LLaMA, Mistral class)
- Parameter‑efficient training (LoRA / QLoRA)
- Alignment optimisation where relevant
- PyTorch training pipelines
- HF ecosystem (Transformers, TRL, PEFT)
- Quantisation and optimised inference runtimes
We optimise for production viability, not academic benchmarks.
Scope of ownership and delivery contextAt Staff level, you’ll own both the fine‑tuning architecture and its execution across the full lifecycle, from dataset engineering through production deployment.
Core ownership- Define fine‑tuning and iteration strategies
- Establish evaluation frameworks and acceptance criteria
- Drive trade‑offs between model quality, cost, and latency
- Act as an escalation point for performance and architecture decisions
- Work within a cross‑functional team spanning AI engineering, MLOps, data engineering, and client stakeholders
- Mentor engineers running training pipelines
- Partner with domain SMEs on labelling frameworks
- Hands‑on LLM fine‑tuning beyond
- Experience with conversational or speech‑derived corpora
- Deep familiarity with LoRA / QLoRA and PEFT methods
- Ability to design evaluation frameworks, not just run them
- Comfort working with messy enterprise data
- Experience deploying models into production stacks
- Financial services domain exposure
- Speech / ASR pipeline familiarity
- Model governance and auditability experience
See all our open positions and learn why your should consider joining the Xenoss team.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).