Tech Lead, LLM & Generative AI; Full Remote
Pine Bluff, Jefferson County, Arkansas, 71601, USA
Listed on 2026-06-02
IT/Tech
AI Engineer
The Opportunity
EverAI is building conversational intelligence at massive scale. We process 80 million tokens per day and are looking for a Tech Lead to take the helm of our LLM team (currently 3 engineers) and own the architecture, training, and deployment of the models that power our core product.
Why this role is distinct
Massive Scale: Solve latency, throughput, and context management challenges at a scale of 80M+ daily tokens.
Production Focus: This is not a research sandbox. If you fine‑tune a model today, it impacts millions of users tomorrow.
Nuanced Engineering: Because we are uncensored, we cannot rely on blanket safety filters. Build sophisticated classifiers and alignment strategies to balance user freedom with strict safety guardrails.
1. Ship Code & Lead from the Front
Act as a player/coach: architect the system and mentor the team, while spending time hands‑on in the codebase (Python/PyTorch).
Own the core chat loop: optimize context windows, memory/RAG retrieval, and inference latency to ensure a real‑time experience.
2. Own the Model Lifecycle
Drive strategy for SFT (Supervised Fine‑Tuning) and RLHF/DPO (Preference Optimization). Decide when to prompt, fine‑tune, or build a new RAG pipeline.
Manage the data engine: oversee the sourcing, labeling, and cleaning of diverse datasets to improve model steerability and multicultural performance.
3. Architect High‑Precision Moderation
Build the immune system of the platform: design and train custom classifiers to detect and filter non‑consensual or illegal content within an explicit environment.
Move beyond binary flags to create nuanced, context‑aware moderation systems.
The Non‑Negotiables:
You have shipped AI at scale. 8+ years of engineering experience, with a significant portion dedicated to shipping ML/LLM features to millions of active users.
You are deeply technical. Proficient in Python/PyTorch and comfortable with the modern LLM stack (vLLM, Hugging Face, fine‑tuning pipelines, evaluation frameworks).
You understand the Uncensored challenge. Comfortable working with NSFW content and the technical rigor required to moderate it effectively.
Intuition for Alignment. Understand prompt conditioning, temperature, and sampling to shape a chatbot’s behavior.
Doer Mindset. Value velocity and know the difference between an academic solution and a shippable production solution.
Owner. Obsess over metrics, regressions, and user experience long after code is merged.
Scale: 80M+ daily tokens.
Team: 1 lead + 3 AI engineers + DevOps/Web collaborators.
Culture: High autonomy, low bureaucracy, direct line to the CTO.
Contract Type: Prefer B2B, but flexible for long‑term impact.
Work From Anywhere: Fully remote. Choose your environment.
Paid Time Off: 4 weeks (20 working days) PTO per year.
Annual Gathering: Yearly in‑person meetup.
Health & Wellness Support: Monthly allowance of $100 for health insurance expenses and unlimited 1:1 sessions with psychologists and lifestyle experts (open to three family members).
Co‑Working Space Budget: Up to two visits per month to a co‑working space ($35-$40 per visit).
Learning Budget: Dedicated funds for courses, books, conferences, events, or certifications.
Equipment: Company laptop + monitor budget up to $250.
AI Tools Access: Premium access to ChatGPT, Cursor, Hugging Face, Claude Code, and other tools needed to excel.