More jobs:
LLM Infrastructure Engineer
Job in
Houston, Harris County, Texas, 77246, USA
Listed on 2026-03-15
Listing for:
AMSYS Innovative Solutions
Full Time
position Listed on 2026-03-15
Job specializations:
-
Software Development
AI Engineer, Machine Learning/ ML Engineer
Job Description & How to Apply Below
We are looking for a Senior Python / AI API Engineer to build and deploy production-grade services powering Large Language Model (LLM) applications. This role focuses on developing high-performance APIs for model inference, optimizing GPU workloads, and deploying AI services in cloud environments.
This is an engineering-focused role, not research. We are looking for someone who has built and shipped AI systems into production and understands the challenges of scalable inference and model serving.
Key Responsibilities- Develop high-performance APIs using Python (3.10+) and FastAPI
- Build and deploy LLM inference services using Hugging Face Transformers and Py Torch
- Optimize GPU workloads and CUDA memory usage
- Implement streaming inference APIs for real-time model responses
- Containerize and deploy services using Docker and GPU-enabled infrastructure
- Deploy AI workloads in Azure environments (AKS, ACI, or Container Apps)
- Hands-on experience building production APIs with FastAPI
- Experience with Hugging Face Transformers and Py Torch
- Solid understanding of REST API design
- Experience deploying containerized applications with Docker
- Experience with OpenAI-compatible APIs, vLLM, or Text Generation Inference (TGI)
- Experience deploying AI workloads on Azure GPU infrastructure
- Familiarity with LoRA / PEFT fine-tuning
- Exposure to legal or financial NLP use cases
A hands-on engineer who understands how LLM systems run in production—from model loading and tokenization to GPU deployment and scalable APIs.
#J-18808-LjbffrTo View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×