Senior Inference Engineer - AI
Remote / Online - Candidates ideally in
Eagan, Dakota County, Minnesota, USA
Listed on 2026-04-29
Listing for:
Thomson Reuters
Part Time, Remote/Work from Home position
Job specializations:
- IT/Tech: AI Engineer, Systems Engineer, Cloud Computing, Data Engineer
Job Description & How to Apply Below
About the Role
- As a Senior Inference Engineer, AI, responsibilities include collaborating with Platform Engineering and Enterprise AI Services to productionize, optimize, and scale AI and LLM workloads that power Thomson Reuters' AI-driven products.
- This role ensures that trained models, from classical ML to generative AI, run efficiently across Thomson Reuters' multi-cloud footprint (AWS, Azure, GCP, OCI), meet enterprise reliability requirements, and integrate with our data backbone (Snowflake, OpenSearch vector search, API-managed model routing).
- The successful candidate will help build the next generation of Thomson Reuters' AI infrastructure, working alongside cloud engineering, data engineering, product teams, and AI Services.
- Optimize LLMs and ML models for high-performance inference using techniques such as quantization, pruning, distillation, and hardware-specific tuning.
- Deploy and scale inference workloads on GPUs across AWS, Azure, GCP, and internal Kubernetes clusters, ensuring predictable performance during peak traffic, especially business hours.
- Implement routing and failover strategies for OpenAI/Anthropic/Vertex AI traffic.
- Integrate models into production-grade APIs supporting Thomson Reuters products and enterprise workflows.
- Develop highly optimized environments and eliminate performance bottlenecks to reduce latency.
- Collaborate with Platform Engineering teams (Landing Zones, Network, Storage, Compute, AI) to ensure inference workloads align with Thomson Reuters cloud-native patterns (AWS, Azure, GCP, OCI).
- Build and optimize containerized inference pipelines using Kubernetes for large-scale distributed workloads.
- Ensure compliance with Thomson Reuters AI standards for deployment, monitoring, governance, and drift detection.
- Profile inference performance, identify GPU/CPU bottlenecks, and optimize compute utilization across heterogeneous hardware.
- Implement observability and health monitoring for inference pipelines, ensuring reliability of enterprise AI services.
- Collaborate with AI engineers to invent new quantization techniques, improve numerical precision, and explore non-standard architectures.
- Support the scale-out of AI infrastructure during critical releases and global product rollouts.
- Partner with Cloud Engineers (Azure, AWS, GCP) to develop guardrails and automation that support inference workloads.
- 5+ years of relevant experience.
- Strong understanding of ML/LLM fundamentals and inference optimization techniques.
- Hands-on experience with GPU programming (CUDA preferred), inference runtimes (TensorRT, ONNX Runtime), and deep learning frameworks (PyTorch/TensorFlow).
- Proficiency in Python and at least one systems language (C++ strongly preferred for performance-critical inference paths).
- Experience deploying AI workloads to AWS/GCP/Azure and Kubernetes.
- Familiarity with vector search systems (OpenSearch vectors) and retrieval-augmented generation pipelines.
- Knowledge of distributed systems, microservices, CI/CD, and cloud-native architecture.
- Hybrid Work Model: We've adopted a flexible hybrid working environment (2-3 days a week in the office depending on the role) for our office-based roles while delivering a seamless experience that is digitally and physically connected.
- Flexibility & Work-Life Balance: Flex My Way is a set of supportive workplace policies designed to help manage personal and professional responsibilities, including work from anywhere for up to 8 weeks per year.
- Career Development and Growth: We foster a culture of continuous learning and skill development, with Grow My Way programming to help you grow, lead, and thrive in an AI-enabled future.
- Industry Competitive Benefits: Comprehensive benefit plans including flexible vacation, mental health days, Headspace, retirement savings, tuition reimbursement, and wellbeing resources.
- Culture: Globally recognized for inclusion and belonging, with values: Obsess over our Customers, Compete to Win, Challenge (Y)our Thinking, Act Fast / Learn Fast, and Stronger Together.
- Social Impact: Two paid volunteer days off annually and opportunities for pro-bono projects and ESG initiatives.
- Making a Real-World Impact: Thomson Reuters helps customers pursue justice, truth, and transparency, providing trusted, unbiased information globally.
Position Requirements
10+ Years work experience