×
Register Here to Apply for Jobs or Post Jobs. X

Senior Inference Engineer - AI

Remote / Online - Candidates ideally in
Eagan, Dakota County, Minnesota, USA
Listing for: Thomson Reuters
Part Time, Remote/Work from Home position
Listed on 2026-04-29
Job specializations:
  • IT/Tech
    AI Engineer, Systems Engineer, Cloud Computing, Data Engineer
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below

About the Role

  • As a Senior Inference Engineer, AI
    , responsibilities include collaborating with Platform Engineering and Enterprise AI Services to product ionize, optimize, and scale AI and LLM workloads that power Thomson Reuters' AI driven products.
  • This role ensures that trained models—from classical ML to generative AI—run efficiently across Thomson Reuters' multi-cloud footprint (AWS, Azure, GCP, OCI), meet enterprise reliability requirements, and integrate with our data backbone (Snowflake, Open Search vector search, API managed model routing).
  • The successful candidate will help build the next generation of Thomson Reuters' AI infrastructure, working alongside cloud engineering, data engineering, product teams, and AI Services.
  • Optimize LLMs and ML models for high-performance inference using techniques such as quantization, pruning, distillation, and hardware-specific tuning.
  • Deploy and scale inference workloads on GPUs across AWS, Azure, GCP and internal Kubernetes clusters, ensuring predictable performance during peak traffic hours, especially during business hours.
  • Implement routing and failover strategies for OpenAI/Anthropic/Vertex AI traffic.
  • Integrate models into production-grade APIs supporting Thomson Reuters products and enterprise workflows.
  • Develop highly optimized environments and eliminate performance bottlenecks to reduce latency.
  • Collaborate with Platform Engineering teams (Landing Zones, Network, Storage, Compute, AI) to ensure inference workloads align with Thomson Reuters cloud-native patterns (AWS, Azure, GCP, OCI).
  • Build and optimize containerized inference pipelines using Kubernetes for large-scale distributed workloads.
  • Ensure compliance with Thomson Reuters AI standards for deployment, monitoring, governance, and drift detection.
  • Profile inference performance, identify GPU/CPU bottlenecks, and optimize compute utilization across heterogeneous hardware.
  • Implement observability and health monitoring for inference pipelines, ensuring reliability of enterprise AI services.
  • Collaborate with AI engineers to invent new quantization techniques, improve numerical precision, and explore non-standard architectures, and support the scale out of AI infrastructure during critical releases and global product rollouts.
  • Partner with Cloud Engineers (Azure, AWS, GCP) to develop guardrails and automation that support inference workloads.
About You
  • 5+ years of relevant experience.
  • Strong understanding of ML/LLM fundamentals and inference optimization techniques.
  • Hands-on experience with GPU programming (CUDA preferred), inference runtimes (Tensor

    RT, ONNX Runtime), and deep learning frameworks (PyTorch/Tensor Flow).
  • Proficiency in Python and at least one systems language (C++ strongly preferred for performance-critical inference paths).
  • Experience deploying AI workloads to AWS/GCP/Azure and Kubernetes.
  • Familiarity with vector search systems (Open Search vectors) and retrieval augmented generation pipelines.
  • Knowledge of distributed systems, microservices, CI/CD, and cloud-native architecture.
What's in it For You?
  • Hybrid Work Model: We've adopted a flexible hybrid working environment (2-3 days a week in the office depending on the role) for our office-based roles while delivering a seamless experience that is digitally and physically connected.
  • Flexibility & Work-Life Balance: Flex My Way is a set of supportive workplace policies designed to help manage personal and professional responsibilities, including work from anywhere for up to 8 weeks per year.
  • Career Development and Growth: We foster a culture of continuous learning and skill development, with Grow My Way programming to help you grow, lead, and thrive in an AI-enabled future.
  • Industry Competitive Benefits: Comprehensive benefit plans including flexible vacation, mental health days, Headspace, retirement savings, tuition reimbursement, and wellbeing resources.
  • Culture: Globally recognized for inclusion and belonging, with values:
    Obsess over our Customers, Compete to Win, Challenge (Y) our Thinking, Act Fast / Learn Fast, and Stronger Together.
  • Social Impact: Two paid volunteer days off annually and opportunities for pro-bono projects and ESG initiatives.
  • Making a Real-World Impact: Thomson Reuters helps customers pursue justice, truth, and transparency, providing trusted, unbiased information globally.
#J-18808-Ljbffr
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary