Lead Inference Platform Support Engineer - AI I

Thomson Reuters

📍 toronto, on, Canada 💼 Full-time 🕒 Posted June 04, 2026

Apply Now Similar Jobs

Job Description

About The Role Lead Inference Platform Engineer – specialized experience in machine learning/deep learning domains such as model compression, hardware‑aware model optimizations, hardware accelerators architecture, GPU/ASIC architecture, ML compilers, high‑performance computing, performance optimizations, numerics, or SW/HW co‑design. 
Responsibilities Optimize LLMs and ML models for high‑performance inference using quantization, pruning, distillation, and hardware specific tuning. 
Deploy and scale inference workloads on GPUs across AWS, Azure, GCP and internal Kubernetes clusters, ensuring predictable performance during peak traffic. 
Implement routing and fail‑over strategies for OpenAI / Anthropic / Vertex AI traffic. 
Integrate models into production‑grade APIs supporting TR products and enterprise workflows. 
Develop highly optimized environments and eliminate performance bottlenecks to reduce latency. 
            
            Ready to Apply?Submit your application today and join our talented team at Thomson Reuters.
Submit Application

Job Details

Location toronto, on
Job Type Full-time
Category Other-General
Posted Date June 04, 2026
Application Deadline July 14, 2026