NVIDIA Senior Engineer in AI Inference (Toronto)

NVIDIA
📍 toronto, on, Canada 💼 Full-time 🕒 Posted June 19, 2026

Job Description

Lead the charge in AI innovation as a Senior Engineer at NVIDIA, specializing in high-efficiency AI inference systems. Your work will optimize performance on large-scale AI models, utilizing cutting-edge GPU capabilities.

This position requires deep technical expertise in software engineering and a robust background in AI frameworks. You'll play a crucial role in optimizing inference stacks, contributing to groundbreaking research, and developing tools that empower developers to utilize GPU features effectively.

Key Responsibilities

  • Develop features for advanced AI models using vLLM
  • Optimize and benchmark GPU kernels and compilers
  • Create and define inference benchmarking strategies
  • Oversee the orchestration of inference deployments
  • Research and integrate novel ideas from ML publications

Requirements

  • PhD in related field or 7+ years experience in industry
  • Proficient in Python and C...

Ready to Apply?

Submit your application today and join our talented team at NVIDIA.

Submit Application

Job Details

  • Location toronto, on
  • Job Type Full-time
  • Category IT & Technology
  • Posted Date June 19, 2026
  • Application Deadline July 29, 2026