NVIDIA Senior Engineer, AI Inference Solutions

NVIDIA Gruppe
📍 Toronto, ON, Canada 💼 Full-time 🕒 Posted June 19, 2026

Job Description

Drive innovation at NVIDIA as a Senior Software Engineer in AI inference. Collaborate directly with customers to optimize the performance and scalability of LLM serving.
In this role you will partner closely with NVIDIA engineering teams to refine large-scale LLM serving solutions. You will profile and optimize GPU deployments, using benchmarking campaigns in cloud environments to drive performance improvements. Your work will improve customer solutions and contribute substantially to open-source projects such as vLLM, so that shared knowledge improves engineering practices.
Key Responsibilities:
• Collaborate with customers to analyze LLM serving architectures
• Run detailed benchmarking campaigns on Kubernetes
• Optimize GPU cluster deployments to close performance gaps
• Develop end-user tools that improve team efficiency
• Document findings and contribute them back to the community
Requirements:
• Advanced degree in Computer S...

Ready to Apply?

Submit your application today and join our talented team at NVIDIA Gruppe.

Job Details

  • Location Toronto, ON
  • Job Type Full-time
  • Category IT & Technology
  • Posted Date June 19, 2026
  • Application Deadline July 29, 2026