AI Computing Development Engineer, TensorRT and TensorRT-LLM

NVIDIA
📍 Shanghai, China, China 💼 Full-time 🕒 Posted June 22, 2026

Job Description

NVIDIA is hiring software engineers for its AI Computing team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like generative AI, computer vision, speech recognition, recommender systems, and large-scale language and multimodal models. Join the team building the inferencing software (TensorRT/TensorRT-LLM) that will be used across our product lines. The ability to work in a fast-paced, delivery-focused environment is required, and excellent interpersonal skills are a must.


What you'll be doing:
+ Design and develop robust inferencing software (TensorRT/TensorRT-LLM) optimized for functionality and performance across platforms
+ Perform performance analysis, optimization, and tuning of deep learning inference workloads
+ Track and integrate academic and industry advancements in AI and feature-update TensorRT/TensorRT-LLM accordingly
+ Provide feedback into archit...

Ready to Apply?

Submit your application today and join our talented team at NVIDIA.

Submit Application

Job Details

  • Location Shanghai, China
  • Job Type Full-time
  • Category other-general
  • Posted Date June 22, 2026
  • Application Deadline June 27, 2026