Job Description
Software Engineer - Inference (Singapore)
Responsibilities
- Responsible for developing and optimizing LLM inference framework.
- Responsible for GPU and CUDA Performance optimization to create an industry-leading high-performance LLM inference engine.
Qualifications
- Minimum Qualifications:
- Bachelor's degree or above, major in computer/electronics/automation/software, etc.
- Proficient in C/C++, proficient in algorithms and data structures, familiar with Python.
- Understand the basic principles of deep learning algorithms, be familiar with the basic architecture of neural networks and understand deep learning training frameworks such as Pytorch.
- Preferred Qualifications:
- Proficient in GPU high-performance computing optimization technology on CUDA, in-depth understanding of computer architecture, familiar with parallel computing optimization, memory access opt...
Ready to Apply?
Submit your application today and join our talented team at ByteDance.
Submit ApplicationJob Details
- Location Singapore, Singapore
- Job Type Full-time
- Category Software Architecture & Engineering
- Posted Date February 22, 2026
- Application Deadline April 03, 2026