MVA Multi-Modality Interaction Developer

Mercedes-Benz
📍 Beijing, Beijing, China 💼 Full-time 🕒 Posted July 03, 2026

Job Description

Tätigkeitsbereich:Forschung & Entwicklung incl. DesignFachabteilung:RD ChinaGesellschaft:Mercedes-Benz Group China Ltd.Standort:Mercedes-Benz Group China Ltd., BeijingStartdatum:sofortVeröffentlichungsdatum:..6Stellennummer:MERJ6Arbeitszeit:Vollzeit BewerbenAufgabenKey Responsibilities
  • Develop based on the current mainstream speech systems, including SSPE, wakeup, vad, asr, nlu, dm, tts, LLM, and etc.
  • Design and implement multimodal fusion combining speech, DMS camera, OMS camera, Dash camera, microphone, sensors, audio system state, voice print, and vehicle state data.
  • Normalize and structure multimodal inputs into system context representations suitable for LLM reasoning to support future LLM-based assistant use cases, such as; context-aware dialogue, assistant memory collection and apply, and etc.
  • Design and maintain consistent multimodal data pipelines, handling time alignment, normalization, and state coherence as data flows from vehicle systems into LLM...
  • Ready to Apply?

    Submit your application today and join our talented team at Mercedes-Benz.

    Submit Application

    Job Details

    • Location Beijing, Beijing
    • Job Type Full-time
    • Category Computer Occupations
    • Posted Date July 03, 2026
    • Application Deadline August 12, 2026