-
公开(公告)号:US20240265924A1
公开(公告)日:2024-08-08
申请号:US18573846
申请日:2021-06-29
发明人: Jinyu LI , Long ZHOU , Xie SUN , Shujie LIU
CPC分类号: G10L15/32 , G10L15/005 , G10L15/063 , G10L15/30 , G10L2015/0635
摘要: Embodiments are provided for building a configurable multilingual model. A computing system obtains a plurality of language-specific automatic speech recognition modules and a universal automatic speech recognition module trained on a multi-language training dataset comprising training data corresponding to each of the plurality of different languages. The computing system then compiles the universal automatic speech recognition module with the plurality of language-specific automatic speech recognition modules to generate a configurable multilingual model that is configured to selectively and dynamically utilize a sub-set of the plurality of language-specific automatic speech recognition modules with the universal automatic speech recognition module to process audio content in response to user input identifying one or more target languages associated with the audio content.