专利检索 ap:("Shujie LIU" OR "Jinyu LI" OR "Long ZHOU" OR "Xie SUN" OR "Microsoft Technology Licensing, LLC") AND inv:"Long ZHOU" 第 1 页

1.

发明公开
CANONICAL TRAINING FOR HIGHLY CONFIGURABLE MULTILINGUAL SPEECH 审中-公开

公开(公告)号：US20240265924A1

公开(公告)日：2024-08-08

申请号：US18573846

申请日：2021-06-29

申请人： Shujie LIU , Jinyu LI , Long ZHOU , Xie SUN , Microsoft Technology Licensing, LLC

发明人： Jinyu LI , Long ZHOU , Xie SUN , Shujie LIU

IPC分类号： G10L15/32 , G10L15/00 , G10L15/06 , G10L15/30

CPC分类号： G10L15/32 , G10L15/005 , G10L15/063 , G10L15/30 , G10L2015/0635

摘要： Embodiments are provided for building a configurable multilingual model. A computing system obtains a plurality of language-specific automatic speech recognition modules and a universal automatic speech recognition module trained on a multi-language training dataset comprising training data corresponding to each of the plurality of different languages. The computing system then compiles the universal automatic speech recognition module with the plurality of language-specific automatic speech recognition modules to generate a configurable multilingual model that is configured to selectively and dynamically utilize a sub-set of the plurality of language-specific automatic speech recognition modules with the universal automatic speech recognition module to process audio content in response to user input identifying one or more target languages associated with the audio content.