-
Publication (Announcement) No.: US12033616B2
Publication (Announcement) Date: 2024-07-09
Application No.: US17571805
Application Date: 2022-01-10
Inventor: Junyao Shao, Xiaoyin Fu, Qiguang Zang, Zhijie Chen, Mingxin Liang, Huanxin Zheng, Sheng Qian
IPC: G10L15/06, G10L15/16, G10L15/183, G10L15/28
CPC classification number: G10L15/063, G10L15/16, G10L15/183, G10L15/28
Abstract: A method for training a speech recognition model, a device, and a storage medium are disclosed, relating to the field of computer technologies and particularly to fields such as speech recognition technologies and deep learning technologies. The method for training a speech recognition model includes: obtaining, based on an acoustic decoding model and a language model, a fusion probability of each of at least one candidate text corresponding to a speech; selecting a preset number of one or more candidate texts based on the fusion probability of each candidate text, and determining a predicted text based on the selected candidate texts; and obtaining a loss function based on the predicted text and a standard text corresponding to the speech, and training the speech recognition model based on the loss function.
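The steps in this abstract (fuse the two model scores, keep a preset number of candidates, compare against the standard text) can be illustrated with a minimal sketch. Everything specific here is an assumption and not taken from the patent: the log-linear fusion with `lm_weight`, the function names `fuse`, `top_k`, `word_errors`, and `expected_error_loss`, and the use of an expected word-error loss over the selected candidates.

```python
import math

def fuse(acoustic_logp, lm_logp, lm_weight=0.3):
    # Assumed fusion rule: acoustic log-probability plus a weighted LM log-probability.
    return acoustic_logp + lm_weight * lm_logp

def top_k(candidates, k, lm_weight=0.3):
    # candidates: list of (text, acoustic_logp, lm_logp) tuples.
    # Returns the k (text, fused_score) pairs with the highest fused score.
    scored = [(text, fuse(a, l, lm_weight)) for text, a, l in candidates]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]

def word_errors(hypothesis, reference):
    # Word-level Levenshtein distance between a hypothesis and the reference.
    hyp, ref = hypothesis.split(), reference.split()
    prev = list(range(len(ref) + 1))
    for i, hyp_word in enumerate(hyp, 1):
        cur = [i]
        for j, ref_word in enumerate(ref, 1):
            cur.append(min(prev[j] + 1,                          # deletion
                           cur[j - 1] + 1,                       # insertion
                           prev[j - 1] + (hyp_word != ref_word)))  # substitution
        prev = cur
    return prev[-1]

def expected_error_loss(selected, reference):
    # Assumed loss: word errors averaged under a softmax over the fused scores.
    best = max(score for _, score in selected)
    weights = [math.exp(score - best) for _, score in selected]
    total = sum(weights)
    return sum(w / total * word_errors(text, reference)
               for (text, _), w in zip(selected, weights))

# Hypothetical candidates: (text, acoustic log-prob, LM log-prob).
candidates = [
    ("turn on the light", -1.0, -2.0),
    ("turn on the lite", -0.9, -6.0),
    ("turn of the light", -3.0, -2.5),
]
selected = top_k(candidates, k=2)
loss = expected_error_loss(selected, "turn on the light")
```

In this toy input the second candidate has the best acoustic score, but the language model score pushes the correctly spelled text to the top after fusion, which is the effect the fusion step is meant to have.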
-
2.
Publication (Announcement) No.: US11893977B2
Publication (Announcement) Date: 2024-02-06
Application No.: US17530276
Application Date: 2021-11-18
Inventor: Zhijian Wang, Sheng Qian, Qi Zhang
IPC: G10L15/183, G10L15/00, G10L15/32
CPC classification number: G10L15/005, G10L15/183, G10L15/32
Abstract: A method for recognizing Chinese-English mixed speech includes: in response to receiving speech information, determining pronunciation information and language model scores of the speech information; determining, based on the pronunciation information, whether an English word exists in the content of the speech information; in response to the English word existing in the content of the speech information, determining a Chinese word corresponding to the English word based on a preset Chinese-English mapping table, in which the Chinese-English mapping table includes a mapping relationship of at least one pair of an English word and a Chinese word; determining a score of the Chinese word corresponding to the English word; replacing the score of the English word in the language model scores with the score of the Chinese word; and obtaining a speech recognition result for the speech information based on the language model scores after the replacement.
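The score replacement described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the names `is_english`, `replace_scores`, `en_to_zh`, and `zh_score` are hypothetical, the scores are made-up log-probabilities, and the ASCII check stands in for the pronunciation-based detection of English words that the abstract describes.

```python
def is_english(token):
    # Stand-in for pronunciation-based detection (an assumption, not the
    # patent's method): treat purely ASCII alphabetic tokens as English.
    return token.isascii() and token.isalpha()

def replace_scores(tokens, lm_scores, en_to_zh, zh_score):
    # tokens and lm_scores are aligned lists; en_to_zh is the preset
    # Chinese-English mapping table; zh_score holds language model scores
    # for the mapped Chinese words.
    replaced = []
    for token, score in zip(tokens, lm_scores):
        zh_word = en_to_zh.get(token.lower()) if is_english(token) else None
        if zh_word is not None and zh_word in zh_score:
            # Use the Chinese word's score in place of the English word's score.
            replaced.append(zh_score[zh_word])
        else:
            replaced.append(score)
    return replaced

# Hypothetical example data.
tokens = ["打开", "wifi"]
lm_scores = [-1.2, -9.5]
en_to_zh = {"wifi": "无线网"}
zh_score = {"无线网": -2.0}

new_scores = replace_scores(tokens, lm_scores, en_to_zh, zh_score)
```

The recognition result would then be chosen using `new_scores` instead of `lm_scores`, so that an English word poorly covered by a mostly Chinese language model is not penalized for its low original score.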
-