-
公开(公告)号:US12067977B2
公开(公告)日:2024-08-20
申请号:US17684681
申请日:2022-03-02
Inventor: Liao Zhang , Yinlou Zhao , Zhengxiang Jiang , Xiaoyin Fu , Wei Wei
IPC: G10L15/183 , G06N5/048
CPC classification number: G10L15/183 , G06N5/048
Abstract: The present disclosure discloses a speech recognition method and apparatus, and relates to the field of speech and deep learning technologies. A specific implementation scheme involves: acquiring candidate recognition results with first N recognition scores outputted by a speech recognition model for to-be-recognized speech, N being a positive integer greater than 1; scoring the N candidate recognition results based on pronunciation similarities between candidate recognition results and pre-collected popular entities, to obtain similarity scores of the candidate recognition results; and integrating the recognition scores and the similarity scores of the candidate recognition results to determine a recognition result corresponding to the to-be-recognized speech from the N candidate recognition results. The present disclosure can improve recognition accuracy.
-
公开(公告)号:US12033615B2
公开(公告)日:2024-07-09
申请号:US17499129
申请日:2021-10-12
Inventor: Yinlou Zhao , Liao Zhang , Zhengxiang Jiang
CPC classification number: G10L15/005 , G10L15/142 , G10L15/16 , G10L15/26
Abstract: The disclosure provides a method and an apparatus for recognizing a speech, an electronic device and a storage medium. A speech to be recognized is obtained. An acoustic feature of the speech to be recognized and a language feature of the speech to be recognized are obtained. The speech to be recognized is input to a pronunciation difference statistics to generate a differential pronunciation pair corresponding to the speech to be recognized. The text information of the speech to be recognized is generated based on the differential pronunciation pair, the acoustic feature and the language feature.
-
公开(公告)号:US20220028370A1
公开(公告)日:2022-01-27
申请号:US17499129
申请日:2021-10-12
Inventor: Yinlou Zhao , Liao Zhang , Zhengxiang Jiang
Abstract: The disclosure provides a method and an apparatus for recognizing a speech, an electronic device and a storage medium. A speech to be recognized is obtained. An acoustic feature of the speech to be recognized and a language feature of the speech to be recognized are obtained. The speech to be recognized is input to a pronunciation difference statistics to generate a differential pronunciation pair corresponding to the speech to be recognized. The text information of the speech to be recognized is generated based on the differential pronunciation pair, the acoustic feature and the language feature.
-
-