-
Publication No.: US20220108684A1
Publication date: 2022-04-07
Application No.: US17644749
Application date: 2021-12-16
Inventor: Xiaoyin FU, Mingxin LIANG, Zhijie CHEN, Qiguang ZANG, Zhengxiang JIANG, Liao ZHANG, Qi ZHANG, Lei JIA
IPC: G10L15/02, G10L15/16, G10L19/032
Abstract: The present disclosure provides a method of recognizing speech offline, an electronic device, and a storage medium, relating to fields of artificial intelligence such as speech recognition, natural language processing, and deep learning. The method may include: decoding speech data to be recognized into a syllable recognition result; and transforming the syllable recognition result into a corresponding text as a speech recognition result of the speech data.
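The two stages named in the abstract (speech to syllables, then syllables to text) can be illustrated with a minimal sketch. Everything concrete here is an assumption for illustration and not taken from the patent: stage 1 is stood in for by a CTC-style collapse of per-frame labels (with `_` as a blank), stage 2 by a greedy longest-match lookup in a hypothetical toy lexicon `LEXICON`, and the function names are invented. The disclosure itself does not specify these techniques in the abstract.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical toy lexicon mapping syllable tuples to words (not from the source).
LEXICON: Dict[Tuple[str, ...], str] = {
    ("hel", "lo"): "hello",
    ("world",): "world",
}


def decode_to_syllables(frame_labels: Sequence[str]) -> List[str]:
    """Stage 1 stand-in: collapse repeated per-frame labels and drop blanks ("_")."""
    syllables: List[str] = []
    previous = None
    for label in frame_labels:
        if label != previous and label != "_":
            syllables.append(label)
        previous = label
    return syllables


def syllables_to_text(syllables: Sequence[str],
                      lexicon: Dict[Tuple[str, ...], str] = LEXICON,
                      max_len: int = 2) -> str:
    """Stage 2 stand-in: greedy longest-match lookup of syllable runs in the lexicon."""
    words: List[str] = []
    i = 0
    while i < len(syllables):
        for n in range(min(max_len, len(syllables) - i), 0, -1):
            key = tuple(syllables[i:i + n])
            if key in lexicon:
                words.append(lexicon[key])
                i += n
                break
        else:
            # Unknown syllable: keep it as-is rather than guessing a word.
            words.append(syllables[i])
            i += 1
    return " ".join(words)


def recognize_offline(frame_labels: Sequence[str]) -> str:
    """Chain the two stages: frames -> syllables -> text."""
    return syllables_to_text(decode_to_syllables(frame_labels))
```

Splitting recognition this way keeps the first stage's output vocabulary small (syllables rather than words), which is one plausible reason a two-stage design suits offline, on-device use.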
-
Publication No.: US20220328040A1
Publication date: 2022-10-13
Application No.: US17684681
Application date: 2022-03-02
Inventor: Liao ZHANG, Yinlou ZHAO, Zhengxiang JIANG, Xiaoyin FU, Wei WEI
IPC: G10L15/183, G06N5/04
Abstract: The present disclosure discloses a speech recognition method and apparatus, and relates to the field of speech and deep learning technologies. A specific implementation scheme involves: acquiring the candidate recognition results with the top N recognition scores output by a speech recognition model for to-be-recognized speech, N being a positive integer greater than 1; scoring the N candidate recognition results based on pronunciation similarities between the candidate recognition results and pre-collected popular entities, to obtain similarity scores of the candidate recognition results; and integrating the recognition scores and the similarity scores of the candidate recognition results to determine a recognition result corresponding to the to-be-recognized speech from the N candidate recognition results. The present disclosure can improve recognition accuracy.
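The N-best rescoring described in this abstract can be sketched as follows. This is an illustrative sketch under stated assumptions, not the patented method: pronunciation similarity is approximated with `difflib.SequenceMatcher` over syllable sequences, scores are integrated by a simple weighted sum, and the names `pronunciation_similarity`, `rescore`, and `weight` are hypothetical.

```python
from difflib import SequenceMatcher
from typing import List, Sequence, Tuple

# A candidate is (text, syllable sequence, recognition score from the model).
Candidate = Tuple[str, Sequence[str], float]


def pronunciation_similarity(a: Sequence[str], b: Sequence[str]) -> float:
    """Similarity in [0, 1] between two syllable sequences (assumed metric)."""
    return SequenceMatcher(None, list(a), list(b)).ratio()


def rescore(candidates: List[Candidate],
            popular_entities: List[Sequence[str]],
            weight: float = 0.5) -> str:
    """Return the text of the candidate with the best combined score.

    The combined score is the recognition score plus `weight` times the
    highest similarity to any popular entity (a hypothetical integration rule).
    """
    best_text, best_score = "", float("-inf")
    for text, syllables, rec_score in candidates:
        sim = max(
            (pronunciation_similarity(syllables, e) for e in popular_entities),
            default=0.0,
        )
        combined = rec_score + weight * sim
        if combined > best_score:
            best_text, best_score = text, combined
    return best_text


# Usage: the lower-scored candidate wins once a matching popular entity is known.
n_best = [
    ("tailor shift", ["tay", "lor", "shift"], 0.45),
    ("taylor swift", ["tay", "lor", "swift"], 0.40),
]
entities = [["tay", "lor", "swift"]]
chosen = rescore(n_best, entities)
```

With an empty entity list the similarity term is zero for every candidate, so the choice falls back to the model's own ranking.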
-
Publication No.: US20240221727A1
Publication date: 2024-07-04
Application No.: US18266432
Application date: 2022-09-01
Inventor: Lanhua YOU, Lei JIA, Qi ZHANG, Zhengxiang JIANG
CPC classification number: G10L15/063, G10L15/01, G10L15/02, G10L15/16
Abstract: The present disclosure provides a voice recognition model training method and apparatus, an electronic device, and a storage medium, relating to the field of artificial intelligence technology, and in particular to fields such as deep learning and voice recognition. The specific implementation scheme includes: constructing a negative sample according to a positive sample to obtain a target negative sample for constraining a voice decoding path; obtaining training data according to the positive sample and the target negative sample; and training a first voice recognition model according to the training data to obtain a second voice recognition model.
-