Patent search ap:("Beijing Baidu Netcom Science Technology Co., Ltd.") AND inv:"Liao Zhang"

1.

Granted invention patent
Method of recognizing speech offline, electronic device, and storage medium (In force)

Publication No.: US12183323B2

Publication date: 2024-12-31

Application No.: US17644749

Filing date: 2021-12-16

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor: Xiaoyin Fu, Mingxin Liang, Zhijie Chen, Qiguang Zang, Zhengxiang Jiang, Liao Zhang, Qi Zhang, Lei Jia

IPC: G10L15/02, G10L15/16, G10L19/032

Abstract: The present disclosure provides a method of recognizing speech offline, an electronic device, and a storage medium, relating to the field of artificial intelligence, such as speech recognition, natural language processing, and deep learning. The method may include: decoding speech data to be recognized into a syllable recognition result; and transforming the syllable recognition result into a corresponding text as a speech recognition result of the speech data.
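The second stage named in this abstract (syllables to text) can be illustrated with a toy sketch. It is not the patented method: the abstract does not say how the transformation is done, so this example assumes a small hand-written lexicon (`LEXICON`, hypothetical) and a greedy longest-match lookup, and it takes the syllable sequence as already decoded from audio.

```python
# Hypothetical toy lexicon: syllable tuples -> text. A real system would
# likely use a trained model here; this table is an assumption for illustration.
LEXICON = {
    ("ni", "hao"): "你好",
    ("shi", "jie"): "世界",
    ("ni",): "你",
}


def syllables_to_text(syllables, lexicon=LEXICON):
    """Greedy longest-match conversion of a syllable list into text."""
    out, i = [], 0
    max_len = max(len(key) for key in lexicon)
    while i < len(syllables):
        # Try the longest possible syllable span first, then shorter ones.
        for n in range(min(max_len, len(syllables) - i), 0, -1):
            key = tuple(syllables[i:i + n])
            if key in lexicon:
                out.append(lexicon[key])
                i += n
                break
        else:
            # No lexicon entry starts here: keep the raw syllable.
            out.append(syllables[i])
            i += 1
    return "".join(out)
```

Calling `syllables_to_text(["ni", "hao", "shi", "jie"])` walks the list in two lookups, one per two-syllable word; syllables with no lexicon entry pass through unchanged.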

2.

Granted invention patent
Speech recognition method and apparatus (In force)

Publication No.: US12067977B2

Publication date: 2024-08-20

Application No.: US17684681

Filing date: 2022-03-02

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor: Liao Zhang, Yinlou Zhao, Zhengxiang Jiang, Xiaoyin Fu, Wei Wei

IPC: G10L15/183, G06N5/048

CPC classification number: G10L15/183, G06N5/048

Abstract: The present disclosure discloses a speech recognition method and apparatus, and relates to the field of speech and deep learning technologies. A specific implementation scheme involves: acquiring candidate recognition results with first N recognition scores outputted by a speech recognition model for to-be-recognized speech, N being a positive integer greater than 1; scoring the N candidate recognition results based on pronunciation similarities between candidate recognition results and pre-collected popular entities, to obtain similarity scores of the candidate recognition results; and integrating the recognition scores and the similarity scores of the candidate recognition results to determine a recognition result corresponding to the to-be-recognized speech from the N candidate recognition results. The present disclosure can improve recognition accuracy.
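The three steps in this abstract (take the top-N candidates, score each by pronunciation similarity to popular entities, combine the two scores) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the function names, the linear weighting via `weight`, and the use of character-level `difflib.SequenceMatcher` as a stand-in for a real pronunciation (phoneme or pinyin) comparison are all hypothetical.

```python
from difflib import SequenceMatcher


def pronunciation_similarity(a: str, b: str) -> float:
    """Stand-in similarity in [0, 1]. Assumption: a real system would
    compare phoneme or pinyin sequences rather than raw characters."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def rescore(candidates, popular_entities, weight=0.5):
    """Pick one result from N-best candidates.

    candidates: list of (text, recognition_score) pairs.
    popular_entities: pre-collected entity strings.
    weight: share given to the similarity score (assumed linear mix).
    """
    best_text, best_score = None, float("-inf")
    for text, rec_score in candidates:
        # Similarity score: best match against any popular entity.
        sim = max(
            (pronunciation_similarity(text, e) for e in popular_entities),
            default=0.0,
        )
        combined = (1 - weight) * rec_score + weight * sim
        if combined > best_score:
            best_text, best_score = text, combined
    return best_text
```

With `weight=0` the function reduces to picking the candidate with the highest recognition score, so the effect of the similarity term is easy to compare against the baseline.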

3.

Granted invention patent
Method for training a linguistic model and electronic device (In force)

Publication No.: US11900918B2

Publication date: 2024-02-13

Application No.: US17451380

Filing date: 2021-10-19

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor: Liao Zhang, Zhengxiang Jiang, Xiaoyin Fu

IPC: G10L15/06, G06F40/253, G06F40/30

CPC classification number: G10L15/063, G06F40/253, G06F40/30

Abstract: The present disclosure provides a method for training a linguistic model, related to the fields of speech, natural language processing, and deep learning technologies. The method includes: obtaining grammars corresponding to a plurality of sample texts and a slot value of a slot in each grammar by using semantic analysis; generating a grammar graph corresponding to each grammar based on the corresponding grammar and the slot value of the slot in the corresponding grammar; obtaining a weight of each grammar, a weight of each slot, and a weight of each slot value in each grammar graph based on the sample texts; determining at least one grammar frequency of each order based on the weight of each grammar, the weight of each slot, and the weight of each slot value in each grammar graph; and training the linguistic model based on the at least one grammar frequency of each order.

4.

Invention patent application
METHOD FOR TRAINING A LINGUISTIC MODEL AND ELECTRONIC DEVICE (In force)

Publication No.: US20220036880A1

Publication date: 2022-02-03

Application No.: US17451380

Filing date: 2021-10-19

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor: Liao Zhang, Zhengxiang Jiang, Xiaoyin Fu

IPC: G10L15/06, G06F40/30, G06F40/253

Abstract: The present disclosure provides a method for training a linguistic model, related to the fields of speech, natural language processing, and deep learning technologies. The method includes: obtaining grammars corresponding to a plurality of sample texts and a slot value of a slot in each grammar by using semantic analysis; generating a grammar graph corresponding to each grammar based on the corresponding grammar and the slot value of the slot in the corresponding grammar; obtaining a weight of each grammar, a weight of each slot, and a weight of each slot value in each grammar graph based on the sample texts; determining at least one grammar frequency of each order based on the weight of each grammar, the weight of each slot, and the weight of each slot value in each grammar graph; and training the linguistic model based on the at least one grammar frequency of each order.

5.

Granted invention patent
Method and apparatus for recognizing speech, electronic device and storage medium (In force)

Publication No.: US12033615B2

Publication date: 2024-07-09

Application No.: US17499129

Filing date: 2021-10-12

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor: Yinlou Zhao, Liao Zhang, Zhengxiang Jiang

IPC: G10L15/00, G10L15/14, G10L15/16, G10L15/26

CPC classification number: G10L15/005, G10L15/142, G10L15/16, G10L15/26

Abstract: The disclosure provides a method and an apparatus for recognizing a speech, an electronic device and a storage medium. A speech to be recognized is obtained. An acoustic feature of the speech to be recognized and a language feature of the speech to be recognized are obtained. The speech to be recognized is input to a pronunciation difference statistics to generate a differential pronunciation pair corresponding to the speech to be recognized. The text information of the speech to be recognized is generated based on the differential pronunciation pair, the acoustic feature and the language feature.

6.

Invention patent application
METHOD AND APPARATUS FOR RECOGNIZING SPEECH, ELECTRONIC DEVICE AND STORAGE MEDIUM (In force)

Publication No.: US20220028370A1

Publication date: 2022-01-27

Application No.: US17499129

Filing date: 2021-10-12

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor: Yinlou Zhao, Liao Zhang, Zhengxiang Jiang

IPC: G10L15/00, G10L15/14, G10L15/16, G10L15/26

Abstract: The disclosure provides a method and an apparatus for recognizing a speech, an electronic device and a storage medium. A speech to be recognized is obtained. An acoustic feature of the speech to be recognized and a language feature of the speech to be recognized are obtained. The speech to be recognized is input to a pronunciation difference statistics to generate a differential pronunciation pair corresponding to the speech to be recognized. The text information of the speech to be recognized is generated based on the differential pronunciation pair, the acoustic feature and the language feature.
