Patent search: ap:("BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.") AND inv:"Zhijian WANG"

1.

Invention application
METHOD AND DEVICE FOR PROCESSING VOICE INTERACTION, ELECTRONIC DEVICE AND STORAGE MEDIUM (legal status: in force)

Publication (announcement) number: US20220005474A1

Publication (announcement) date: 2022-01-06

Application number: US17476333

Application date: 2021-09-15

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor: Jinfeng BAI, Zhijian WANG, Cong GAO

IPC: G10L15/22

Abstract: The present disclosure provides a method and a device for processing voice interaction, an electronic device and a storage medium. The method includes: determining a first integrity of a voice instruction from a user by using a pre-trained integrity detection model in response to detecting that the voice instruction from the user is not a high-frequency instruction; determining a waiting duration for the voice instruction based on the first integrity and a preset integrity threshold, wherein the waiting duration for the voice instruction indicates the length of the period between the time when a voice interaction device determines that receiving the voice instruction is completed and the time when the voice interaction device performs an operation in response to the voice instruction of the user; and controlling the voice interaction device to respond to the voice instruction of the user based on the waiting duration.
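
The decision flow in this abstract can be illustrated with a minimal sketch. It is not the patented implementation. The durations, the default threshold, the immediate response to high-frequency instructions, and the rule that a higher integrity score gives a shorter wait are all assumptions made for illustration; `waiting_duration` and `integrity_model` are hypothetical names.

```python
from typing import Callable, Set

# Hypothetical durations in seconds; the abstract gives no concrete values.
SHORT_WAIT_S = 0.2
LONG_WAIT_S = 1.0


def waiting_duration(
    instruction: str,
    high_frequency: Set[str],
    integrity_model: Callable[[str], float],
    threshold: float = 0.5,
) -> float:
    """Return how long to wait after the instruction ends before responding."""
    if instruction in high_frequency:
        # Assumption: high-frequency instructions are answered at once.
        # The abstract only describes the non-high-frequency branch.
        return 0.0
    # Stand-in for the pre-trained integrity detection model.
    integrity = integrity_model(instruction)
    # Assumption: an instruction that looks complete gets the short wait,
    # an instruction that looks unfinished gets the long wait.
    return SHORT_WAIT_S if integrity >= threshold else LONG_WAIT_S
```

Here `integrity_model` can be any callable that maps the recognized text to a score between 0 and 1, so the sketch runs without a trained model.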

2.

Invention application
METHOD FOR RECOGNIZING CHINESE-ENGLISH MIXED SPEECH, ELECTRONIC DEVICE, AND STORAGE MEDIUM (legal status: in force)

Publication (announcement) number: US20220139369A1

Publication (announcement) date: 2022-05-05

Application number: US17530276

Application date: 2021-11-18

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor: Zhijian WANG, Sheng QIAN, Qi ZHANG

IPC: G10L15/00, G10L15/183, G10L15/32

Abstract: A method for recognizing Chinese-English mixed speech includes: determining pronunciation information and language model scores of speech information in response to receiving the speech information; determining whether an English word exists in the content of the speech information based on the pronunciation information; determining a Chinese word corresponding to the English word based on a preset Chinese-English mapping table in response to the English word existing in the content of the speech information, in which the Chinese-English mapping table includes a mapping relationship for at least one pair of an English word and a Chinese word; determining a score of the Chinese word corresponding to the English word; replacing the score of the English word in the language model scores with the score of the Chinese word; and obtaining a speech recognition result for the speech information based on the replaced language model scores.
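
The score replacement step in this abstract can be illustrated with a minimal sketch. It is not the patented implementation. The abstract detects English words from pronunciation information; the sketch swaps in a simple ASCII-letter check instead. The function names, the dictionary layout of the scores, and the sample values in the usage note are hypothetical.

```python
from typing import Callable, Dict


def looks_english(word: str) -> bool:
    # Assumption: a word made only of ASCII letters is treated as English.
    # The abstract uses pronunciation information for this decision.
    return word.isascii() and word.isalpha()


def replace_english_scores(
    lm_scores: Dict[str, float],
    mapping_table: Dict[str, str],
    score_of_chinese: Callable[[str], float],
) -> Dict[str, float]:
    """Return a copy of lm_scores with mapped English words rescored."""
    replaced = dict(lm_scores)
    for word in lm_scores:
        if looks_english(word) and word in mapping_table:
            chinese_word = mapping_table[word]
            # The English word keeps its place in the result but takes
            # the score of its Chinese counterpart.
            replaced[word] = score_of_chinese(chinese_word)
    return replaced
```

For example, with scores `{"播放": -1.0, "music": -9.0}`, a mapping table `{"music": "音乐"}`, and a scorer that returns -2.0 for "音乐", the entry for "music" becomes -2.0 and the entry for "播放" is unchanged.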

3.

Invention application
METHOD AND APPARATUS OF PERFORMING VOICE INTERACTION, ELECTRONIC DEVICE, AND READABLE STORAGE MEDIUM (legal status: in force)

Publication (announcement) number: US20220068277A1

Publication (announcement) date: 2022-03-03

Application number: US17522985

Application date: 2021-11-10

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor: Zhijian WANG, Sheng QIAN

IPC: G10L15/22, G10L15/02, G10L15/01

Abstract: The present disclosure provides a method and apparatus of performing a voice interaction, an electronic device and a readable storage medium, which relate to the technical fields of voice processing and deep learning. The method of performing the voice interaction includes: acquiring audio to be recognized; obtaining a recognition result for the audio to be recognized by using an audio recognition model, and extracting the input of the output layer of the audio recognition model during the recognition process as a recognition feature; obtaining a response confidence level according to the recognition feature; and responding to the audio to be recognized in response to determining that the response confidence level meets a preset response condition.
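
The pipeline in this abstract can be illustrated with a minimal sketch. It is not the patented implementation. The toy two-layer model, the logistic confidence scorer, the weights, and the use of a fixed threshold as the preset response condition are all assumptions made for illustration; every function name is hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def recognize(
    audio: Sequence[float],
    hidden_w: Sequence[Sequence[float]],
    output_w: Sequence[Sequence[float]],
) -> Tuple[int, List[float]]:
    """Toy recognition model that also returns the input of its output layer."""
    # The hidden activations are what the output layer receives, so they
    # play the role of the recognition feature.
    feature = [math.tanh(sum(w * x for w, x in zip(row, audio))) for row in hidden_w]
    logits = [sum(w * f for w, f in zip(row, feature)) for row in output_w]
    label = max(range(len(logits)), key=logits.__getitem__)
    return label, feature


def response_confidence(feature: Sequence[float], conf_w: Sequence[float]) -> float:
    # Assumption: a logistic scorer over the recognition feature.
    z = sum(w * f for w, f in zip(conf_w, feature))
    return 1.0 / (1.0 + math.exp(-z))


def should_respond(
    audio: Sequence[float],
    hidden_w: Sequence[Sequence[float]],
    output_w: Sequence[Sequence[float]],
    conf_w: Sequence[float],
    threshold: float = 0.5,
) -> Tuple[int, bool]:
    label, feature = recognize(audio, hidden_w, output_w)
    # Assumption: the preset response condition is confidence >= threshold.
    return label, response_confidence(feature, conf_w) >= threshold
```

The point of the design is that the confidence scorer reuses an intermediate representation the recognition model already computes, so no second pass over the audio is needed.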
