-
1.
Publication No.: US20220005474A1
Publication Date: 2022-01-06
Application No.: US17476333
Application Date: 2021-09-15
Inventor: Jinfeng BAI, Zhijian WANG, Cong GAO
IPC: G10L15/22
Abstract: The present disclosure provides a method and a device for processing voice interaction, an electronic device and a storage medium. The method includes: in response to detecting that a voice instruction from a user is not a high-frequency instruction, determining a first integrity of the voice instruction by using a pre-trained integrity detection model; determining a waiting duration for the voice instruction based on the first integrity and a preset integrity threshold, wherein the waiting duration indicates the length of the period between the time when a voice interaction device determines that receiving the voice instruction is completed and the time when the voice interaction device performs an operation in response to the voice instruction; and controlling the voice interaction device to respond to the voice instruction based on the waiting duration.
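The decision flow in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the instruction set, threshold, and durations are hypothetical, the integrity model is passed in as a plain callable, and the abstract does not state which side of the threshold gets the shorter wait, so the mapping below (more complete instruction, shorter wait) is a guess.

```python
from typing import Callable

# Hypothetical values; the abstract does not specify any of them.
HIGH_FREQUENCY_INSTRUCTIONS = {"play", "pause", "next"}
INTEGRITY_THRESHOLD = 0.8
SHORT_WAIT_S = 0.3
LONG_WAIT_S = 1.0


def waiting_duration(instruction: str,
                     integrity_model: Callable[[str], float]) -> float:
    """Return the delay in seconds between end of reception and the response."""
    if instruction in HIGH_FREQUENCY_INSTRUCTIONS:
        # Assumption: the abstract only covers the non-high-frequency
        # branch, so responding immediately here is illustrative.
        return 0.0
    # "First integrity": a completeness score from the pre-trained model.
    integrity = integrity_model(instruction)
    # Assumption: an instruction judged complete gets the shorter wait.
    if integrity >= INTEGRITY_THRESHOLD:
        return SHORT_WAIT_S
    return LONG_WAIT_S
```

A caller would pass the trained detector as `integrity_model`; a stub such as `lambda text: 0.9` is enough to exercise the branches.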
-
2.
Publication No.: US20220139369A1
Publication Date: 2022-05-05
Application No.: US17530276
Application Date: 2021-11-18
Inventor: Zhijian WANG, Sheng QIAN, Qi ZHANG
IPC: G10L15/00, G10L15/183, G10L15/32
Abstract: A method for recognizing Chinese-English mixed speech includes: in response to receiving speech information, determining pronunciation information and language model scores of the speech information; determining whether an English word exists in the content of the speech information based on the pronunciation information; in response to the English word existing in the content of the speech information, determining a Chinese word corresponding to the English word based on a preset Chinese-English mapping table, in which the Chinese-English mapping table includes a mapping relationship for at least one pair of an English word and a Chinese word; determining a score of the Chinese word corresponding to the English word; replacing the score of the English word in the language model scores with the score of the Chinese word; and obtaining a speech recognition result for the speech information based on the language model scores after replacement.
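The score replacement step can be illustrated with a minimal sketch. Everything here is hypothetical: the function name, the mapping table contents, and the scoring callable are not from the patent, and detecting English words by a table lookup stands in for the pronunciation-based detection the abstract describes.

```python
from typing import Callable, Dict, List


def replace_english_scores(words: List[str],
                           lm_scores: List[float],
                           mapping: Dict[str, str],
                           score_of: Callable[[str], float]) -> List[float]:
    """Return a copy of lm_scores with mapped English words rescored."""
    rescored = list(lm_scores)  # leave the caller's list untouched
    for i, word in enumerate(words):
        # Assumption: a lowercase table lookup replaces the
        # pronunciation-based English word detection.
        chinese_word = mapping.get(word.lower())
        if chinese_word is not None:
            # Use the score of the mapped Chinese word instead of the
            # score of the English word.
            rescored[i] = score_of(chinese_word)
    return rescored
```

The recognition result would then be chosen from the rescored values, for example by summing them per candidate sentence; that final step is left out because the abstract does not detail it.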
-
3.
Publication No.: US20220068277A1
Publication Date: 2022-03-03
Application No.: US17522985
Application Date: 2021-11-10
Inventor: Zhijian WANG, Sheng QIAN
Abstract: The present disclosure provides a method and an apparatus for performing a voice interaction, an electronic device and a readable storage medium, which relate to the technical fields of voice processing and deep learning. The method of performing the voice interaction includes: acquiring audio to be recognized; obtaining a recognition result for the audio by using an audio recognition model, and extracting the input of the output layer of the audio recognition model during the recognition process as a recognition feature; obtaining a response confidence level according to the recognition feature; and responding to the audio in response to determining that the response confidence level meets a preset response condition.
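A toy sketch of the idea, under heavy assumptions: the two-layer network, the logistic confidence scorer, and the 0.5 threshold are all hypothetical stand-ins, since the abstract names neither the model architecture nor the response condition. The point shown is only that the vector fed into the output layer is reused as the feature for the confidence decision.

```python
import math
from typing import List, Tuple


def recognize(features: List[float],
              hidden_weights: List[List[float]],
              output_weights: List[List[float]]) -> Tuple[int, List[float]]:
    """Return (predicted label index, input of the output layer)."""
    # Hidden activations are what the output layer receives as input.
    hidden = [math.tanh(sum(w * x for w, x in zip(row, features)))
              for row in hidden_weights]
    logits = [sum(w * h for w, h in zip(row, hidden))
              for row in output_weights]
    label = max(range(len(logits)), key=logits.__getitem__)
    return label, hidden


def response_confidence(recognition_feature: List[float],
                        weights: List[float], bias: float) -> float:
    """Hypothetical scorer: logistic regression over the feature."""
    z = sum(w * f for w, f in zip(weights, recognition_feature)) + bias
    return 1.0 / (1.0 + math.exp(-z))


def should_respond(confidence: float, threshold: float = 0.5) -> bool:
    """Assumed response condition: confidence at or above a threshold."""
    return confidence >= threshold
```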
-