-
Publication No.: US20240420684A1
Publication date: 2024-12-19
Application No.: US18706313
Filing date: 2023-01-17
Inventor: Saisai ZOU , Lei JIA , Haifeng WANG
Abstract: A speech wake-up method, an electronic device, and a storage medium are provided. The method includes: performing word recognition on speech to be recognized to obtain a wake-up word recognition result (S210); in response to determining that the wake-up word recognition result indicates that the speech contains a predetermined wake-up word, performing syllable recognition on the speech to obtain a wake-up syllable recognition result (S220); and in response to determining that the wake-up syllable recognition result indicates that the speech contains a predetermined syllable, determining that the speech is a correct wake-up speech (S230).
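The gating order of steps S210 to S230 can be sketched as follows. This is only an illustration under stated assumptions: the function name `is_correct_wake_up` and the two recognizer callables are hypothetical placeholders, not names from the application, and the abstract does not say how either recognizer is implemented.

```python
from typing import Callable, Sequence

# Hypothetical recognizer type: takes audio samples, returns True when the
# target (wake-up word or predetermined syllable) is judged to be present.
Recognizer = Callable[[Sequence[float]], bool]


def is_correct_wake_up(
    speech: Sequence[float],
    contains_wake_word: Recognizer,
    contains_syllable: Recognizer,
) -> bool:
    """Two-stage check: syllable recognition runs only after the word stage passes."""
    # S210: word-level recognition on the speech to be recognized.
    if not contains_wake_word(speech):
        return False
    # S220: syllable-level recognition, performed only in response to a
    # positive word-level result.
    # S230: the speech counts as a correct wake-up only if this also passes.
    return contains_syllable(speech)
```

In this sketch the syllable recognizer is never called when the word stage rejects the input, which mirrors the "in response to" conditions in the abstract.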
-
Publication No.: US20230360638A1
Publication date: 2023-11-09
Application No.: US18221593
Filing date: 2023-07-13
Inventor: Saisai ZOU , Lei JIA , Haifeng WANG
CPC classification number: G10L15/02 , G10L15/14 , G10L2015/027
Abstract: A method of processing speech information, a method of training a speech model, a speech wake-up method, an electronic device, and a storage medium are provided, which relate to the field of artificial intelligence technology, in particular to the fields of human-computer interaction, deep learning, and intelligent speech technologies. A specific implementation solution includes: performing syllable recognition on speech information to obtain a posterior probability sequence for the speech information, where the speech information includes a speech frame sequence, the posterior probability sequence corresponds to the speech frame sequence, and each posterior probability in the posterior probability sequence represents a similarity between a syllable in the speech frame matched with that posterior probability and a predetermined syllable; and determining a target peak speech frame from the speech frame sequence based on the posterior probability sequence.
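One simple way to pick a peak frame from a per-frame posterior sequence is sketched below. The abstract does not state the selection rule, so the argmax-plus-threshold rule, the function name `target_peak_frame`, and the default `threshold` of 0.5 are all assumptions made for illustration.

```python
from typing import Optional, Sequence


def target_peak_frame(
    posteriors: Sequence[float], threshold: float = 0.5
) -> Optional[int]:
    """Return the index of the frame with the highest posterior, or None.

    posteriors[i] is assumed to be the posterior probability that frame i
    contains the predetermined syllable. The threshold is a hypothetical
    parameter, not something the abstract specifies.
    """
    if not posteriors:
        return None
    # Index of the maximum posterior; ties resolve to the earliest frame.
    peak = max(range(len(posteriors)), key=lambda i: posteriors[i])
    # Reject the peak when even the best frame is below the threshold.
    return peak if posteriors[peak] >= threshold else None
```

For example, `target_peak_frame([0.1, 0.2, 0.9, 0.4])` selects frame index 2, while a sequence whose maximum stays below the threshold yields `None`.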
-
Publication No.: US20230317060A1
Publication date: 2023-10-05
Application No.: US18328135
Filing date: 2023-06-02
Inventor: Saisai ZOU , Li CHEN , Ruoxi ZHANG , Lei JIA , Haifeng WANG
CPC classification number: G10L15/063 , G10L15/02
Abstract: The present disclosure provides a method and an apparatus for training a voice wake-up model, a method and an apparatus for voice wake-up, a device, and a storage medium, which relate to the field of artificial intelligence and particularly to the fields of deep learning and voice technology. A specific implementation includes: acquiring voice recognition training data and voice wake-up training data that have been created; first training a base model on the voice recognition training data to obtain the model parameters of the base model when a model loss function converges; then updating, based on a model configuration instruction, a configuration parameter of a decoding module in the base model to obtain a first model; and finally training the first model on the voice wake-up training data to obtain a trained voice wake-up model when the model loss function converges.
-
Publication No.: US20230096150A1
Publication date: 2023-03-30
Application No.: US18061151
Filing date: 2022-12-02
Inventor: Nan XU , Saisai ZOU , Li CHEN
IPC: G10L21/0208
Abstract: A method and an apparatus for determining an echo, and a storage medium, are provided. The implementation solution includes: obtaining an echo estimation result by performing echo estimation on an original audio signal; obtaining an optimization processing result by performing optimization processing on the echo estimation result, where the optimization processing includes at least one of amplitude dimension optimization processing, phase dimension optimization processing, or time domain dimension optimization processing; and determining an echo of the original audio signal using the optimization processing result.