Patent search ap:("BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO. Page LTD.") AND inv:"Saisai ZOU"

1.

发明申请
SPEECH WAKE-UP METHOD, ELECTRONIC DEVICE, AND STORAGE MEDIUM 有权

公开(公告)号：US20240420684A1

公开(公告)日：2024-12-19

申请号：US18706313

申请日：2023-01-17

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Saisai ZOU , Lei JIA , Haifeng WANG

IPC: G10L15/16 , G10L13/02 , G10L15/02 , G10L15/08

Abstract: A speech wake-up method, an electronic device, and a storage medium are provided. The method includes: performing a word recognition on a speech to be recognized to obtain a wake-up word recognition result (S210); performing a syllable recognition on the speech to be recognized to obtain a wake-up syllable recognition result, in response to determining that the wake-up word recognition result represents that the speech to be recognized contains a predetermined wake-up word (S220); and determining that the speech to be recognized is a correct wake-up speech, in response to determining that the wake-up syllable recognition result represents that the speech to be recognized contains a predetermined syllable (S230).

2.

发明公开
METHOD OF PROCESSING SPEECH INFORMATION, METHOD OF TRAINING MODEL, AND WAKE-UP METHOD 审中-公开

公开(公告)号：US20230360638A1

公开(公告)日：2023-11-09

申请号：US18221593

申请日：2023-07-13

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Saisai ZOU , Lei JIA , Haifeng WANG

IPC: G10L15/02 , G10L15/14

CPC classification number: G10L15/02 , G10L15/14 , G10L2015/027

Abstract: A method of processing a speech information, a method of training a speech model, a speech wake-up method, an electronic device, and a storage medium are provided, which relate to a field of artificial intelligence technology, in particular to fields of human-computer interaction, deep learning and intelligent speech technologies. A specific implementation solution includes: performing a syllable recognition on a speech information to obtain a posterior probability sequence for the speech information, where the speech information includes a speech frame sequence, the posterior probability sequence corresponds to the speech frame sequence, and each posterior probability in the posterior probability sequence represents a similarity between a syllable in a speech frame matched with the posterior probability and a predetermined syllable; and determining a target peak speech frame from the speech frame sequence based on the posterior probability sequence.

3.

发明公开
METHOD AND APPARATUS FOR TRAINING VOICE WAKE-UP MODEL, METHOD AND APPARATUS FOR VOICE WAKE-UP, DEVICE, AND STORAGE MEDIUM 审中-公开

公开(公告)号：US20230317060A1

公开(公告)日：2023-10-05

申请号：US18328135

申请日：2023-06-02

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Saisai ZOU , Li CHEN , Ruoxi ZHANG , Lei JIA , Haifeng WANG

IPC: G10L15/06 , G10L15/02

CPC classification number: G10L15/063 , G10L15/02

Abstract: The present disclosure provides a method and an apparatus for training a voice wake-up model, a method and an apparatus for voice wake-up, a device and a storage medium, which relates to the field of artificial intelligence and particularly to the field of deep learning and voice technology. A specific implementation lies in: acquiring voice recognition training data and voice wake-up training data that are created, and firstly performing training on a base model according to the voice recognition training data to obtain a model parameter of the base model when a model loss function converges; then updating, based on a model configuration instruction, a configuration parameter of a decoding module in the base model to obtain a first model; and finally performing training on the first model according to the voice wake-up training data to obtain a trained voice wake-up model when the model loss function converges.

4.

发明申请
METHOD AND APPARATUS FOR DETERMINING ECHO, AND STORAGE MEDIUM 有权

公开(公告)号：US20230096150A1

公开(公告)日：2023-03-30

申请号：US18061151

申请日：2022-12-02

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Nan XU , Saisai ZOU , Li CHEN

IPC: G10L21/0208

Abstract: A method and an apparatus for determining an echo, and a storage medium. The implementation solution includes: obtaining an echo estimation result by performing echo estimation on an original audio signal; obtaining an optimization processing result by performing optimization processing on the echo estimation result, the optimization processing includes at least one of amplitude dimension optimization processing, phase dimension optimization processing, or time domain dimension optimization processing; and determining an echo of the original audio signal using the optimization processing result.

Patent Agency Ranking