Patent search ap:("Beijing Baidu Netcom Science Technology Co. Page Ltd.") AND inv:"Cong Gao"

1.

发明申请
METHOD FOR SYNTHETIZING SPEECH AND ELECTRONIC DEVICE 有权

公开(公告)号：US20230081543A1

公开(公告)日：2023-03-16

申请号：US18057363

申请日：2022-11-21

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Bo Peng , Yongguo Kang , Cong Gao

IPC: G10L13/08 , G10L15/26 , G10L15/22 , G10L21/0232 , G10L25/18 , G10L13/047 , G10L15/02

Abstract: A method for synthetizing a speech includes: obtaining a source speech; suppressing a noise in the source speech based on an amplitude component and/or phase component of the source speech, to obtain a noise-reduced speech; performing a speech recognition process on the noise-reduced speech to obtain corresponding text information; inputting the text information of the noise-reduced speech and a preset tag into a trained acoustic model to obtain a predicted acoustic feature matching the text information; and generating a target speech based on the predicted acoustic feature.

2.

发明授权
Method and device for processing voice interaction, electronic device and storage medium 有权

公开(公告)号：US12112746B2

公开(公告)日：2024-10-08

申请号：US17476333

申请日：2021-09-15

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Jinfeng Bai , Zhijian Wang , Cong Gao

IPC: G10L15/22

CPC classification number: G10L15/22 , G10L2015/223

Abstract: The present disclosure provides a method and a device for processing voice interaction, an electronic device and a storage medium. The method includes: determining a first integrity of a voice instruction from a user by using a pre-trained integrity detection model in response to detecting that the voice instruction from the user is not a high-frequency instruction; determining a waiting duration for the voice instruction based on the first integrity and a preset integrity threshold, wherein the waiting duration for the voice instruction indicates a length of period between a time when a voice interaction device determines that receiving the voice instruction is completed and a time when the voice interaction device performs an operation in response to the voice instruction of the user; and controlling the voice interaction device to respond to the voice instruction of the user based on the waiting duration.

Patent Agency Ranking