-
Publication No.: US20230081543A1
Publication date: 2023-03-16
Application No.: US18057363
Application date: 2022-11-21
Inventor: Bo Peng, Yongguo Kang, Cong Gao
IPC: G10L13/08, G10L15/26, G10L15/22, G10L21/0232, G10L25/18, G10L13/047, G10L15/02
Abstract: A method for synthesizing speech includes: obtaining a source speech; suppressing noise in the source speech based on an amplitude component and/or a phase component of the source speech, to obtain a noise-reduced speech; performing a speech recognition process on the noise-reduced speech to obtain corresponding text information; inputting the text information of the noise-reduced speech and a preset tag into a trained acoustic model to obtain a predicted acoustic feature matching the text information; and generating a target speech based on the predicted acoustic feature.
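The data flow in this abstract can be sketched as a chain of four stages. This is only an illustration under stated assumptions, not the patented implementation: the class name `SpeechSynthesisPipeline`, the stage names, and the `Waveform` and `AcousticFeature` type aliases are all hypothetical, and each stage is a pluggable callable because the abstract does not specify how any of them is built.

```python
from dataclasses import dataclass
from typing import Callable, Sequence

# Hypothetical type aliases; the abstract does not define data formats.
Waveform = Sequence[float]
AcousticFeature = Sequence[Sequence[float]]


@dataclass(frozen=True)
class SpeechSynthesisPipeline:
    """Chains the stages named in the abstract (all stage names are assumptions)."""

    suppress_noise: Callable[[Waveform], Waveform]
    recognize: Callable[[Waveform], str]
    acoustic_model: Callable[[str, str], AcousticFeature]
    generate_speech: Callable[[AcousticFeature], Waveform]

    def run(self, source_speech: Waveform, preset_tag: str) -> Waveform:
        # 1. Suppress noise in the source speech.
        noise_reduced = self.suppress_noise(source_speech)
        # 2. Recognize the text of the noise-reduced speech.
        text = self.recognize(noise_reduced)
        # 3. Predict an acoustic feature from the text and the preset tag.
        predicted_feature = self.acoustic_model(text, preset_tag)
        # 4. Generate the target speech from the predicted feature.
        return self.generate_speech(predicted_feature)
```

Keeping each stage as a plain callable lets a caller substitute any denoiser, recognizer, acoustic model, or waveform generator without changing the order of the steps.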
-
Publication No.: US12112746B2
Publication date: 2024-10-08
Application No.: US17476333
Application date: 2021-09-15
Inventor: Jinfeng Bai, Zhijian Wang, Cong Gao
IPC: G10L15/22
CPC classification number: G10L15/22, G10L2015/223
Abstract: The present disclosure provides a method and a device for processing voice interaction, an electronic device and a storage medium. The method includes: determining a first integrity of a voice instruction from a user by using a pre-trained integrity detection model in response to detecting that the voice instruction from the user is not a high-frequency instruction; determining a waiting duration for the voice instruction based on the first integrity and a preset integrity threshold, wherein the waiting duration for the voice instruction indicates the length of the period between a time when a voice interaction device determines that reception of the voice instruction is complete and a time when the voice interaction device performs an operation in response to the voice instruction of the user; and controlling the voice interaction device to respond to the voice instruction of the user based on the waiting duration.
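The waiting-duration decision in this abstract can be sketched as follows. This is a minimal illustration under several assumptions that the abstract does not state: high-frequency instructions are answered with no extra wait, an integrity at or above the threshold maps to a short wait, and a lower integrity maps to a longer wait. The instruction set, the threshold, the durations, and all names (`WaitPolicy`, `choose_waiting_duration`, `integrity_model`) are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical set of high-frequency instructions.
HIGH_FREQUENCY_INSTRUCTIONS = frozenset({"play", "pause", "next song"})


@dataclass(frozen=True)
class WaitPolicy:
    # All values below are illustrative assumptions, not taken from the patent.
    integrity_threshold: float = 0.8
    short_wait_s: float = 0.3
    long_wait_s: float = 1.5


def choose_waiting_duration(
    instruction: str,
    integrity_model: Callable[[str], float],
    policy: WaitPolicy = WaitPolicy(),
) -> float:
    """Return the seconds to wait between end of reception and the response."""
    # Assumption: high-frequency instructions skip the integrity check.
    if instruction in HIGH_FREQUENCY_INSTRUCTIONS:
        return 0.0
    # Otherwise score how complete the instruction appears to be.
    integrity = integrity_model(instruction)
    # Assumption: a complete-looking instruction gets the short wait,
    # an incomplete-looking one gets the long wait so the user can finish.
    if integrity >= policy.integrity_threshold:
        return policy.short_wait_s
    return policy.long_wait_s
```

For example, with a stand-in model that scores every instruction at 0.9, `choose_waiting_duration("call mom", lambda s: 0.9)` selects the short wait, while a model scoring 0.2 selects the long wait.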
-