-
Publication number: US10978091B2
Publication date: 2021-04-13
Application number: US16205328
Application date: 2018-11-30
Applicant: ACADEMIA SINICA
Inventor: Yu Tsao , Syu-Siang Wang
IPC: G10L25/69 , G06F40/20 , G08G1/01 , G10L21/02 , G10L25/06 , G10L15/30 , G10L15/22 , G10L15/02 , G10L25/03
Abstract: A system is provided to realize suppression by selecting wavelets for feature compression in distributed speech recognition. The system comprises a first device and a second device. The first device comprises: a first network module for connecting to a network; an acoustic transducer module for recording speech and outputting frames of a recorded signal; and a first processor configured to perform the following: extracting multiple-dimensional speech features from the frames of the recorded signal to generate multiple feature sequences; applying a discrete wavelet transform (DWT) to the feature sequences to obtain a plurality of component data; and transmitting at least one of the plurality of component data via the network, wherein another one of the plurality of component data is not transmitted. The second device comprises: a second network module for connecting to the network and receiving the at least one of the plurality of component data from the first device; and a second processor configured to perform the following: updating the received data to generate updated data; and applying an inverse discrete wavelet transform (IDWT) to the updated data to obtain reconstructed speech data.
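Illustrative sketch (not taken from the patent text): the data flow in the abstract above can be approximated with a one-level Haar wavelet in plain Python. The choice of the Haar wavelet, the function names (`haar_dwt`, `haar_idwt`, `compress`, `reconstruct`), and the treatment of the "updating" step as zero-filling the untransmitted detail component are all assumptions made for illustration; the abstract does not specify them.

```python
import math

_S = 1.0 / math.sqrt(2.0)  # Haar normalization factor


def haar_dwt(seq):
    """One-level Haar DWT of a sequence -> (approximation, detail)."""
    seq = list(seq)
    if len(seq) % 2:
        seq.append(seq[-1])  # assumption: pad odd lengths by repeating the last value
    approx = [(seq[i] + seq[i + 1]) * _S for i in range(0, len(seq), 2)]
    detail = [(seq[i] - seq[i + 1]) * _S for i in range(0, len(seq), 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Inverse of haar_dwt: interleave the reconstructed sample pairs."""
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) * _S)
        out.append((a - d) * _S)
    return out


def compress(feature_sequences):
    """First device (hypothetical): transmit only the approximation component
    of each feature sequence; the detail component is not transmitted."""
    return [haar_dwt(seq)[0] for seq in feature_sequences]


def reconstruct(received):
    """Second device (hypothetical): 'update' the received data by supplying
    a zero detail component (an assumption), then apply the IDWT."""
    return [haar_idwt(a, [0.0] * len(a)) for a in received]


# Each inner list is one feature dimension tracked over four frames.
features = [[1.0, 1.0, 3.0, 3.0], [2.0, 4.0, 6.0, 8.0]]
sent = compress(features)      # half as many values per sequence
restored = reconstruct(sent)   # smoothed version of the original sequences
```

In this sketch the transmitted payload is half the size of the original feature sequences, and the reconstruction replaces each pair of neighbouring frames with their mean, which is one way dropping a component can also act as temporal smoothing. Odd-length sequences come back one sample longer because of the padding.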
-
Publication number: US11741984B2
Publication date: 2023-08-29
Application number: US17335487
Application date: 2021-06-01
Applicant: ACADEMIA SINICA
Inventor: Tsao Yu , Syu-Siang Wang , Szu-Wei Fu , Alexander Chao-Fu Kang , Hsin-Min Wang
IPC: G10L21/0272 , G10L25/48 , G10L15/20
CPC classification number: G10L21/0272 , G10L15/20 , G10L25/48
Abstract: An acoustic scene conversion method, comprising: receiving sound signals including a user's speech and scenic sounds; processing the sound signals according to an artificial intelligence model to generate enhanced speech signals without the scenic sounds; and mixing the enhanced speech signals with new scenic sounds to produce converted sound signals.
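Illustrative sketch (not taken from the patent text): the final mixing step in the abstract above could look like the following in plain Python. The enhancement model itself is out of scope here, so the input is assumed to be already-enhanced speech samples. The function name `mix_with_scene`, the `snr_db` parameter, and the choice to loop the new scene and scale it to a target speech-to-scene ratio are assumptions for illustration only.

```python
import math


def rms(samples):
    """Root-mean-square level of a non-empty list of samples."""
    return math.sqrt(sum(v * v for v in samples) / len(samples))


def mix_with_scene(enhanced_speech, new_scene, snr_db=10.0):
    """Add a new scenic sound under enhanced speech at a target SNR in dB.

    Hypothetical helper: the new scene is looped to cover the speech and
    scaled so that speech RMS / scene RMS matches snr_db.
    """
    if not enhanced_speech or not new_scene:
        return list(enhanced_speech)
    n = len(enhanced_speech)
    scene = [new_scene[i % len(new_scene)] for i in range(n)]
    speech_level, scene_level = rms(enhanced_speech), rms(scene)
    if speech_level == 0.0 or scene_level == 0.0:
        return list(enhanced_speech)  # nothing meaningful to scale against
    gain = speech_level / (scene_level * 10.0 ** (snr_db / 20.0))
    return [s + gain * b for s, b in zip(enhanced_speech, scene)]


# Toy signals standing in for enhanced speech and a new background scene.
speech = [1.0, -1.0, 1.0, -1.0]
scene = [0.5, 0.5]
converted = mix_with_scene(speech, scene, snr_db=20.0)
```

A fixed target ratio keeps the new scene audible without masking the speech; a real system might instead choose the level per scene or per call.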
-