-
Publication No.: US20210390971A1
Publication Date: 2021-12-16
Application No.: US17335487
Filing Date: 2021-06-01
Applicant: ACADEMIA SINICA
Inventor: TSAO YU, SYU-SIANG WANG, SZU-WEI FU, ALEXANDER CHAO-FU KANG, HSIN-MIN WANG
IPC: G10L21/0272, G10L25/48, G10L15/20
Abstract: An acoustic scene conversion method, comprising: receiving sound signals including a user's speech and scenic sounds; processing the sound signals according to an artificial intelligence model to generate enhanced speech signals without the scenic sounds; and mixing the enhanced speech signals with new scenic sounds to produce converted sound signals.
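The enhance-then-mix flow in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say how the new scenic sounds are scaled, so the target speech-to-scene ratio (`snr_db`), the looping of a short scene clip, and the function names `mix_with_scene` and `convert_scene` are all assumptions made for illustration. The artificial intelligence model is stood in for by a hypothetical `enhance` callable supplied by the caller.

```python
import math


def rms(samples):
    """Root-mean-square level of a sequence of samples."""
    return math.sqrt(sum(v * v for v in samples) / len(samples))


def mix_with_scene(enhanced, scene, snr_db=10.0):
    """Add a new scene to enhanced speech at an assumed speech-to-scene ratio (dB)."""
    enhanced = list(enhanced)
    scene = list(scene)
    # Assumption: loop the scene clip if it is shorter than the speech.
    repeats = -(-len(enhanced) // len(scene))  # ceiling division
    scene = (scene * repeats)[:len(enhanced)]
    scene_rms = rms(scene)
    if scene_rms == 0.0:
        return enhanced  # a silent scene leaves the speech unchanged
    # Scale the scene so that rms(speech) / rms(scaled scene) matches snr_db.
    gain = rms(enhanced) / (scene_rms * 10 ** (snr_db / 20.0))
    return [s + gain * n for s, n in zip(enhanced, scene)]


def convert_scene(noisy, enhance, new_scene, snr_db=10.0):
    """Receive, enhance (hypothetical model), then mix with the new scene."""
    enhanced = enhance(noisy)  # placeholder for the AI enhancement model
    return mix_with_scene(enhanced, new_scene, snr_db)
```

In this sketch any callable that maps a noisy sample list to a cleaned one can be passed as `enhance`; a trained speech-enhancement network would take its place in practice.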
-
2.
Publication No.: US20190287551A1
Publication Date: 2019-09-19
Application No.: US16205328
Filing Date: 2018-11-30
Applicant: ACADEMIA SINICA
Inventor: Yu TSAO, SYU-SIANG WANG
Abstract: A system is provided to realize suppression by selecting wavelets for feature compression in distributed speech recognition. The system comprises a first device and a second device. The first device comprises: a first network module for connecting to a network; an acoustic transducer module for recording speech and outputting frames of a recorded signal; and a first processor configured for the following: extracting multiple-dimensional speech features from the frames of the recorded signal to generate multiple feature sequences; applying a discrete wavelet transform (DWT) to the feature sequences to obtain a plurality of component data; and transmitting at least one of the plurality of component data via the network, wherein another one of the plurality of component data is not transmitted. The second device comprises: a second network module for connecting to the network and receiving the at least one of the plurality of component data from the first device; and a second processor configured for the following: updating the received data to generate updated data; and applying an inverse discrete wavelet transform (IDWT) to the updated data to obtain reconstructed speech data.
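The split between the two devices can be sketched as follows. This is an illustration under stated assumptions rather than the claimed system: the abstract does not name a wavelet, so a one-level Haar transform is assumed; the transmitted component is assumed to be the approximation (low-frequency) half; the untransmitted detail half is assumed to be replaced by zeros at the receiver; and the "updating" step is left as a caller-supplied placeholder. The names `client_compress` and `server_reconstruct` are hypothetical.

```python
import math

SQRT2 = math.sqrt(2.0)


def haar_dwt(seq):
    """One-level Haar DWT of a feature sequence: returns (approx, detail)."""
    seq = list(seq)
    if len(seq) % 2:
        seq.append(seq[-1])  # pad odd-length input by repeating the last value
    approx = [(seq[i] + seq[i + 1]) / SQRT2 for i in range(0, len(seq), 2)]
    detail = [(seq[i] - seq[i + 1]) / SQRT2 for i in range(0, len(seq), 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Inverse of haar_dwt: interleaves the reconstructed sample pairs."""
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) / SQRT2)
        out.append((a - d) / SQRT2)
    return out


def client_compress(feature_seq):
    """First device: keep only the approximation component for transmission."""
    approx, _detail = haar_dwt(feature_seq)  # detail is not transmitted
    return approx


def server_reconstruct(received, update=lambda data: data):
    """Second device: update the received data, then apply the IDWT.

    `update` is a placeholder (identity by default) for the unspecified
    updating step; the missing detail component is assumed to be zero.
    """
    updated = update(received)
    return haar_idwt(updated, [0.0] * len(updated))
```

With these assumptions, sending only the approximation component halves the number of values transmitted per feature dimension, and the reconstruction is a smoothed version of the original sequence in which each pair of adjacent frames is replaced by its mean.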
-