- 专利标题: Voice/non-voice determination device, voice/non-voice determination model parameter learning device, voice/non-voice determination method, voice/non-voice determination model parameter learning method, and program
-
申请号: US17628467申请日: 2019-07-25
-
公开(公告)号: US11894017B2公开(公告)日: 2024-02-06
- 发明人: Ryo Masumura , Takanobu Oba , Kiyoaki Matsui
- 申请人: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
- 申请人地址: JP Tokyo
- 专利权人: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
- 当前专利权人: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
- 当前专利权人地址: JP Tokyo
- 国际申请: PCT/JP2019/029241 2019.07.25
- 国际公布: WO2021/014649A 2021.01.28
- 进入国家日期: 2022-01-19
- 主分类号: G10L25/93
- IPC分类号: G10L25/93 ; G10L25/78 ; G10L15/00 ; G10L15/02 ; G10L21/0208 ; G06N20/20 ; G06N3/044 ; G06N3/09 ; G10L17/00 ; G10L25/84
摘要:
A voice/non-voice determination device robust with respect to an acoustic signal in a high-noise environment is provided. The voice/non-voice determination device includes an acoustic scene classification unit including a first model which receives input of an acoustic signal and outputs acoustic scene information which is information regarding a scene where the acoustic signal is collected, a speech enhancement unit including a second model which receives input of the acoustic signal and outputs speech enhancement information which is information regarding the acoustic signal after enhancement, and a voice/non-voice determination unit including a third model which receives input of the acoustic signal, the acoustic scene information and the speech enhancement information and outputs a voice/non-voice label which is information regarding a label of either a speech section or a non-speech section.
公开/授权文献
信息查询