- 专利标题: Acoustic change detection for robust automatic speech recognition based on a variance between distance dependent GMM models
-
申请号: US15861037申请日: 2018-01-03
-
公开(公告)号: US10783882B2公开(公告)日: 2020-09-22
- 发明人: Osamu Ichikawa , Gakuto Kurata , Takashi Fukuda
- 申请人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 代理机构: Tutunjian & Bitetto, P.C.
- 代理商 Vazken Alexanian
- 主分类号: H04M1/725
- IPC分类号: H04M1/725 ; G10L15/20 ; G10L15/02 ; G10L15/08 ; G10L15/06 ; G10L15/07 ; G10L25/51 ; G10L25/24 ; G10L25/27
摘要:
Acoustic change is detected by a method including preparing a first Gaussian Mixture Model (GMM) trained with first audio data of first speech sound from a speaker at a first distance from an audio interface and a second GMM generated from the first GMM using second audio data of second speech sound from the speaker at a second distance from the audio interface; calculating a first output of the first GMM and a second output of the second GMM by inputting obtained third audio data into the first GMM and the second GMM; and transmitting a notification in response to determining at least that a difference between the first output and the second output exceeds a threshold. Each Gaussian distribution of the second GMM has a mean obtained by shifting a mean of a corresponding Gaussian distribution of the first GMM by a common channel bias.
公开/授权文献
信息查询