发明授权
- 专利标题: Voice activity detection and pitch estimation
- 专利标题(中): 语音活动检测和音调估计
-
申请号: US13590022申请日: 2012-08-20
-
公开(公告)号: US09384759B2公开(公告)日: 2016-07-05
- 发明人: Pierre Zakarauskas , Alexander Escott , Clarence S. H. Chu , Shawn E. Stevenson
- 申请人: Pierre Zakarauskas , Alexander Escott , Clarence S. H. Chu , Shawn E. Stevenson
- 申请人地址: BB Upton, St. Michael
- 专利权人: Malaspina Labs (Barbados) Inc.
- 当前专利权人: Malaspina Labs (Barbados) Inc.
- 当前专利权人地址: BB Upton, St. Michael
- 主分类号: G10L21/00
- IPC分类号: G10L21/00 ; G10L25/00 ; G10L25/93 ; G10L15/00 ; G10L15/20 ; G10L25/78 ; G10L25/90 ; G10L25/18
摘要:
Implementations include systems, methods and/or devices operable to detect voice activity in an audible signal by detecting glottal pulses. The dominant frequency of a series of glottal pulses is perceived as the intonation pattern or melody of natural speech, which is also referred to as the pitch. However, as noted above, spoken communication typically occurs in the presence of noise and/or other interference. In turn, the undulation of voiced speech is masked in some portions of the frequency spectrum associated with human speech by the noise and/or other interference. In some implementations, detection of voice activity is facilitated by dividing the frequency spectrum associated with human speech into multiple sub-bands in order to identify glottal pulses that dominate the noise and/or other inference in particular sub-bands. Additionally and/or alternatively, in some implementations the analysis is furthered to provide a pitch estimate of the detected voice activity.
公开/授权文献
- US20130231932A1 Voice Activity Detection and Pitch Estimation 公开/授权日:2013-09-05
信息查询