- Patent title: Methods for voice enhancement
- Application number: US15471629
- Filing date: 2017-03-28
- Publication number: US10600432B1
- Publication date: 2020-03-24
- Inventors: Wai Chung Chu, Carlo Murgia, Hyeong Cheol Kim
- Applicant: Amazon Technologies, Inc.
- Applicant address: US WA Seattle
- Patentee: Amazon Technologies, Inc.
- Current patentee: Amazon Technologies, Inc.
- Current patentee address: US WA Seattle
- Agent: Pierce Atwood LLP
- Main classification: G10L21/034
- IPC classifications: G10L21/034; G10L25/84; G10L21/02; G10L25/21
Abstract:
A system configured to perform power normalization for voice enhancement. The system may identify active intervals corresponding to voice activity and may selectively amplify the active intervals in order to generate output audio data at a near uniform loudness. The system may determine a variable gain for each of the active intervals based on a desired output loudness and a flatness value, which indicates how much a signal envelope is to be modified. For example, a low flatness value corresponds to no modification, with peak active interval values corresponding to the desired output loudness and lower active intervals being lower than the desired output loudness. In contrast, a high flatness value corresponds to extensive modification, with peak active interval values and lower active interval values both corresponding to the desired output loudness. Thus, individual words may share the same peak power level.
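The relationship between the flatness value and the per-interval gain can be illustrated with a minimal sketch. The abstract does not give a formula, so everything below is an assumption made for illustration: the function name `interval_gains_db`, the flatness range of 0 to 1, the use of per-interval levels in dB, and the linear interpolation in the dB domain are all hypothetical and are not taken from the patent.

```python
def interval_gains_db(interval_levels_db, target_db, flatness):
    """Return one gain in dB for each active interval.

    Assumed mapping (not from the patent): the loudest interval is always
    brought to target_db, and flatness in [0, 1] decides how much of each
    quieter interval's shortfall relative to the loudest one is made up.
    """
    if not 0.0 <= flatness <= 1.0:
        raise ValueError("flatness must be in [0, 1]")
    if not interval_levels_db:
        return []

    peak_db = max(interval_levels_db)
    gains = []
    for level_db in interval_levels_db:
        # How far this interval sits below the loudest one (always >= 0).
        deficit_db = peak_db - level_db
        # flatness = 0 keeps the whole deficit (envelope unchanged);
        # flatness = 1 removes it (every interval lands on target_db).
        output_db = target_db - (1.0 - flatness) * deficit_db
        gains.append(output_db - level_db)
    return gains


# Hypothetical levels for three spoken words, in dB.
levels = [-30.0, -20.0, -26.0]
unchanged = interval_gains_db(levels, target_db=-18.0, flatness=0.0)
flattened = interval_gains_db(levels, target_db=-18.0, flatness=1.0)
```

Under these assumptions, a flatness of 0 yields the same gain for every interval, so only the loudest word reaches the desired output loudness, while a flatness of 1 yields a different gain per interval so that every word reaches it. Intermediate values narrow the spread between loud and quiet words without removing it.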