Granted invention patent
- Patent title: Methods and apparatus for tracking speakers in an audio stream
- Patent title (Chinese): 用于跟踪音频流中的扬声器的方法和装置
- Application number: US09345238
- Filing date: 1999-06-30
- Publication number: US07739114B1
- Publication date: 2010-06-15
- Inventors: Scott Shaobing Chen, Alain Charles Louis Tritschler, Mahesh Viswanathan
- Applicants: Scott Shaobing Chen, Alain Charles Louis Tritschler, Mahesh Viswanathan
- Applicant address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current assignee: International Business Machines Corporation
- Current assignee address: US NY Armonk
- Agent: Ryan, Mason & Lewis, LLP
- Main classification: G10L21/00
- IPC classification: G10L21/00
Abstract:
Speakers are automatically identified in an audio (or video) source. The audio information is processed to identify potential segment boundaries. Homogeneous segments are clustered substantially concurrently with the segmentation routine, and a cluster identifier is assigned to each identified segment. A segmentation subroutine identifies potential segment boundaries using the BIC model selection criterion. A clustering subroutine uses a BIC model selection criterion to assign a cluster identifier to each of the identified segments. If the difference of BIC values for each model is positive, the two clusters are merged.
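The BIC-based split and merge decisions described in the abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so everything below is an assumption for illustration only: one-dimensional features modelled by a single Gaussian (practical systems typically use multi-dimensional acoustic features with full covariance matrices), a penalty weight `lam`, and the hypothetical function names `delta_bic`, `should_merge` and `find_boundary`. The sign convention is chosen to agree with the abstract: the difference is BIC(one model) minus BIC(two models), so a positive value favours merging and a negative value suggests a segment boundary.

```python
import math


def _log_var(xs):
    """Log of the maximum-likelihood variance of a list of floats."""
    n = len(xs)
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs) / n
    return math.log(max(var, 1e-12))  # floor avoids log(0) on constant data


def delta_bic(left, right, lam=1.0):
    """BIC(one Gaussian for all data) minus BIC(one Gaussian per part).

    Assumption: 1-D Gaussians, so the two-model hypothesis has 2 extra
    parameters (one mean, one variance) and the penalty difference is
    0.5 * lam * 2 * log(n) = lam * log(n).
    """
    n1, n2 = len(left), len(right)
    n = n1 + n2
    gain = (-0.5 * n * _log_var(left + right)
            + 0.5 * n1 * _log_var(left)
            + 0.5 * n2 * _log_var(right))
    return gain + lam * math.log(n)


def should_merge(cluster_a, cluster_b, lam=1.0):
    """Merge two clusters when the BIC difference is positive."""
    return delta_bic(cluster_a, cluster_b, lam) > 0


def find_boundary(xs, margin=5, lam=1.0):
    """Return the split index with the most negative BIC difference, or None."""
    best_i, best = None, 0.0
    for i in range(margin, len(xs) - margin):
        d = delta_bic(xs[:i], xs[i:], lam)
        if d < best:
            best_i, best = i, d
    return best_i


if __name__ == "__main__":
    # Deterministic toy data: two stretches with the same spread,
    # the second shifted to mimic a change of speaker.
    base = [((k * 37) % 11 - 5) / 5.0 for k in range(110)]
    first, second = base[:55], [x + 10.0 for x in base[:55]]
    print("boundary index:", find_boundary(first + second))
    print("merge first/second:", should_merge(first, second))
    print("merge two halves of base:", should_merge(base[:55], base[55:]))
```

In this sketch the same `delta_bic` function serves both routines, which mirrors the abstract's statement that segmentation and clustering both rely on the BIC model selection criterion; how the patented method interleaves the two steps is not specified here.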