- 专利标题: EXTRACTION OF TARGET SPEECHES
-
申请号: US15440773申请日: 2017-02-23
-
公开(公告)号: US20170278524A1公开(公告)日: 2017-09-28
- 发明人: Takashi Fukuda , Osamu Ichikawa
- 申请人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 主分类号: G10L21/028
- IPC分类号: G10L21/028 ; G10L21/0264 ; G10L15/14
摘要:
Methods and systems are provided for separating a target speech from a plurality of other speeches having different directions of arrival. One of the methods includes obtaining speech signals from speech input devices disposed apart in predetermined distances from one another, calculating a direction of arrival of target speeches and directions of arrival of other speeches other than the target speeches for each of at least one pair of speech input devices, calculating an aliasing metric, wherein the aliasing metric indicates which frequency band of speeches is susceptible to spatial aliasing, enhancing speech signals arrived from the direction of arrival of the target speech signals, based on the speech signals and the direction of arrival of the target speeches, to generate the enhanced speech signals, reading a probability model, and inputting the enhanced speech signals and the aliasing metric to the probability model to output target speeches.
公开/授权文献
- US09818428B2 Extraction of target speeches 公开/授权日:2017-11-14
信息查询