EXTRACTION OF TARGET SPEECHES

Invention Application

US20170278524A1 EXTRACTION OF TARGET SPEECHES 有权

Please log in to see more content

Patent Title: EXTRACTION OF TARGET SPEECHES
Application No.: US15440773

Application Date: 2017-02-23
Publication No.: US20170278524A1

Publication Date: 2017-09-28
Inventor: Takashi Fukuda , Osamu Ichikawa
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Main IPC: G10L21/028
IPC: G10L21/028 ; G10L21/0264 ; G10L15/14

Abstract:

Methods and systems are provided for separating a target speech from a plurality of other speeches having different directions of arrival. One of the methods includes obtaining speech signals from speech input devices disposed apart in predetermined distances from one another, calculating a direction of arrival of target speeches and directions of arrival of other speeches other than the target speeches for each of at least one pair of speech input devices, calculating an aliasing metric, wherein the aliasing metric indicates which frequency band of speeches is susceptible to spatial aliasing, enhancing speech signals arrived from the direction of arrival of the target speech signals, based on the speech signals and the direction of arrival of the target speeches, to generate the enhanced speech signals, reading a probability model, and inputting the enhanced speech signals and the aliasing metric to the probability model to output target speeches.

Public/Granted literature

US09818428B2 Extraction of target speeches Public/Granted day:2017-11-14

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L21/00	为了改变语音或声音信号的质量或其可识度而处理语音或声音信号，以产生另一种可听的或非可听的信号，例如视觉信号或触觉信号（G10L19/00优先）
G10L21/02	.语音增强，例如降低噪声或消除回声（在直线传送系统中减轻回声效应入H04B3/20；免提电话中的回声抑制入H04M9/08）
G10L21/0272	..声音信号的分离
G10L21/028	...采用声源的属性