Patent search: ap:("Hiromu Gotanda" OR "Keiichi Kaneda" OR "Takeshi Koya") AND inv:"Takeshi Koya" (page 1)

1.

Granted invention patent
Recovering method of target speech based on split spectra using sound sources' locational information (status: lapsed)

Publication No.: US07315816B2

Publication date: 2008-01-01

Application No.: US10435135

Filing date: 2003-05-09

Applicants: Hiromu Gotanda, Kazuyuki Nobu, Takeshi Koya, Keiichi Kaneda, Takaaki Ishibashi

Inventors: Hiromu Gotanda, Kazuyuki Nobu, Takeshi Koya, Keiichi Kaneda, Takaaki Ishibashi

IPC classification: G10L21/02, G10L15/20, H04B1/10

CPC classification: G10L21/0208, G10L2021/02165

Abstract: The present invention relates to a method for recovering target speech from mixed signals, which include the target speech and noise observed in a real-world environment, based on split spectra using sound sources' locational information. This method includes: the first step of receiving target speech from a target speech source and noise from a noise source and forming mixed signals of the target speech and the noise at a first microphone and at a second microphone; the second step of performing the Fourier transform of the mixed signals from a time domain to a frequency domain, decomposing the mixed signals into two separated signals UA and UB by use of the Independent Component Analysis, and, based on transmission path characteristics of the four different paths from the target speech source and the noise source to the first and second microphones, generating from the separated signal UA a pair of split spectra vA1 and vA2, which were received at the first and second microphones respectively, and from the separated signal UB another pair of split spectra vB1 and vB2, which were received at the first and second microphones respectively; and the third step of extracting a recovered spectrum of the target speech, wherein the split spectra are analyzed by applying criteria based on sound transmission characteristics that depend on the four different distances between the first and second microphones and the target speech and noise sources, and performing the inverse Fourier transform of the recovered spectrum from the frequency domain to the time domain to recover the target speech.
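
The second and third steps above can be sketched for a single frequency bin as follows. This is a minimal illustration, not the patented procedure: it assumes the separation matrix `W` has already been estimated by ICA, that the split spectra are obtained by projecting each separated signal back through the inverse of `W`, and that the target source sits nearer microphone 1 while the noise source sits nearer microphone 2. The function names `split_spectra` and `pick_target` and the toy mixing matrix `H` are hypothetical.

```python
import numpy as np


def split_spectra(W, U):
    """Project each separated signal back to the two microphones.

    W: (2, 2) complex separation matrix for one frequency bin (from ICA).
    U: (2, n_frames) separated signals [UA, UB] for that bin.
    Returns (vA, vB), each (2, n_frames): row 0 is the spectrum as
    received at microphone 1, row 1 as received at microphone 2.
    """
    W_inv = np.linalg.inv(W)
    vA = W_inv[:, [0]] * U[0]  # W^-1 applied to [UA, 0]^T -> (vA1, vA2)
    vB = W_inv[:, [1]] * U[1]  # W^-1 applied to [0, UB]^T -> (vB1, vB2)
    return vA, vB


def pick_target(vA, vB):
    """Choose the pair that is louder at microphone 1 than at microphone 2.

    Assumption: the target is nearer microphone 1 and the noise is nearer
    microphone 2, and gain falls off with distance.
    """
    dA = np.mean(np.abs(vA[0]) - np.abs(vA[1]))
    dB = np.mean(np.abs(vB[0]) - np.abs(vB[1]))
    return vA[0] if dA > dB else vB[0]


# Toy check: the ICA output is permuted and rescaled, yet the target is found.
rng = np.random.default_rng(0)
s = rng.standard_normal((2, 64)) + 1j * rng.standard_normal((2, 64))
H = np.array([[1.0, 0.4], [0.5, 1.0]], dtype=complex)  # rows: mics, cols: sources
X = H @ s                                              # mixed signals at the mics
W = np.array([[0, 1], [1, 0]]) @ np.diag([2.0, 3.0j]) @ np.linalg.inv(H)
vA, vB = split_spectra(W, W @ X)
target_at_mic1 = pick_target(vA, vB)
```

In this toy setup the split spectra are free of the scaling and ordering ambiguity that ICA leaves in `UA` and `UB`, which is why a simple level comparison between the two microphones is enough to identify the target.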

2.

Granted invention patent
Method for recovering target speech based on speech segment detection under a stationary noise (status: lapsed)

Publication No.: US07533017B2

Publication date: 2009-05-12

Application No.: US10570808

Filing date: 2004-08-31

Applicants: Hiromu Gotanda, Keiichi Kaneda, Takeshi Koya

Inventors: Hiromu Gotanda, Keiichi Kaneda, Takeshi Koya

IPC classification: G10L21/02

CPC classification: G10L21/0208, G10L25/78

Abstract: Method for recovering target speech by extracting signal components falling in a speech segment, which is determined based on separated signals obtained through the Independent Component Analysis, thereby minimizing the residual noise in the recovered target speech. The present method comprises: the first step of receiving target speech emitted from a sound source and a noise emitted from another sound source and extracting estimated spectra Y* corresponding to the target speech by use of the Independent Component Analysis; the second step of separating from the estimated spectra Y* an estimated spectrum series group y* in which the noise is removed by applying separation judgment criteria based on the kurtosis of the amplitude distribution of each of estimated spectrum series in Y*; the third step of detecting a speech segment and a noise segment of the total sum F of all the estimated spectrum series in y* by applying detection judgment criteria based on a predetermined threshold value T that is determined by the maximum value of F; and the fourth step of extracting components falling in the speech segment from the estimated spectra Y* to generate a recovered spectrum group of the target speech for recovering the target speech.
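
Steps two to four above can be sketched as follows, starting from estimated spectra Y* that are assumed to be available already. This is only an illustration under stated assumptions: each row of `Y` is treated as one spectrum series over time, a series is kept in y* when the excess kurtosis of its amplitudes exceeds `kurt_min`, and the threshold is taken as `T = beta * max(F)`. The names `excess_kurtosis` and `recover_speech_segment` and the values of `kurt_min` and `beta` are hypothetical, not taken from the patent.

```python
import numpy as np


def excess_kurtosis(a, axis=-1):
    """Excess kurtosis (fourth standardized moment minus 3) along an axis."""
    d = a - a.mean(axis=axis, keepdims=True)
    m2 = np.mean(d**2, axis=axis)
    m4 = np.mean(d**4, axis=axis)
    return m4 / np.maximum(m2**2, 1e-30) - 3.0


def recover_speech_segment(Y, kurt_min=0.0, beta=0.1):
    """Y: (n_bins, n_frames) complex estimated spectra Y*.

    kurt_min and beta are hypothetical tuning values.
    Returns (recovered spectra, boolean speech-frame mask).
    """
    amp = np.abs(Y)
    keep = excess_kurtosis(amp, axis=1) > kurt_min  # series group y*
    F = amp[keep].sum(axis=0)                       # total sum F per frame
    T = beta * F.max()                              # threshold set from max of F
    speech = F > T                                  # speech vs. noise segment
    return Y * speech, speech                       # zero the noise-segment frames


# Toy data: two speech-dominated series and two stationary-noise series.
amp = np.full((4, 200), 0.01)
amp[:2, 50:80] = 1.0                  # speech present only in frames 50..79
amp[2:] = np.tile([0.4, 0.6], 100)    # noise present in every frame
recovered, speech = recover_speech_segment(amp.astype(complex))
```

Series dominated by stationary noise have a flat amplitude distribution and low kurtosis, so they are left out of F. The segment boundary is therefore decided only by the speech-dominated series, while the final extraction is still applied to all of Y*.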

3.

Invention application
Method for recovering target speech based on speech segment detection under a stationary noise (status: lapsed)

Publication No.: US20070055511A1

Publication date: 2007-03-08

Application No.: US10570808

Filing date: 2004-08-31

Applicants: Hiromu Gotanda, Keiichi Kaneda, Takeshi Koya

Inventors: Hiromu Gotanda, Keiichi Kaneda, Takeshi Koya

IPC classification: G10L15/20

CPC classification: G10L21/0208, G10L25/78

Abstract: Method for recovering target speech by extracting signal components falling in a speech segment, which is determined based on separated signals obtained through the Independent Component Analysis, thereby minimizing the residual noise in the recovered target speech. The present method comprises: the first step of receiving target speech emitted from a sound source and a noise emitted from another sound source and extracting estimated spectra Y* corresponding to the target speech by use of the Independent Component Analysis; the second step of separating from the estimated spectra Y* an estimated spectrum series group y* in which the noise is removed by applying separation judgment criteria based on the kurtosis of the amplitude distribution of each of estimated spectrum series in Y*; the third step of detecting a speech segment and a noise segment of the total sum F of all the estimated spectrum series in y* by applying detection judgment criteria based on a predetermined threshold value T that is determined by the maximum value of F; and the fourth step of extracting components falling in the speech segment from the estimated spectra Y* to generate a recovered spectrum group of the target speech for recovering the target speech.

4.

Granted invention patent
Method for recovering target speech based on amplitude distributions of separated signals (status: lapsed)

Publication No.: US07562013B2

Publication date: 2009-07-14

Application No.: US10572427

Filing date: 2004-08-31

Applicants: Hiromu Gotanda, Keiichi Kaneda, Takeshi Koya

Inventors: Hiromu Gotanda, Keiichi Kaneda, Takeshi Koya

IPC classification: G10L21/02

CPC classification: G10L21/0272, G10L25/27

Abstract: The present invention provides a method for recovering target speech based on shapes of amplitude distributions of split spectra obtained by use of blind signal separation. This method includes: a first step of receiving target speech emitted from a sound source and a noise emitted from another sound source and forming mixed signals of the target speech and the noise at a first microphone and at a second microphone; a second step of performing the Fourier transform of the mixed signals from the time domain to the frequency domain, decomposing the mixed signals into two separated signals U1 and U2 by use of the Independent Component Analysis, and, based on transmission path characteristics of the four different paths from the two sound sources to the first and second microphones, generating the split spectra v11, v12, v21 and v22 from the separated signals U1 and U2; and a third step of extracting estimated spectra Z* corresponding to the target speech to generate a recovered spectrum group of the target speech, wherein the split spectra v11, v12, v21, and v22 are analyzed by applying criteria based on the shape of the amplitude distribution of each of the split spectra v11, v12, v21, and v22, and performing the inverse Fourier transform of the recovered spectrum group from the frequency domain to the time domain to recover the target speech.
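
The third step above selects the target by the shape of each split spectrum's amplitude distribution. The abstract does not name the shape measure, so the sketch below assumes one: the Shannon entropy of a histogram of peak-normalized amplitudes, on the reasoning that speech amplitudes cluster near zero with occasional peaks (low entropy) while noise amplitudes are spread more evenly (higher entropy). The functions `amplitude_entropy` and `pick_by_shape`, the bin count, and the rule of comparing summed entropies per pair are all hypothetical.

```python
import numpy as np


def amplitude_entropy(v, n_bins=32):
    """Shannon entropy (nats) of the histogram of peak-normalized |v|."""
    a = np.abs(v)
    peak = a.max()
    if peak > 0:
        a = a / peak
    hist, _ = np.histogram(a, bins=n_bins, range=(0.0, 1.0))
    p = hist[hist > 0] / hist.sum()
    return float(-(p * np.log(p)).sum())


def pick_by_shape(v11, v12, v21, v22):
    """Return the microphone-1 spectrum of the pair with the peakier shape."""
    e1 = amplitude_entropy(v11) + amplitude_entropy(v12)
    e2 = amplitude_entropy(v21) + amplitude_entropy(v22)
    return v11 if e1 < e2 else v21


# Toy series for one frequency bin.
speech_like = np.full(200, 0.01)
speech_like[50:80] = 1.0                  # mostly quiet, occasionally loud
noise_like = np.linspace(0.0, 1.0, 200)   # amplitudes spread evenly
chosen = pick_by_shape(speech_like, 0.5 * speech_like,
                       noise_like, 0.4 * noise_like)
```

Because the amplitudes are normalized by their peak before the histogram is built, the measure depends only on the shape of the distribution and not on the level at either microphone.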

5.

Invention application
Method for recovering target speech based on amplitude distributions of separated signals (status: lapsed)

Publication No.: US20070100615A1

Publication date: 2007-05-03

Application No.: US10572427

Filing date: 2004-08-31

Applicants: Hiromu Gotanda, Keiichi Kaneda, Takeshi Koya

Inventors: Hiromu Gotanda, Keiichi Kaneda, Takeshi Koya

IPC classification: G10L21/02

CPC classification: G10L21/0272, G10L25/27

Abstract: The present invention provides a method for recovering target speech based on shapes of amplitude distributions of split spectra obtained by use of blind signal separation. This method includes: a first step of receiving target speech emitted from a sound source and a noise emitted from another sound source and forming mixed signals of the target speech and the noise at a first microphone and at a second microphone; a second step of performing the Fourier transform of the mixed signals from the time domain to the frequency domain, decomposing the mixed signals into two separated signals U1 and U2 by use of the Independent Component Analysis, and, based on transmission path characteristics of the four different paths from the two sound sources to the first and second microphones, generating the split spectra v11, v12, v21 and v22 from the separated signals U1 and U2; and a third step of extracting estimated spectra Z* corresponding to the target speech to generate a recovered spectrum group of the target speech, wherein the split spectra v11, v12, v21, and v22 are analyzed by applying criteria based on the shape of the amplitude distribution of each of the split spectra v11, v12, v21, and v22, and performing the inverse Fourier transform of the recovered spectrum group from the frequency domain to the time domain to recover the target speech.
