Mixed audio separation apparatus
    1.
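    Granted invention patent
    Mixed audio separation apparatus (legal status: in force)

    Publication number: US07974420B2

    Publication date: 2011-07-05

    Application number: US11665265

    Filing date: 2006-04-11

    IPC classification: H03G5/00 H04B15/00

    CPC classification: G10L21/0272 G10L19/0204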

    Abstract: A mixed audio separation system (100) which separates a specific audio from among a mixed audio (S100) includes a local frequency information generation unit (105) which obtains pieces of local frequency information (S103) corresponding to local reference waveforms (S102), based on the local reference waveforms (S102) and an analysis waveform which is the waveform of the mixed audio (S100). Each of the local reference waveforms (S102) (i) constitutes a part of a reference waveform for analyzing a predetermined frequency, (ii) has a predetermined temporal/spatial resolution and (iii) includes at least one of an amplitude spectrum and a phase spectrum in the predetermined frequency. The system includes: a specific audio's frequency feature value extraction unit (106) which performs pattern matching between a first set which is the pieces of local frequency information and a second set of pieces of frequency information (S103) of a predetermined specific audio, and extracts the first set of the pieces of local frequency information (S103), based on a result of the pattern matching; and an audio signal generation unit which generates a signal of the specific audio, based on the first set of the pieces of local frequency information (S103) extracted by the specific audio's frequency feature value extraction unit.
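
One way to picture the "local frequency information" step is sketched below in Python. This is only an illustration under stated assumptions, not the patented method: the reference waveform is assumed to be a complex sinusoid at the analysis frequency, the local reference waveforms are assumed to be consecutive non-overlapping pieces of it, and the function name `local_frequency_info` and its parameters are hypothetical. The pattern-matching and signal-generation units are not shown.

```python
import cmath
import math


def local_frequency_info(signal, freq_hz, sample_rate, piece_len):
    """Return one (amplitude, phase) pair per local piece of the signal.

    Each piece of the signal is correlated with the matching piece of a
    complex sinusoid at freq_hz (an assumed form of the reference waveform).
    """
    infos = []
    for start in range(0, len(signal) - piece_len + 1, piece_len):
        acc = 0j
        for n in range(start, start + piece_len):
            # Local reference waveform: a short piece of exp(-j*2*pi*f*n/fs).
            acc += signal[n] * cmath.exp(-2j * math.pi * freq_hz * n / sample_rate)
        acc *= 2 / piece_len  # scale so a unit-amplitude cosine gives about 1
        infos.append((abs(acc), cmath.phase(acc)))
    return infos


# Example input: a 100 Hz cosine sampled at 800 Hz, analyzed in 8-sample pieces.
tone = [math.cos(2 * math.pi * 100 * n / 800) for n in range(16)]
info = local_frequency_info(tone, freq_hz=100, sample_rate=800, piece_len=8)
```

Short pieces trade frequency resolution for time resolution, which is roughly the role the abstract assigns to the "predetermined temporal/spatial resolution".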

    Sound identification apparatus
    2.
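    Granted invention patent
    Sound identification apparatus (legal status: in force)

    Publication number: US07473838B2

    Publication date: 2009-01-06

    Application number: US11783376

    Filing date: 2007-04-09

    IPC classification: G10H1/00 G06F17/00

    CPC classification: G10L25/48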

    Abstract: A sound identification apparatus which reduces the chance of a drop in the identification rate, including: a frame sound feature extraction unit which extracts a sound feature per frame of an inputted audio signal; a frame likelihood calculation unit which calculates a frame likelihood of the sound feature in each frame, for each of a plurality of sound models; a confidence measure judgment unit which judges a confidence measure based on the frame likelihood; a cumulative likelihood output unit time determination unit which determines a cumulative likelihood output unit time based on the confidence measure; a cumulative likelihood calculation unit which calculates a cumulative likelihood in which the frame likelihoods of the frames included in the cumulative likelihood output unit time are cumulated, for each sound model; a sound type candidate judgment unit which determines, for each cumulative likelihood output unit time, a sound type corresponding to the sound model that has a maximum cumulative likelihood; a sound type frequency calculation unit which calculates the frequency of the sound type candidate; and a sound type interval determination unit which determines the sound type of the inputted audio signal and the interval of the sound type, based on the frequency of the sound type.
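
The cumulative-likelihood and frequency-counting steps can be sketched roughly as follows. This is a simplified Python illustration, not the apparatus itself: it assumes per-frame log-likelihoods are already available for each sound model, and it uses a fixed unit length, whereas the abstract determines the unit time from a confidence measure. The function names are hypothetical.

```python
from collections import Counter


def unit_candidates(frame_loglik, unit_len):
    """Pick, for each unit of frames, the model with the largest summed log-likelihood.

    frame_loglik: list of dicts mapping model name -> log-likelihood of that frame.
    unit_len: number of frames per unit (fixed here; an assumption of this sketch).
    """
    candidates = []
    for start in range(0, len(frame_loglik), unit_len):
        totals = {}
        for frame in frame_loglik[start:start + unit_len]:
            for model, loglik in frame.items():
                totals[model] = totals.get(model, 0.0) + loglik
        candidates.append(max(totals, key=totals.get))
    return candidates


def most_frequent_type(candidates):
    """Return the sound type that appears most often among the unit candidates."""
    return Counter(candidates).most_common(1)[0][0]


frames = [
    {"speech": -1.0, "music": -2.0},
    {"speech": -1.5, "music": -1.0},
    {"speech": -0.5, "music": -3.0},
    {"speech": -2.0, "music": -0.5},
    {"speech": -3.0, "music": -1.0},
    {"speech": -2.0, "music": -1.0},
]
cands = unit_candidates(frames, unit_len=2)
winner = most_frequent_type(cands)
```

Voting over several units is what makes the decision tolerant of a single unit being misjudged.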

    Target sound analysis apparatus, target sound analysis method and target sound analysis program
    3.
    Invention patent application
    Target sound analysis apparatus, target sound analysis method and target sound analysis program (legal status: in force)

    Publication number: US20080304672A1

    Publication date: 2008-12-11

    Application number: US11902731

    Filing date: 2007-09-25

    IPC classification: H04R29/00

    Abstract: A target sound analysis apparatus analyzes whether or not a target sound is included in an evaluation sound, and is capable of distinguishing the target sound from a different sound that has the same fundamental period as the target sound. The apparatus includes: a target sound preparation unit that prepares a target sound that is an analysis waveform to be used for analyzing a fundamental period; an evaluation sound preparation unit that prepares an evaluation sound that is an analyzed waveform whose fundamental period will be analyzed; and an analysis unit that temporally shifts the target sound with respect to the evaluation sound to sequentially calculate differential values of the evaluation sound and the target sound at corresponding points in time, calculates an iterative interval between the points in time where the differential value is equal to or lower than a predetermined threshold value, and judges whether or not the target sound exists in the evaluation sound based on a period of the iterative interval and the fundamental period of the target sound.
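
The shift-and-compare analysis can be illustrated with a short Python sketch. It rests on assumptions not stated in the record: the differential value is taken to be the mean absolute difference over the target's length, the period is expressed in samples, and the final judgment simply checks whether any interval between matches equals the fundamental period within a tolerance. The function names are hypothetical.

```python
def match_positions(evaluation, target, threshold):
    """Return the shifts at which the target differs from the evaluation sound by at most threshold."""
    n = len(target)
    hits = []
    for shift in range(len(evaluation) - n + 1):
        # Differential value: mean absolute difference at this shift (an assumed measure).
        diff = sum(abs(evaluation[shift + i] - target[i]) for i in range(n)) / n
        if diff <= threshold:
            hits.append(shift)
    return hits


def contains_target(evaluation, target, period, threshold, tolerance=0):
    """Judge presence by comparing the spacing of matches with the fundamental period."""
    hits = match_positions(evaluation, target, threshold)
    intervals = [b - a for a, b in zip(hits, hits[1:])]
    return any(abs(interval - period) <= tolerance for interval in intervals)


target = [0, 1, 0, -1]                 # one cycle, fundamental period of 4 samples
present = contains_target(target * 3, target, period=4, threshold=0.1)
absent = contains_target([0.5] * 12, target, period=4, threshold=0.1)
```

Comparing whole waveform segments rather than only the period is what lets this kind of check reject a sound that shares the fundamental period but has a different shape.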

    Audio restoration apparatus and audio restoration method
    4.
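    Invention patent application
    Audio restoration apparatus and audio restoration method (legal status: in force)

    Publication number: US20060193671A1

    Publication date: 2006-08-31

    Application number: US11401263

    Filing date: 2006-04-11

    IPC classification: B41J35/28

    CPC classification: G10L19/005 G10L21/0208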

    Abstract: An audio restoration apparatus which restores an audio to be restored having a missing audio part and being included in a mixed audio. The audio restoration apparatus includes: a mixed audio separation unit which extracts the audio to be restored included in the mixed audio; an audio structure analysis unit which generates at least one of a phoneme sequence, a character sequence and a musical note sequence of the missing audio part in the extracted audio to be restored, based on an audio structure knowledge database in which semantics of audio are registered; an unchanged audio characteristic domain analysis unit which segments the extracted audio to be restored into time domains in each of which an audio characteristic remains unchanged; an audio characteristic extraction unit which identifies a time domain where the missing audio part is located, from among the segmented time domains, and extracts audio characteristics of the identified time domain in the audio to be restored; and an audio restoration unit which restores the missing audio part in the audio to be restored, using the extracted audio characteristics and the generated one or more of the phoneme sequence, the character sequence and the musical note sequence.

    Target sound analysis apparatus, target sound analysis method and target sound analysis program
    5.
    Granted invention patent
    Target sound analysis apparatus, target sound analysis method and target sound analysis program (legal status: in force)

    Publication number: US08223978B2

    Publication date: 2012-07-17

    Application number: US11902731

    Filing date: 2007-09-25

    IPC classification: H04R29/00

    Abstract: A target sound analysis apparatus analyzes whether or not a target sound is included in an evaluation sound, and is capable of distinguishing the target sound from a different sound that has the same fundamental period as the target sound. The apparatus includes: a target sound preparation unit that prepares a target sound that is an analysis waveform to be used for analyzing a fundamental period; an evaluation sound preparation unit that prepares an evaluation sound that is an analyzed waveform whose fundamental period will be analyzed; and an analysis unit that temporally shifts the target sound with respect to the evaluation sound to sequentially calculate differential values of the evaluation sound and the target sound at corresponding points in time, calculates an iterative interval between the points in time where the differential value is equal to or lower than a predetermined threshold value, and judges whether or not the target sound exists in the evaluation sound based on a period of the iterative interval and the fundamental period of the target sound.

    Audio restoration apparatus and audio restoration method
    6.
    Granted invention patent
    Audio restoration apparatus and audio restoration method (legal status: in force)

    Publication number: US07536303B2

    Publication date: 2009-05-19

    Application number: US11401263

    Filing date: 2006-04-11

    IPC classification: G10L21/02 G10L15/20

    CPC classification: G10L19/005 G10L21/0208

    Abstract: An audio restoration apparatus is provided which restores an audio to be restored having a missing audio part and being included in a mixed audio. The audio restoration apparatus includes: a mixed audio separation unit which extracts the audio to be restored included in the mixed audio; an audio structure analysis unit which generates at least one of a phoneme sequence, a character sequence and a musical note sequence of the missing audio part; an unchanged audio characteristic domain analysis unit which segments the extracted audio to be restored into time domains in each of which an audio characteristic remains unchanged; an audio characteristic extraction unit which identifies a time domain where the missing audio part is located, and extracts audio characteristics of the identified time domain in the audio to be restored; and an audio restoration unit which restores the missing audio part in the audio to be restored.

    Sound identification apparatus
    7.
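    Invention patent application
    Sound identification apparatus (legal status: in force)

    Publication number: US20070192099A1

    Publication date: 2007-08-16

    Application number: US11783376

    Filing date: 2007-04-09

    IPC classification: G10L15/00

    CPC classification: G10L25/48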

    Abstract: A sound identification apparatus which reduces the chance of a drop in the identification rate, including: a frame sound feature extraction unit which extracts a sound feature per frame of an inputted audio signal; a frame likelihood calculation unit which calculates a frame likelihood of the sound feature in each frame, for each of a plurality of sound models; a confidence measure judgment unit which judges a confidence measure based on the frame likelihood; a cumulative likelihood output unit time determination unit which determines a cumulative likelihood output unit time based on the confidence measure; a cumulative likelihood calculation unit which calculates a cumulative likelihood in which the frame likelihoods of the frames included in the cumulative likelihood output unit time are cumulated, for each sound model; a sound type candidate judgment unit which determines, for each cumulative likelihood output unit time, a sound type corresponding to the sound model that has a maximum cumulative likelihood; a sound type frequency calculation unit which calculates the frequency of the sound type candidate; and a sound type interval determination unit which determines the sound type of the inputted audio signal and the interval of the sound type, based on the frequency of the sound type.

    Mixed audio separation apparatus
    8.
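    Invention patent application
    Mixed audio separation apparatus (legal status: in force)

    Publication number: US20090067647A1

    Publication date: 2009-03-12

    Application number: US11665265

    Filing date: 2006-04-11

    IPC classification: H04B1/00 G10L19/00

    CPC classification: G10L21/0272 G10L19/0204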

    Abstract: A mixed audio separation system (100) which separates a specific audio from among a mixed audio (S100) includes a local frequency information generation unit (105) which obtains pieces of local frequency information (S103) corresponding to local reference waveforms (S102), based on the local reference waveforms (S102) and an analysis waveform which is the waveform of the mixed audio (S100). Each of the local reference waveforms (S102) (i) constitutes a part of a reference waveform for analyzing a predetermined frequency, (ii) has a predetermined temporal/spatial resolution and (iii) includes at least one of an amplitude spectrum and a phase spectrum in the predetermined frequency. The system includes: a specific audio's frequency feature value extraction unit (106) which performs pattern matching between a first set which is the pieces of local frequency information and a second set of pieces of frequency information (S103) of a predetermined specific audio, and extracts the first set of the pieces of local frequency information (S103), based on a result of the pattern matching; and an audio signal generation unit which generates a signal of the specific audio, based on the first set of the pieces of local frequency information (S103) extracted by the specific audio's frequency feature value extraction unit.

    AUDIO SOURCE DIRECTION DETECTING DEVICE
    9.
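    Invention patent application
    AUDIO SOURCE DIRECTION DETECTING DEVICE (legal status: in force)

    Publication number: US20100303254A1

    Publication date: 2010-12-02

    Application number: US12446499

    Filing date: 2008-09-10

    IPC classification: H04R3/00

    CPC classification: G01S3/8083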

    Abstract: A sound source direction detector comprises: FFT analysis sections (103(1) to 103(3)) which generate, for each of the acoustic signals collected by two or more microphones arranged apart from one another, a frequency spectrum in at least one frequency band of the acoustic signal; detection sound identifying sections (104(1) to 104(3)) which identify, from the frequency spectrum in the frequency band, a time portion of the frequency spectrum of a detection sound whose sound source direction is to be obtained; and a direction detecting section (105) which obtains the difference between the times at which the detection sound reaches the microphones, obtains the sound source direction from the time difference, the distance between the microphones, and the sound velocity, and outputs the sound source direction depending on the degree of coincidence, between the microphones, of the frequency spectrum in the time portion identified by the detection sound identifying sections (104(1) to 104(3)) within a time interval which is the time unit for detecting the sound source direction.
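
The geometric step, obtaining a direction from the arrival-time difference, the microphone spacing and the sound velocity, can be sketched in Python as below. The sketch assumes a far-field source and a single pair of microphones, measures the angle from the broadside direction (perpendicular to the line joining the microphones), and uses 343 m/s as a default speed of sound. The function name is hypothetical, and the spectrum-coincidence check described in the abstract is not modelled.

```python
import math


def arrival_angle_deg(time_diff_s, mic_distance_m, sound_speed_mps=343.0):
    """Estimate the source angle in degrees from broadside for one microphone pair.

    Far-field model: path difference = sound speed * time difference
                                     = mic distance * sin(angle).
    """
    ratio = sound_speed_mps * time_diff_s / mic_distance_m
    # Clamp against measurement noise so asin stays within its domain.
    ratio = max(-1.0, min(1.0, ratio))
    return math.degrees(math.asin(ratio))


# A delay equal to half the maximum possible delay corresponds to 30 degrees.
half_delay = 0.5 * 0.2 / 343.0
angle = arrival_angle_deg(half_delay, mic_distance_m=0.2)
```

With more than two microphones, as in the three-section arrangement the abstract describes, several pairwise estimates could be combined, but that combination is outside this sketch.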

    Speech recognition apparatus and speech recognition method
    10.
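    Invention patent application
    Speech recognition apparatus and speech recognition method (legal status: in force)

    Publication number: US20060100876A1

    Publication date: 2006-05-11

    Application number: US11296268

    Filing date: 2005-12-08

    IPC classification: G10L15/18

    CPC classification: G10L15/32 G10L15/183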

    Abstract: Provided is a speech recognition apparatus which appropriately performs speech recognition by generating, in real time, language models adapted to a new topic even when the topic changes. The speech recognition apparatus includes: a word specification unit for obtaining and specifying a word; a language model information storage unit for storing language models for recognizing speech and the respectively corresponding pieces of tag information; a combination coefficient calculation unit for calculating the weights of the respective language models, as combination coefficients, according to the word obtained by the word specification unit, based on the relevance degree between the word obtained by the word specification unit and the tag information of each language model; a language probability calculation unit for calculating the probabilities of word appearance by combining the respective language models according to the calculated combination coefficients; and a speech recognition unit for recognizing speech using the calculated probabilities of word appearance.
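
Combining language models with per-model weights can be illustrated as a linear interpolation, sketched here in Python. The assumptions are considerable: each language model is reduced to a unigram table (a dict of word probabilities), the relevance degrees are given as non-negative numbers, and the combination coefficients are obtained by normalizing them to sum to one. The record does not state how relevance is computed, and the function names are hypothetical.

```python
def combination_coefficients(relevance):
    """Normalize relevance degrees into weights that sum to 1 (uniform if all are zero)."""
    total = sum(relevance.values())
    if total == 0:
        return {name: 1.0 / len(relevance) for name in relevance}
    return {name: value / total for name, value in relevance.items()}


def combined_probability(word, models, coeffs):
    """Linearly interpolate the word probability across the language models."""
    return sum(coeffs[name] * models[name].get(word, 0.0) for name in models)


# Toy unigram models tagged by topic; the relevance values are made up for illustration.
models = {
    "news": {"goal": 0.1, "election": 0.4},
    "sports": {"goal": 0.5, "election": 0.0},
}
coeffs = combination_coefficients({"news": 3.0, "sports": 1.0})
p_goal = combined_probability("goal", models, coeffs)
```

Because only the coefficients change when the topic shifts, the combined model can be updated without retraining the individual language models.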
