METHOD FOR CLASSIFYING AUDIO DATA
    1.
    发明申请
    METHOD FOR CLASSIFYING AUDIO DATA 有权
    分类音频数据的方法

    公开(公告)号:US20090069914A1

    公开(公告)日:2009-03-12

    申请号:US11908944

    申请日:2006-03-15

    IPC分类号: G06F17/00

    摘要: A method for classifying audio data. For a given piece of audio data a location or position for the given audio data within a mood space is generated and compared to a comparison mood space location. As a result of the comparison, comparison data are generated and provided as a classification result with respect to the given audio data.

    摘要翻译: 一种分类音频数据的方法。 对于给定的音频数据,产生在心情空间内的给定音频数据的位置或位置,并将其与比较情绪空间位置进行比较。 作为比较的结果,生成比较数据并将其作为关于给定音频数据的分类结果提供。

    Signal variation feature based confidence measure
    2.
    发明授权
    Signal variation feature based confidence measure 失效
    基于信号变化特征的置信度量度

    公开(公告)号:US07292981B2

    公开(公告)日:2007-11-06

    申请号:US10957816

    申请日:2004-10-04

    IPC分类号: G10L25/14

    CPC分类号: G10L15/08 G10L15/02 G10L15/20

    摘要: A method for predicting a misrecognition in a speech recognition system, is based on; the insight that variations in a speech input signal are different depending on the origin of the signal being a speech or a non-speech event. The method comprises steps for receiving a speech input signal, extracting at least one signal variation feature of the speech input signal, and applying a signal variation meter to the speech input signal for deriving a signal variation measure.

    摘要翻译: 一种用于预测语音识别系统中的误识别的方法是基于的; 根据作为语音或非语音事件的信号的原点,语音输入信号的变化是不同的。 该方法包括步骤:接收语音输入信号,提取语音输入信号的至少一个信号变化特征,以及将信号变化仪应用于语音输入信号以导出信号变化度量。

    Signal variation feature based confidence measure
    3.
    发明申请
    Signal variation feature based confidence measure 失效
    基于信号变化特征的置信度量度

    公开(公告)号:US20050114135A1

    公开(公告)日:2005-05-26

    申请号:US10957816

    申请日:2004-10-04

    CPC分类号: G10L15/08 G10L15/02 G10L15/20

    摘要: Based on the insight that variations in a speech input signal are different depending on the origin of the signal being a speech or a non-speech event, the present invention proposes method for predicting a misrecognition in a speech recognition system with steps for receiving a speech input signal, extracting at least one signal variation feature of the speech input signal, and applying a signal variation meter to the speech input signal for deriving a signal variation measure.

    摘要翻译: 基于语音输入信号的变化根据作为语音或非语音事件的信号的原点而不同的观点,本发明提出了一种用于在具有用于接收语音的步骤的语音识别系统中预测误识别的方法 输入信号,提取所述语音输入信号的至少一个信号变化特征,以及将信号变化计应用于所述语音输入信号以导出信号变化度量。

    Apparatus and method for automatic dissection of segmented audio signals
    4.
    发明授权
    Apparatus and method for automatic dissection of segmented audio signals 失效
    分段音频信号自动解剖的装置和方法

    公开(公告)号:US07962330B2

    公开(公告)日:2011-06-14

    申请号:US10985451

    申请日:2004-11-10

    摘要: An apparatus for automatic dissection of segmented audio signals, wherein at least one information signal for identifying programs included in said audio signals and for identifying contents included in said programs. Content detection device detects programs and contents belonging to the respective programs in the information signal. Program weighting device weights each program includes in the information signal based on the contents of the respective program detected by the content detection device. Program ranking device indentifies programmers of the same category and ranking said programs based on a weighting result for each program provided by the program weighting device.

    摘要翻译: 一种用于自动解剖分段音频信号的装置,其中至少一个信息信号用于识别包括在所述音频信号中的节目,并用于识别包括在所述节目中的内容。 内容检测装置检测信息信号中属于各个节目的节目和内容。 每个程序的程序加权设备权重包括在基于由内容检测设备检测到的相应程序的内容的信息信号中。 程序排名装置识别相同类别的程序员,并且基于由程序加权装置提供的每个程序的加权结果对所述程序进行排序。

    Apparatus and method for automatic extraction of important events in audio signals
    5.
    发明授权
    Apparatus and method for automatic extraction of important events in audio signals 失效
    自动提取音频信号中重要事件的装置和方法

    公开(公告)号:US08635065B2

    公开(公告)日:2014-01-21

    申请号:US10985446

    申请日:2004-11-10

    CPC分类号: G10L25/00 G10L15/00 G10L17/26

    摘要: The present invention discloses an apparatus for automatic extraction of important events in audio signals comprising: signal input means for supplying audio signals; audio signal fragmenting means for partitioning audio signals supplied by the signal input means into audio fragments of a predetermined length and for allocating a sequence of one or more audio fragments to a respective audio window; feature extracting means for analyzing acoustic characteristics of the audio signals comprised in the audio fragments and for analyzing acoustic characteristics of the audio signals comprised in the audio windows; and important event extraction means for extracting important events in audio signals supplied by the audio signal fragmenting means based on predetermined important event classifying rules depending on acoustic characteristics of the audio signals comprised in the audio fragments and on acoustic characteristics of the audio signals comprised in the audio windows, wherein each important event extracted by the important event extraction means comprises a discrete sequence of cohesive audio fragments corresponding to an important event included in the audio signals.

    摘要翻译: 本发明公开了一种用于自动提取音频信号中的重要事件的装置,包括:用于提供音频信号的信号输入装置; 用于将由信号输入装置提供的音频信号划分成预定长度的音频片段并用于将一个或多个音频片段的序列分配到相应音频窗口的音频信号分段装置; 特征提取装置,用于分析包含在音频片段中的音频信号的声学特性并分析包含在音频窗口中的音频信号的声学特性; 以及重要事件提取装置,用于根据包含在音频片段中的音频信号的声学特性以及包含在音频片段中的音频信号的声学特性,基于预定的重要事件分类规则,提取由音频信号分段装置提供的音频信号中的重要事件。 音频窗口,其中由重要事件提取装置提取的每个重要事件包括对应于包括在音频信号中的重要事件的粘性音频片段的离散序列。

    Identification of the presence of speech in digital audio data
    6.
    发明授权
    Identification of the presence of speech in digital audio data 失效
    识别数字音频数据中语音的存在

    公开(公告)号:US08036884B2

    公开(公告)日:2011-10-11

    申请号:US11065555

    申请日:2005-02-24

    IPC分类号: G10L19/14

    CPC分类号: G10L25/78 G10H2210/046

    摘要: The present invention provides a method, a computer-software-product and an apparatus for enabling a determination of speech related audio data within a record of digital audio data. The method comprises steps for extracting audio features from the record of digital audio data, for classifying one or more subsections of the record of digital audio data, and for marking at least a part of the record of digital audio data classified as speech. The classification of the digital audio data record is performed on the basis of the extracted audio features and with respect to at least one predetermined audio class. The extraction of the at least one audio feature as used by a method according to the invention comprises steps for partitioning the record of digital audio data into adjoining frames, defining a window for each frame which is formed by a sequence of adjoining frames containing the frame under consideration, determining for the frame under consideration and at least one further frame of the window a spectral-emphasis-value which is related to the frequency distribution contained in the digital audio data of the respective frame, and assigning a presence-of-speech indicator value to the frame under consideration based on an evaluation of the differences between the spectral-emphasis-values determined for the frame under consideration and at least one further frame of the window.

    摘要翻译: 本发明提供一种能够确定数字音频数据的记录内的语音相关音频数据的方法,计算机软件产品和装置。 该方法包括从数字音频数据的记录中提取音频特征的步骤,用于分类数字音频数据的记录的一个或多个子部分,以及标记被分类为语音的数字音频数据的记录的至少一部分。 数字音频数据记录的分类是根据所提取的音频特征和关于至少一个预定音频类进行的。 如根据本发明的方法使用的至少一个音频特征的提取包括将数字音频数据的记录划分成相邻帧的步骤,为由包含帧的相邻帧的序列形成的每个帧定义窗口 在考虑下,确定正在考虑的帧和窗口的至少一个另外的帧,与包含在各个帧的数字音频数据中的频率分布相关的频谱强调值,并且分配语音演讲 基于对所考虑的帧确定的频谱强调值与窗口的至少一个另外的帧之间的差异的评估,对正在考虑的帧的指示符值进行评估。

    Apparatus and method for segmentation of audio data into meta patterns
    7.
    发明授权
    Apparatus and method for segmentation of audio data into meta patterns 失效
    将音频数据分割为元模式的装置和方法

    公开(公告)号:US07680654B2

    公开(公告)日:2010-03-16

    申请号:US10985615

    申请日:2004-11-10

    IPC分类号: G10L15/00 G10L15/20

    CPC分类号: G10L25/00

    摘要: An audio data segmentation apparatus for segmenting of audio data including for supplying audio data, dividing the audio data supplied into audio clips of a predetermined length, discriminating the audio clips into predetermined audio classes, the audio classes identifying a kind of audio data included in the respective audio clip and segmenting for segmenting the audio data into audio meta patterns based on a sequence of audio classes of consecutive audio clips, each meta pattern being allocated to a predetermined type of contents of the audio data. It is difficult to achieve good results with known methods for segmentation of audio data into meta patterns since the rules for the allocation of the meta patterns are dissatisfying. This problem is solved by the inventive audio data segmentation apparatus further including a program database including program data units to identify a certain kind of program, a plurality of respective audio meta patterns being allocated to each program data unit, wherein the segmenting segments the audio data into corresponding audio meta patterns on the basis of the program data units of the program database 5.

    摘要翻译: 一种用于分割音频数据的音频数据分割装置,包括用于提供音频数据,将提供的音频数据分成预定长度的音频剪辑,将音频剪辑识别为预定音频类别,识别包括在音频数据中的音频数据的种类的音频类别 相应的音频剪辑和分段,用于基于连续音频剪辑的音频类别序列将音频数据分割成音频元模式,每个元模式被分配给音频数据的预定类型的内容。 由于元模式的分配规则不满意,因此将音频数据分割为元模式的已知方法难以获得良好的结果。 本发明的音频数据分割装置还包括程序数据库,该程序数据库包括用于识别特定类型的程序的程序数据单元,分配给每个程序数据单元的多个各自的音频元模式,其中分段将音频数据分段 基于程序数据库5的程序数据单元转换成对应的音频元模式。

    Method for classifying audio data
    8.
    发明授权
    Method for classifying audio data 有权
    音频数据分类方法

    公开(公告)号:US08170702B2

    公开(公告)日:2012-05-01

    申请号:US11908944

    申请日:2006-03-15

    IPC分类号: G06F17/00

    摘要: A method for classifying audio data. For a given piece of audio data a location or position for the given audio data within a mood space is generated and compared to a comparison mood space location. As a result of the comparison, comparison data are generated and provided as a classification result with respect to the given audio data.

    摘要翻译: 一种分类音频数据的方法。 对于给定的音频数据,产生在心情空间内的给定音频数据的位置或位置,并将其与比较情绪空间位置进行比较。 作为比较的结果,生成比较数据并将其作为关于给定音频数据的分类结果提供。

    Method for recognizing speech
    9.
    发明授权
    Method for recognizing speech 失效
    识别语音的方法

    公开(公告)号:US07752044B2

    公开(公告)日:2010-07-06

    申请号:US10683495

    申请日:2003-10-10

    IPC分类号: G10L15/00 G10L15/04

    CPC分类号: G10L15/08 G10L2015/025

    摘要: To increase the robustness and/or the recognition rate of methods for recognizing speech it is proposed to include phone boundary verification measure features in the process of obtaining and/or generating confidence measures obtained recognition results.

    摘要翻译: 为了增加用于识别语音的方法的鲁棒性和/或识别率,提出在获得和/或产生置信度量获得的识别结果的过程中包括电话边界验证测度特征。