Apparatus and method for segmentation of audio data into meta patterns
    1.
    发明授权
    Apparatus and method for segmentation of audio data into meta patterns 失效
    将音频数据分割为元模式的装置和方法

    公开(公告)号:US07680654B2

    公开(公告)日:2010-03-16

    申请号:US10985615

    申请日:2004-11-10

    IPC分类号: G10L15/00 G10L15/20

    CPC分类号: G10L25/00

    摘要: An audio data segmentation apparatus for segmenting of audio data including for supplying audio data, dividing the audio data supplied into audio clips of a predetermined length, discriminating the audio clips into predetermined audio classes, the audio classes identifying a kind of audio data included in the respective audio clip and segmenting for segmenting the audio data into audio meta patterns based on a sequence of audio classes of consecutive audio clips, each meta pattern being allocated to a predetermined type of contents of the audio data. It is difficult to achieve good results with known methods for segmentation of audio data into meta patterns since the rules for the allocation of the meta patterns are dissatisfying. This problem is solved by the inventive audio data segmentation apparatus further including a program database including program data units to identify a certain kind of program, a plurality of respective audio meta patterns being allocated to each program data unit, wherein the segmenting segments the audio data into corresponding audio meta patterns on the basis of the program data units of the program database 5.

    摘要翻译: 一种用于分割音频数据的音频数据分割装置,包括用于提供音频数据,将提供的音频数据分成预定长度的音频剪辑,将音频剪辑识别为预定音频类别,识别包括在音频数据中的音频数据的种类的音频类别 相应的音频剪辑和分段,用于基于连续音频剪辑的音频类别序列将音频数据分割成音频元模式,每个元模式被分配给音频数据的预定类型的内容。 由于元模式的分配规则不满意,因此将音频数据分割为元模式的已知方法难以获得良好的结果。 本发明的音频数据分割装置还包括程序数据库,该程序数据库包括用于识别特定类型的程序的程序数据单元,分配给每个程序数据单元的多个各自的音频元模式,其中分段将音频数据分段 基于程序数据库5的程序数据单元转换成对应的音频元模式。

    Apparatus and method for automatic extraction of important events in audio signals
    2.
    发明申请
    Apparatus and method for automatic extraction of important events in audio signals 失效
    自动提取音频信号中重要事件的装置和方法

    公开(公告)号:US20050102135A1

    公开(公告)日:2005-05-12

    申请号:US10985446

    申请日:2004-11-10

    CPC分类号: G10L25/00 G10L15/00 G10L17/26

    摘要: The present invention discloses an apparatus for automatic extraction of important events in audio signals comprising: signal input means for supplying audio signals; audio signal fragmenting means for partitioning audio signals supplied by the signal input means into audio fragments of a predetermined length and for allocating a sequence of one or more audio fragments to a respective audio window; feature extracting means for analysing acoustic characteristics of the audio signals comprised in the audio fragments and for analysing acoustic characteristics of the audio signals comprised in the audio windows; and important event extraction means for extracting important events in audio signals supplied by the audio signal fragmenting means based on predetermined important event classifying rules depending on acoustic characteristics of the audio signals comprised in the audio fragments and on acoustic characteristics of the audio signals comprised in the audio windows, wherein each important event extracted by the important event extraction means comprises a discrete sequence of cohesive audio fragments corresponding to an important event included in the audio signals.

    摘要翻译: 本发明公开了一种用于自动提取音频信号中的重要事件的装置,包括:用于提供音频信号的信号输入装置; 用于将由信号输入装置提供的音频信号划分成预定长度的音频片段并用于将一个或多个音频片段的序列分配到相应音频窗口的音频信号分段装置; 特征提取装置,用于分析包含在音频片段中的音频信号的声学特性并分析包含在音频窗口中的音频信号的声学特性; 以及重要事件提取装置,用于根据包含在音频片段中的音频信号的声学特性以及包含在音频片段中的音频信号的声学特性,基于预定的重要事件分类规则,提取由音频信号分段装置提供的音频信号中的重要事件。 音频窗口,其中由重要事件提取装置提取的每个重要事件包括对应于包括在音频信号中的重要事件的粘性音频片段的离散序列。

    Apparatus and method for automatic dissection of segmented audio signals
    3.
    发明授权
    Apparatus and method for automatic dissection of segmented audio signals 失效
    分段音频信号自动解剖的装置和方法

    公开(公告)号:US07962330B2

    公开(公告)日:2011-06-14

    申请号:US10985451

    申请日:2004-11-10

    摘要: An apparatus for automatic dissection of segmented audio signals, wherein at least one information signal for identifying programs included in said audio signals and for identifying contents included in said programs. Content detection device detects programs and contents belonging to the respective programs in the information signal. Program weighting device weights each program includes in the information signal based on the contents of the respective program detected by the content detection device. Program ranking device indentifies programmers of the same category and ranking said programs based on a weighting result for each program provided by the program weighting device.

    摘要翻译: 一种用于自动解剖分段音频信号的装置,其中至少一个信息信号用于识别包括在所述音频信号中的节目,并用于识别包括在所述节目中的内容。 内容检测装置检测信息信号中属于各个节目的节目和内容。 每个程序的程序加权设备权重包括在基于由内容检测设备检测到的相应程序的内容的信息信号中。 程序排名装置识别相同类别的程序员,并且基于由程序加权装置提供的每个程序的加权结果对所述程序进行排序。

    Apparatus and method for classifying an audio signal
    5.
    发明申请
    Apparatus and method for classifying an audio signal 审中-公开
    对音频信号进行分类的装置和方法

    公开(公告)号:US20050131688A1

    公开(公告)日:2005-06-16

    申请号:US10985295

    申请日:2004-11-10

    摘要: An apparatus for classifying audio signals comprises audio signal clipping means for partitioning audio signals into audio clips, and class discrimination means for discriminating the audio clips provided by the audio signal clipping means into predetermined audio classes based on predetermined audio class classifying rules, by analysing acoustic characteristics of the audio signals comprised in the audio clips, wherein a predetermined audio class classifying rule is provided for each audio class, and each audio class represents a respective kind of audio signals comprised in the corresponding audio clip. The determination process to find acceptable audio class classifying rules for each audio class according to the prior art is depending on both the used raw audio signals and the personal experience of the person conducting the determination process. Thus, the determination process usually is very difficult, time consuming and subjective. Furthermore, there is a high risk that not all possible peculiarities of the different programmes and the different categories the audio signal can belong to is sufficiently accounted for. This problem is solved in the inventive apparatus for classifying audio signals by class discrimination means calculating an audio class confidence value for each audio class assigned to an audio clip, wherein the audio class confidence value indicates the likelihood the respective audio class characterises the respective kind of audio signals comprised in the respective audio clip correctly. Furthermore, the class discrimination means use acoustic characteristics of audio clips of audio classes having a high audio class confidence value to train the respective audio class classifying rule.

    摘要翻译: 用于对音频信号进行分类的装置包括用于将音频信号分割成音频剪辑​​的音频信号限幅装置,以及用于基于预定音频类别分类规则,将由音频信号限幅装置提供的音频片段识别为预定音频类别的类别鉴别装置,通过分析声音 包括在音频剪辑中的音频信号的特征,其中为每个音频类提供预定音频类别分类规则,并且每个音频类表示包括在相应的音频剪辑中的各种音频信号。 根据现有技术,为每个音频类找到可接受的音频类别分类规则的确定过程取决于所使用的原始音频信号和进行确定处理的人的个人经历。 因此,确定过程通常是非常困难,耗时和主观的。 此外,充分考虑到不同于音频信号可能属于不同节目和不同类别的所有可能的特征的高风险。 在本发明的用于对音频信号进行分类的设备中解决了这个问题,该类别鉴别装置为分配给音频剪辑的每个音频类别计算音频类别置信度值,其中,音频类别置信度值表示各个音频类别 包含在相应音频剪辑中的音频信号正确。 此外,等级识别装置使用具有高音频类置信度值的音频类别的音频剪辑的声学特性来训练各个音频类别分类规则。

    Methods to create a user profile and to specify a suggestion for a next selection of a user
    6.
    发明授权
    Methods to create a user profile and to specify a suggestion for a next selection of a user 失效
    创建用户简档并指定下一个用户选择的建议的方法

    公开(公告)号:US07970762B2

    公开(公告)日:2011-06-28

    申请号:US12507574

    申请日:2009-07-22

    IPC分类号: G06F17/30

    摘要: A user profile and/or the suggestions computed based thereon are obtained taking a special set of user features into account. The user features are defined to represent a typical general behaviour of an individual user in respect to the application where the user profile is used. In other words, for each application where a user profile is used a special set of user features are defined which are able to represent a typical general behaviour of an individual user. Based on these user features the weights in the list of word-weight pairs or weighted keywords which represents the user profile are computed or influenced during the creation of the user profile, and/or a mufti-user profile is split during the creation of an individual user profile from a mufti-user profile, and/or during specification of a suggestion a user history which is used to create the user profile, and/or the user profile, and/or the suggestion results are filtered.

    摘要翻译: 考虑到用户特征集合的用户简档和/或基于其计算的建议。 用户特征被定义为表示个人用户相对于使用用户简档的应用的典型一般行为。 换句话说,对于使用用户简档的每个应用程序,定义了能够表示单个用户的典型一般行为的一组特定用户特征。 基于这些用户特征,在创建用户简档期间计算或影响表示用户简档的单词权重对或加权关键词列表中的权重,和/或在创建用户简档期间分割多个用户简档。 和/或在建议的指定期间,用于创建用户简档的用户历史记录和/或用户简档和/或建议结果的个人用户简档被过滤。

    Method for recognizing speech using eigenpronunciations

    公开(公告)号:US07113908B2

    公开(公告)日:2006-09-26

    申请号:US10090861

    申请日:2002-03-05

    IPC分类号: G10L15/06

    CPC分类号: G10L15/07

    摘要: To increase the recognition rate and quality in a process of recognizing speech an approximative set of pronunciation rules (APR) for a current pronunciation (CP) of a current speaker is determined in a given pronunciation space (PS) and then applied to a current pronunciation lexicon (CL) so as to perform a speaker specific adaptation of said current lexicon (CL).

    Apparatus and method for automatic dissection of segmented audio signals
    8.
    发明申请
    Apparatus and method for automatic dissection of segmented audio signals 失效
    分段音频信号自动解剖的装置和方法

    公开(公告)号:US20050160449A1

    公开(公告)日:2005-07-21

    申请号:US10985451

    申请日:2004-11-10

    IPC分类号: G10L15/00 G10L25/00 H04N7/16

    摘要: Apparatus and method for automatic dissection of segmented audio signals According to the present invention, an apparatus for automatic dissection of segmented audio signals, wherein at least one information signal for identifying programmes included in said audio signals and for identifying contents included in said programmes is provided, comprises: content detection means for detecting programmes and contents belonging to the respective programmes in the information signal; programme weighting means for weighting each programme comprised in the information signal based on the contents of the respective programme detected by the content detection means; and programme ranking means for identifying programmes of the same category and ranking said programmes based on a weighting result for each programme provided by the programme weighting means.

    摘要翻译: 用于自动解剖分段音频信号的装置和方法根据本发明,一种用于自动解剖分段音频信号的装置,其中提供用于识别包括在所述音频信号中的节目和用于识别包括在所述节目中的内容的至少一个信息信号 包括:内容检测装置,用于在信息信号中检测属于各个节目的节目和内容; 程序加权装置,用于基于由内容检测装置检测的相应程序的内容对包含在信息信号中的每个节目进行加权; 以及用于识别相同类别的节目并且基于由节目加权装置提供的每个节目的加权结果对所述节目进行排名的节目排名装置。

    Apparatus and method for segmentation of audio data into meta patterns
    9.
    发明申请
    Apparatus and method for segmentation of audio data into meta patterns 失效
    将音频数据分割为元模式的装置和方法

    公开(公告)号:US20050114388A1

    公开(公告)日:2005-05-26

    申请号:US10985615

    申请日:2004-11-10

    IPC分类号: G10L25/00 G06F17/00

    CPC分类号: G10L25/00

    摘要: An audio data segmentation apparatus for segmenting of audio data comprises audio data input means for supplying audio data, audio data clipping means for dividing the audio data supplied by the audio data input means into audio clips of a predetermined length, class discrimination means for discriminating the audio clips supplied by the audio data clipping means into predetermined audio classes, the audio classes identifying a kind of audio data included in the respective audio clip and segmenting means for segmenting the audio data into audio meta patterns based on a sequence of audio classes of consecutive audio clips, each meta pattern being allocated to a predetermined type of contents of the audio data. It is difficult to achieve good results with known methods for segmentation of audio data into meta patterns since the rules for the allocation of the meta patterns are dissatisfying. This problem is solved by the inventive audio data segmentation apparatus further comprising a programme database comprising programme data units to identify a certain kind of programme, a plurality of respective audio meta patterns being allocated to each programme data unit, wherein the segmenting means segments the audio data into corresponding audio meta patterns on the basis of the programme data units of the programme database 5.

    摘要翻译: 用于分割音频数据的音频数据分割装置包括用于提供音频数据的音频数据输入装置,用于将由音频数据输入装置提供的音频数据分割成预定长度的音频剪辑的音频数据剪辑装置,用于鉴别 由音频数据剪辑装置提供的音频剪辑成预定的音频类别,音频类别标识包括在各个音频剪辑中的一种音频数据,以及分割装置,用于基于连续的音频类别序列将音频数据分割成音频元模式 音频剪辑,每个元模式被分配给音频数据的预定类型的内容。 由于元模式的分配规则不满意,因此将音频数据分割为元模式的已知方法难以获得良好的结果。 本发明的音频数据分割装置还包括程序数据库,该程序数据库包括用于识别某种程序的程序数据单元,分配给每个程序数据单元的多个各自的音频元模式,其中分段装置分割音频 基于程序数据库5的程序数据单元将数据转换成相应的音频元模式。

    Speech recognition control of remotely controllable devices in a home network environment
    10.
    发明授权
    Speech recognition control of remotely controllable devices in a home network environment 有权
    家庭网络环境中遥控设备的语音识别控制

    公开(公告)号:US06535854B2

    公开(公告)日:2003-03-18

    申请号:US09175382

    申请日:1998-10-19

    IPC分类号: G10L1522

    摘要: Home networks low-cost digital interfaces are introduced that integrate entertainment, communication and computing electronics into consumer multimedia. Normally, these are low-cost, easy to use systems, since they allow the user to remove or add any kind of network devices with the bus being active. To improve the user interface a speech unit (2) is proposed that enables all devices (11) connected to the bus system (31) to be controlled by a single speech recognition device. The properties of this device, e.g. the vocabulary can be dynamically and actively extended by the consumer devices (11) connected to the bus system (31). The proposed technology is independent from a specific bus standard, e.g. the IEEE 1394 standard, and is well-suited for all kinds of wired wireless home networks. The speech unit (2) receives data and messages from the device. The speech unit (2) recognizes speaker-dependent commands. A Speech synthesizer synthesizes messages. A remotely controllable device (11) has access to a medium which may be a CD-ROM. The device may ask for a logical name or identifier.

    摘要翻译: 家庭网络引入了低成本的数字接口,将娱乐,通信和计算电子整合到消费者多媒体中。 通常,这些是低成本,易于使用的系统,因为它们允许用户去除或添加任何类型的网络设备,其中总线是活动的。 为了改善用户接口,提出了一种语音单元(2),其使得能够通过单个语音识别设备来控制连接到总线系统(31)的所有设备(11)。 该装置的特性,例如 可以通过连接到总线系统(31)的消费者设备(11)来动态地和主动地扩展词汇。 所提出的技术独立于特定总线标准,例如。 IEEE 1394标准,非常适用于各种有线无线家庭网络。 语音单元(2)从设备接收数据和消息。 语音单元(2)识别与扬声器相关的命令。 语音合成器综合消息。 远程可控设备(11)可以访问可以是CD-ROM的介质。 该设备可能会要求一个逻辑名称或标识符。