Speaker and call characteristic sensitive open voice search
    1.
    发明授权
    Speaker and call characteristic sensitive open voice search 有权
    扬声器和呼叫特性敏感开放语音搜索

    公开(公告)号:US08630860B1

    公开(公告)日:2014-01-14

    申请号:US13039467

    申请日:2011-03-03

    摘要: Techniques disclosed herein include systems and methods for open-domain voice-enabled searching that is speaker sensitive. Techniques include using speech information, speaker information, and information associated with a spoken query to enhance open voice search results. This includes integrating a textual index with a voice index to support the entire search cycle. Given a voice query, the system can execute two matching processes simultaneously. This can include a text matching process based on the output of speech recognition, as well as a voice matching process based on characteristics of a caller or user voicing a query. Characteristics of the caller can include output of voice feature extraction and metadata about the call. The system clusters callers according to these characteristics. The system can use specific voice and text clusters to modify speech recognition results, as well as modifying search results.

    摘要翻译: 本文公开的技术包括用于开放式语音启用搜索的扬声器敏感的系统和方法。 技术包括使用语音信息,说话者信息和与口语查询相关联的信息来增强开放语音搜索结果。 这包括将文本索引与语音索引集成以支持整个搜索周期。 给定语音查询,系统可以同时执行两个匹配过程。 这可以包括基于语音识别输出的文本匹配过程,以及基于呼叫者或用户发出查询的特征的语音匹配过程。 呼叫者的特征可以包括语音特征提取的输出和关于呼叫的元数据。 根据这些特点,系统集群呼叫者。 系统可以使用特定的语音和文本集群来修改语音识别结果,以及修改搜索结果。

    SPEAKER AND CALL CHARACTERISTIC SENSITIVE OPEN VOICE SEARCH
    2.
    发明申请
    SPEAKER AND CALL CHARACTERISTIC SENSITIVE OPEN VOICE SEARCH 有权
    扬声器和呼叫特征敏感开放语音搜索

    公开(公告)号:US20140129220A1

    公开(公告)日:2014-05-08

    申请号:US14152136

    申请日:2014-01-10

    IPC分类号: G10L15/26

    摘要: Techniques disclosed herein include systems and methods for open-domain voice-enabled searching that is speaker sensitive. Techniques include using speech information, speaker information, and information associated with a spoken query to enhance open voice search results. This includes integrating a textual index with a voice index to support the entire search cycle. Given a voice query, the system can execute two matching processes simultaneously. This can include a text matching process based on the output of speech recognition, as well as a voice matching process based on characteristics of a caller or user voicing a query. Characteristics of the caller can include output of voice feature extraction and metadata about the call. The system clusters callers according to these characteristics. The system can use specific voice and text clusters to modify speech recognition results, as well as modifying search results.

    摘要翻译: 本文公开的技术包括用于开放式语音启用搜索的扬声器敏感的系统和方法。 技术包括使用语音信息,说话者信息和与口语查询相关联的信息来增强开放语音搜索结果。 这包括将文本索引与语音索引集成以支持整个搜索周期。 给定语音查询,系统可以同时执行两个匹配过程。 这可以包括基于语音识别输出的文本匹配过程,以及基于呼叫者或用户发出查询的特征的语音匹配过程。 呼叫者的特征可以包括语音特征提取的输出和关于呼叫的元数据。 根据这些特点,系统集群呼叫者。 系统可以使用特定的语音和文本集群来修改语音识别结果,以及修改搜索结果。

    Method and apparatus for processing spoken search queries
    4.
    发明授权
    Method and apparatus for processing spoken search queries 有权
    用于处理口语搜索查询的方法和装置

    公开(公告)号:US08666963B2

    公开(公告)日:2014-03-04

    申请号:US13527500

    申请日:2012-06-19

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30976

    摘要: Some embodiments relate to a method of performing a search for content on the Internet, in which a user may speak a search query and speech recognition may be performed on the spoken query to generate a text search query to be provided to a plurality of search engines. This enables a user to speak the search query rather than having to type it, and also allows the user to provide the search query only once, rather than having to provide it separately to multiple different search engines.

    摘要翻译: 一些实施例涉及对互联网上的内容执行搜索的方法,其中用户可以在其中说出搜索查询,并且可以在口头查询上执行语音识别以生成要提供给多个搜索引擎的文本搜索查询 。 这使得用户能够说出搜索查询而不必输入搜索查询,并且还允许用户仅提供一次搜索查询,而不必单独提供给多个不同的搜索引擎。

    SPEAKER VERIFICATION METHODS AND APPARATUS
    5.
    发明申请
    SPEAKER VERIFICATION METHODS AND APPARATUS 有权
    扬声器验证方法和设备

    公开(公告)号:US20120239398A1

    公开(公告)日:2012-09-20

    申请号:US13442170

    申请日:2012-04-09

    IPC分类号: G10L17/00

    CPC分类号: G10L17/24 G10L17/04 G10L17/20

    摘要: In one aspect, a method for determining a validity of an identity asserted by a speaker using a voice print is provided. The method comprises acts of performing a first verification stage comprising comparing a first voice signal from the speaker uttering at least one first challenge utterance-with at least a portion of the voice print and performing a second verification stage if it is concluded in the first verification stage that the first voice signal was obtained from an utterance by the user. The second verification stage comprises adapting at least one parameter of the voice print based, at least in part, on the first voice signal to obtain an adapted voice print, and comparing a second voice signal from the speaker uttering at least one second challenge utterance with at least a portion of the adapted voice print.

    摘要翻译: 在一方面,提供了一种用于确定由使用语音打印的扬声器所确定的身份的有效性的方法。 该方法包括执行第一验证阶段的动作,包括将来自扬声器的第一语音信号与至少一个第一挑战话语 - 与语音打印的至少一部分进行比较,并且如果在第一验证中得出结论,则执行第二验证阶段 第一语音信号是由用户的话语获得的。 第二验证阶段包括至少部分地基于第一语音信号来调整语音印刷的至少一个参数以获得适应的语音印刷,并且将来自扬声器的第二语音信号与至少一个第二挑战话语与 至少一部分适应的语音打印。

    Method and apparatus for processing spoken search queries
    6.
    发明授权
    Method and apparatus for processing spoken search queries 有权
    用于处理口语搜索查询的方法和装置

    公开(公告)号:US08239366B2

    公开(公告)日:2012-08-07

    申请号:US12877549

    申请日:2010-09-08

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30976

    摘要: Some embodiments relate to a method of performing a search for content on the Internet, in which a user may speak a search query and speech recognition may be performed on the spoken query to generate a text search query to be provided to a plurality of search engines. This enables a user to speak the search query rather than having to type it, and also allows the user to provide the search query only once, rather than having to provide it separately to multiple different search engines.

    摘要翻译: 一些实施例涉及对互联网上的内容执行搜索的方法,其中用户可以在其中说出搜索查询,并且可以在口头查询上执行语音识别以生成要提供给多个搜索引擎的文本搜索查询 。 这使得用户能够说出搜索查询而不必输入搜索查询,并且还允许用户仅提供一次搜索查询,而不必单独提供给多个不同的搜索引擎。

    System and method for modeless large vocabulary speech recognition
    7.
    发明授权
    System and method for modeless large vocabulary speech recognition 有权
    无模式大词汇语音识别的系统和方法

    公开(公告)号:US06292779B1

    公开(公告)日:2001-09-18

    申请号:US09267925

    申请日:1999-03-09

    IPC分类号: G10L1514

    摘要: A modeless large vocabulary continuous speech recognition system is provided that represents an input utterance as a sequence of input vectors. The system includes a common library of acoustic model states for arrangement in sequences that form acoustic models. Each acoustic model is composed of a sequence of segment models and each segment model is composed of a sequence of model states. An input processor compares each vector in a sequence of input vectors to a set of model states in the common library to produce a match score for each model state in the set, reflecting the likelihood that a state is represented by a vector. The system also includes a plurality of recognition modules and associated recognition grammars. The recognition modules operate in parallel and use the match scores with the acoustic models to determine at least one recognition result in each of the recognition modules. The recognition modules includes a dictation module for producing at least one probable dictation recognition result, a select module for recognizing a portion of visually displayed text for processing with a command, and a command module for producing at least one probable command recognition result. An arbitrator uses an arbitration algorithm and a score ordered queue of recognition results, together with their associated recognition modules, to compare the recognition results of the recognition modules to select at least one system recognition result.

    摘要翻译: 提供了一种无噪声大词汇连续语音识别系统,其表示输入语音作为输入向量的序列。 该系统包括用于形成声学模型的序列中布置的声学模型状态的公共库。 每个声学模型由段模型序列组成,每个分段模型由一系列模型状态组成。 输入处理器将输入向量序列中的每个向量与公共库中的一组模型状态进行比较,以产生该集合中每个模型状态的匹配分数,反映状态由向量表示的可能性。 该系统还包括多个识别模块和相关联的识别语法。 识别模块并行运行,并使用声学模型的匹配分数来确定每个识别模块中的至少一个识别结果。 识别模块包括用于产生至少一个可能的听写识别结果的听写模块,用于识别用于用命令处理的视觉显示文本的一部分的选择模块,以及用于产生至少一个可能的命令识别结果的命令模块。 仲裁员使用仲裁算法和识别结果的得分排序队列及其相关联的识别模块来比较识别模块的识别结果以选择至少一个系统识别结果。

    Detecting potential significant errors in speech recognition results
    8.
    发明授权
    Detecting potential significant errors in speech recognition results 有权
    检测语音识别结果中潜在的重大错误

    公开(公告)号:US09064493B2

    公开(公告)日:2015-06-23

    申请号:US13544279

    申请日:2012-07-09

    IPC分类号: G10L15/22 G10L15/18 G10L15/08

    摘要: In some embodiments, the recognition results produced by a speech processing system (which may include a top recognition result and one or more alternative recognition results) based on an analysis of a speech input, are evaluated for indications of potential significant errors. In some embodiments, the recognition results may be evaluated to determine whether a meaning of any of the alternative recognition results differs from a meaning of the top recognition result in a manner that is significant for the domain. In some embodiments, one or more of the recognition results may be evaluated to determine whether the result(s) include one or more words or phrases that, when included in a result, would change a meaning of the result in a manner that would be significant for the domain.

    摘要翻译: 在一些实施例中,基于语音输入的分析,由语音处理系统(其可以包括顶部识别结果和一个或多个替代识别结果)产生的识别结果被评估用于潜在重大错误的指示。 在一些实施例中,可以评估识别结果以确定任何替代识别结果的含义以对于域有意义的方式与顶部识别结果的含义不同。 在一些实施例中,可以评估一个或多个识别结果以确定结果是否包括一个或多个单词或短语,当被包括在结果中时,将以以下方式改变结果的含义: 对领域有重要意义

    DETECTING POTENTIAL SIGNIFICANT ERRORS IN SPEECH RECOGNITION RESULTS
    10.
    发明申请
    DETECTING POTENTIAL SIGNIFICANT ERRORS IN SPEECH RECOGNITION RESULTS 有权
    检测语音识别结果中的潜在重要错误

    公开(公告)号:US20140012580A1

    公开(公告)日:2014-01-09

    申请号:US13544279

    申请日:2012-07-09

    IPC分类号: G10L15/18

    摘要: In some embodiments, the recognition results produced by a speech processing system (which may include a top recognition result and one or more alternative recognition results) based on an analysis of a speech input, are evaluated for indications of potential significant errors. In some embodiments, the recognition results may be evaluated to determine whether a meaning of any of the alternative recognition results differs from a meaning of the top recognition result in a manner that is significant for the domain. In some embodiments, one or more of the recognition results may be evaluated to determine whether the result(s) include one or more words or phrases that, when included in a result, would change a meaning of the result in a manner that would be significant for the domain.

    摘要翻译: 在一些实施例中,基于语音输入的分析,由语音处理系统(其可以包括顶部识别结果和一个或多个替代识别结果)产生的识别结果被评估用于潜在重大错误的指示。 在一些实施例中,可以评估识别结果以确定任何替代识别结果的含义以对于域有意义的方式与顶部识别结果的含义不同。 在一些实施例中,可以评估一个或多个识别结果以确定结果是否包括一个或多个单词或短语,当被包括在结果中时,将以以下方式改变结果的含义: 对领域有重要意义