INTEGRATED LOCAL AND CLOUD BASED SPEECH RECOGNITION
    1.
    发明申请
    INTEGRATED LOCAL AND CLOUD BASED SPEECH RECOGNITION 有权
    综合本地和云的语音识别

    公开(公告)号:US20130060571A1

    公开(公告)日:2013-03-07

    申请号:US13224778

    申请日:2011-09-02

    IPC分类号: G10L15/00

    CPC分类号: G10L15/30 G06F3/011 G06F3/167

    摘要: A system for integrating local speech recognition with cloud-based speech recognition in order to provide an efficient natural user interface is described. In some embodiments, a computing device determines a direction associated with a particular person within an environment and generates an audio recording associated with the direction. The computing device then performs local speech recognition on the audio recording in order to detect a first utterance spoken by the particular person and to detect one or more keywords within the first utterance. The first utterance may be detected by applying voice activity detection techniques to the audio recording. The first utterance and the one or more keywords are subsequently transferred to a server which may identify speech sounds within the first utterance associated with the one or more keywords and adapt one or more speech recognition techniques based on the identified speech sounds.

    摘要翻译: 描述了一种用于将本地语音识别与基于云的语音识别相结合以提供有效的自然用户界面的系统。 在一些实施例中,计算设备确定与环境内的特定人员相关联的方向,并且生成与该方向相关联的音频记录。 然后,计算设备在音频记录上执行本地语音识别,以便检测特定人员所说出的第一话语,并检测第一话语内的一个或多个关键字。 可以通过将语音活动检测技术应用于音频记录来检测第一话语。 第一话语和一个或多个关键字随后被转移到可以识别与该一个或多个关键词相关联的第一话语内的语音的服务器,并且基于所识别的语音来调整一个或多个语音识别技术。

    Word-dependent language model
    2.
    发明授权
    Word-dependent language model 有权
    词依赖语言模型

    公开(公告)号:US08838449B2

    公开(公告)日:2014-09-16

    申请号:US12977461

    申请日:2010-12-23

    IPC分类号: G10L15/04 G06F17/30 G10L15/19

    CPC分类号: G10L15/19

    摘要: This document describes word-dependent language models, as well as their creation and use. A word-dependent language model can permit a speech-recognition engine to accurately verify that a speech utterance matches a multi-word phrase. This is useful in many contexts, including those where one or more letters of the expected phrase are known to the speaker.

    摘要翻译: 本文档描述了依赖于字的语言模型,以及它们的创建和使用。 一个与字相关的语言模型可以允许一个语音识别引擎准确地验证一个语音发音是否匹配一个多单词短语。 这在许多情况下是有用的,包括说话者知道预期短语的一个或多个字母的情况。

    Word-Dependent Language Model
    3.
    发明申请
    Word-Dependent Language Model 有权
    词语相关语言模型

    公开(公告)号:US20120166196A1

    公开(公告)日:2012-06-28

    申请号:US12977461

    申请日:2010-12-23

    IPC分类号: G10L15/04

    CPC分类号: G10L15/19

    摘要: This document describes word-dependent language models, as well as their creation and use. A word-dependent language model can permit a speech-recognition engine to accurately verify that a speech utterance matches a multi-word phrase. This is useful in many contexts, including those where one or more letters of the expected phrase are known to the speaker.

    摘要翻译: 本文档描述了依赖于字的语言模型,以及它们的创建和使用。 一个与字相关的语言模型可以允许一个语音识别引擎准确地验证一个语音发音是否匹配一个多单词短语。 这在许多情况下是有用的,包括说话者知道预期短语的一个或多个字母的情况。

    Integrated local and cloud based speech recognition
    4.
    发明授权
    Integrated local and cloud based speech recognition 有权
    综合本地和云的语音识别

    公开(公告)号:US08660847B2

    公开(公告)日:2014-02-25

    申请号:US13224778

    申请日:2011-09-02

    IPC分类号: G10L15/00

    CPC分类号: G10L15/30 G06F3/011 G06F3/167

    摘要: A system for integrating local speech recognition with cloud-based speech recognition in order to provide an efficient natural user interface is described. In some embodiments, a computing device determines a direction associated with a particular person within an environment and generates an audio recording associated with the direction. The computing device then performs local speech recognition on the audio recording in order to detect a first utterance spoken by the particular person and to detect one or more keywords within the first utterance. The first utterance may be detected by applying voice activity detection techniques to the audio recording. The first utterance and the one or more keywords are subsequently transferred to a server which may identify speech sounds within the first utterance associated with the one or more keywords and adapt one or more speech recognition techniques based on the identified speech sounds.

    摘要翻译: 描述了一种用于将本地语音识别与基于云的语音识别相结合以提供有效的自然用户界面的系统。 在一些实施例中,计算设备确定与环境内的特定人员相关联的方向,并且生成与该方向相关联的音频记录。 然后,计算设备在音频记录上执行本地语音识别,以便检测特定人员所说出的第一话语,并检测第一话语内的一个或多个关键字。 可以通过将语音活动检测技术应用于音频记录来检测第一话语。 第一话语和一个或多个关键字随后被转移到可以识别与该一个或多个关键词相关联的第一话语内的语音的服务器,并且基于所识别的语音来调整一个或多个语音识别技术。