Two-pass decoding for speech recognition of search and action requests
    1.
    发明授权
    Two-pass decoding for speech recognition of search and action requests 有权
    用于搜索和动作请求的语音识别的双程解码

    公开(公告)号:US08645138B1

    公开(公告)日:2014-02-04

    申请号:US13723191

    申请日:2012-12-20

    Applicant: Google Inc.

    CPC classification number: G10L15/19

    Abstract: Disclosed are apparatus and methods for processing spoken speech. Input speech can be received at a computing system. During a first pass of speech recognition, a plurality of language model outputs can be determined by: providing the input speech to each of a plurality of language models and responsively receiving a language model output from each language model. A language model of the plurality of language models can be selected using a classifier operating on the plurality of language model outputs. During a second pass of speech recognition, a revised language model output can be determined by: providing the input speech and the language model output from the selected language model to the selected language model and responsively receiving the revised language model output from the selected language model. The computing system can generate a result based on the revised language model output.

    Abstract translation: 公开了用于处理口头语音的装置和方法。 可以在计算系统处接收输入语音。 在语音识别的第一次通过期间,可以通过以下方式来确定多个语言模型输出:将输入语音提供给多个语言模型中的每一个并且响应地接收来自每个语言模型的语言模型输出。 可以使用在多个语言模型输出上操作的分类器来选择多个语言模型的语言模型。 在语音识别的第二次通过期间,可以通过以下方式来确定经修改的语言模型输出:将所选语言模型的输入语音和语言模型输出提供给所选择的语言模型,并响应于接收来自所选语言模型的修订语言模型输出 。 计算系统可以根据修订后的语言模型输出生成一个结果。

Patent Agency Ranking