Recognizing speech in multiple languages
    1.
    发明授权
    Recognizing speech in multiple languages 有权
    认识多种语言的言论

    公开(公告)号:US09129591B2

    公开(公告)日:2015-09-08

    申请号:US13726954

    申请日:2012-12-26

    Applicant: Google Inc.

    CPC classification number: G10L15/005 G10L15/183 G10L15/32

    Abstract: Speech recognition systems may perform the following operations: receiving audio; recognizing the audio using language models for different languages to produce recognition candidates for the audio, where the recognition candidates are associated with corresponding recognition scores; identifying a candidate language for the audio; selecting a recognition candidate based on the recognition scores and the candidate language; and outputting data corresponding to the selected recognition candidate as a recognized version of the audio.

    Abstract translation: 语音识别系统可以执行以下操作:接收音频; 使用不同语言的语言模型识别音频以产生用于音频的识别候选,其中识别候选与相应的识别分数相关联; 识别音频的候选语言; 基于识别分数和候选语言选择识别候选; 并输出与所选择的识别候选对应的数据作为音频的识别版本。

    Recognizing different versions of a language
    4.
    发明授权
    Recognizing different versions of a language 有权
    识别不同版本的语言

    公开(公告)号:US09275635B1

    公开(公告)日:2016-03-01

    申请号:US13672945

    申请日:2012-11-09

    Applicant: Google Inc.

    CPC classification number: G10L15/32 G10L15/183

    Abstract: Speech recognition systems may perform the following operations: receiving audio at a computing device; identifying a language associated with the audio; recognizing the audio using recognition models for different versions of the language to produce recognition candidates for the audio, where the recognition candidates are associated with corresponding information; comparing the information of the recognition candidates to identify agreement between at least two of the recognition models; selecting a recognition candidate based on information of the recognition candidate and agreement between the at least two of the recognition models; and outputting data corresponding to the selected recognition candidate as a recognized version of the audio.

    Abstract translation: 语音识别系统可以执行以下操作:在计算设备处接收音频; 识别与音频相关联的语言; 使用用于不同版本的语言的识别模型来识别音频以产生用于音频的识别候选,其中识别候选者与对应的信息相关联; 比较识别候选者的信息以识别至少两个识别模型之间的一致性; 基于所述识别候选者的信息和所述至少两个识别模型之间的一致性来选择识别候选者; 并输出与所选择的识别候选对应的数据作为音频的识别版本。

    COOPERATIVELY TRAINING AND/OR USING SEPARATE INPUT AND SUBSEQUENT CONTENT NEURAL NETWORKS FOR INFORMATION RETRIEVAL

    公开(公告)号:US20180240013A1

    公开(公告)日:2018-08-23

    申请号:US15476280

    申请日:2017-03-31

    Applicant: Google Inc.

    Abstract: Systems, methods, and computer readable media related to information retrieval. Some implementations are related to training and/or using a relevance model for information retrieval. The relevance model includes an input neural network model and a subsequent content neural network model. The input neural network model and the subsequent content neural network model can be separate, but trained and/or used cooperatively. The input neural network model and the subsequent content neural network model can be “separate” in that separate inputs are applied to the neural network models, and each of the neural network models is used to generate its own feature vector based on its applied input. A comparison of the feature vectors generated based on the separate network models can then be performed, where the comparison indicates relevance of the input applied to the input neural network model to the separate input applied to the subsequent content neural network model.

    RECOGNIZING SPEECH IN MULTIPLE LANGUAGES
    6.
    发明申请
    RECOGNIZING SPEECH IN MULTIPLE LANGUAGES 有权
    在多种语言中识别语音

    公开(公告)号:US20130238336A1

    公开(公告)日:2013-09-12

    申请号:US13726954

    申请日:2012-12-26

    Applicant: GOOGLE INC.

    CPC classification number: G10L15/005 G10L15/183 G10L15/32

    Abstract: Speech recognition systems may perform the following operations: receiving audio; recognizing the audio using language models for different languages to produce recognition candidates for the audio, where the recognition candidates are associated with corresponding recognition scores; identifying a candidate language for the audio; selecting a recognition candidate based on the recognition scores and the candidate language; and outputting data corresponding to the selected recognition candidate as a recognized version of the audio.

    Abstract translation: 语音识别系统可以执行以下操作:接收音频; 使用不同语言的语言模型识别音频以产生用于音频的识别候选,其中识别候选与相应的识别分数相关联; 识别音频的候选语言; 基于识别分数和候选语言选择识别候选; 并输出与所选择的识别候选对应的数据作为音频的识别版本。

    Cooperatively training and/or using separate input and subsequent content neural networks for information retrieval

    公开(公告)号:US11188824B2

    公开(公告)日:2021-11-30

    申请号:US15476280

    申请日:2017-03-31

    Applicant: Google Inc.

    Abstract: Systems, methods, and computer readable media related to information retrieval. Some implementations are related to training and/or using a relevance model for information retrieval. The relevance model includes an input neural network model and a subsequent content neural network model. The input neural network model and the subsequent content neural network model can be separate, but trained and/or used cooperatively. The input neural network model and the subsequent content neural network model can be “separate” in that separate inputs are applied to the neural network models, and each of the neural network models is used to generate its own feature vector based on its applied input. A comparison of the feature vectors generated based on the separate network models can then be performed, where the comparison indicates relevance of the input applied to the input neural network model to the separate input applied to the subsequent content neural network model.

    Written-domain language modeling with decomposition
    8.
    发明授权
    Written-domain language modeling with decomposition 有权
    书面域语言建模与分解

    公开(公告)号:US09460088B1

    公开(公告)日:2016-10-04

    申请号:US13906654

    申请日:2013-05-31

    Applicant: Google Inc.

    CPC classification number: G06F17/2881 G06F17/2765 G10L15/19

    Abstract: An automatic speech recognition system and method are provided for written-domain language modeling. According to one implementation, a process includes accessing decomposed training data that results from applying rewrite grammar rules to original training data, the decomposed training data comprising (i) regular words from the original training data that have not been rewritten using the set of rewrite grammar rules, and (ii) decomposed segments that result from rewriting non-lexical entities from the original training data using the rewrite grammar rules, generating a restriction model that (i) maps language model paths for regular words to themselves, and (ii) restricts language model paths for decomposed segments for non-lexical entities, training a n-gram language model over the training data, composing the restriction model and the language model to obtain a restricted language model, and constructing a decoding network by composing a context dependency model and a pronunciation lexicon with the restricted language model.

    Abstract translation: 提供了一种用于书面域语言建模的自动语音识别系统和方法。 根据一个实施方式,一个过程包括访问由重写语法规则应用于原始训练数据而产生的分解的训练数据,分解的训练数据包括(i)来自原始训练数据的常规单词,该原始训练数据未被重写使用该组重写语法 规则,和(ii)使用重写语法规则从原始训练数据重写非词汇实体产生的分段,生成限制模型,其将(i)将常规单词的语言模型路径映射到自身,以及(ii)限制 用于非词汇实体的分解段的语言模型路径,训练训练数据上的n-gram语言模型,组成限制模型和语言模型以获得受限语言模型,以及通过组合上下文依赖模型构建解码网络 和具有受限语言模型的发音词典。

Patent Agency Ranking