Recognizing speech in the presence of additional audio
    2.
    发明授权
    Recognizing speech in the presence of additional audio 有权
    在存在额外音频的情况下认识到演讲

    公开(公告)号:US09318112B2

    公开(公告)日:2016-04-19

    申请号:US14181345

    申请日:2014-02-14

    Applicant: Google Inc.

    Abstract: The technology described in this document can be embodied in a computer-implemented method that includes receiving, at a processing system, a first signal including an output of a speaker device and an additional audio signal. The method also includes determining, by the processing system, based at least in part on a model trained to identify the output of the speaker device, that the additional audio signal corresponds to an utterance of a user. The method further includes initiating a reduction in an audio output level of the speaker device based on determining that the additional audio signal corresponds to the utterance of the user.

    Abstract translation: 本文中描述的技术可以以计算机实现的方法来实现,该方法包括在处理系统处接收包括扬声器设备的输出和附加音频信号的第一信号。 该方法还包括至少部分地基于经训练以识别扬声器设备的输出的模型来确定该附加音频信号对应于用户的话语。 该方法还包括基于确定附加音频信号对应于用户的话语来启动扬声器设备的音频输出电平的降低。

    Caching speech recognition scores

    公开(公告)号:US09858922B2

    公开(公告)日:2018-01-02

    申请号:US14311557

    申请日:2014-06-23

    Applicant: Google Inc.

    CPC classification number: G10L15/08 G10L15/285

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for caching speech recognition scores. In some implementations, one or more values comprising data about an utterance are received. An index value is determined for the one or more values. An acoustic model score for the one or more received values is selected, from a cache of acoustic model scores that were computed before receiving the one or more values, based on the index value. A transcription for the utterance is determined using the selected acoustic model score.

    Speaker verification using neural networks
    5.
    发明授权
    Speaker verification using neural networks 有权
    使用神经网络的扬声器验证

    公开(公告)号:US09401148B2

    公开(公告)日:2016-07-26

    申请号:US14228469

    申请日:2014-03-28

    Applicant: Google Inc.

    CPC classification number: G10L17/18

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for inputting speech data that corresponds to a particular utterance to a neural network; determining an evaluation vector based on output at a hidden layer of the neural network; comparing the evaluation vector with a reference vector that corresponds to a past utterance of a particular speaker; and based on comparing the evaluation vector and the reference vector, determining whether the particular utterance was likely spoken by the particular speaker.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于将对应于特定话语的语音数据输入到神经网络; 基于所述神经网络的隐藏层的输出确定评估向量; 将评估向量与对应于特定说话者的过去发音的参考向量进行比较; 并且基于比较评估向量和参考向量,确定特定发音是否可能由特定说话者说出。

    Language Identification
    6.
    发明申请
    Language Identification 审中-公开
    语言识别

    公开(公告)号:US20150364129A1

    公开(公告)日:2015-12-17

    申请号:US14313490

    申请日:2014-06-24

    Applicant: Google Inc.

    CPC classification number: G10L15/005 G10L15/183 G10L15/32

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for language identification. In some implementations, speech data for an utterance is received and provided to (i) a language identification module and (ii) multiple speech recognizers that are each configured to recognize speech in a different language. From the language identification module, language identification scores corresponding to different languages are received, the language identification scores each indicating a likelihood that the utterance is speech in the corresponding language. A language model confidence score that indicates a level of confidence that a language model has in a transcription of the utterance in a language corresponding to the language model is received. A language is selected based on the language identification scores and the language model confidence scores.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于语言识别。 在一些实现中,接收用于话语的语音数据并提供给(i)语言识别模块和(ii)多个语音识别器,每个语音识别器被配置为以不同语言识别语音。 从语言识别模块接收与不同语言相对应的语言识别分数,语言识别分数各自表示发音是相应语言的语音的可能性。 语言模型可信度得分表示语言模型在对应于语言模型的语言的语音转录中的置信水平。 基于语言识别分数和语言模型置信度得分选择语言。

    CACHING SPEECH RECOGNITION SCORES
    7.
    发明申请
    CACHING SPEECH RECOGNITION SCORES 有权
    缓存语音识别码

    公开(公告)号:US20150371631A1

    公开(公告)日:2015-12-24

    申请号:US14311557

    申请日:2014-06-23

    Applicant: Google Inc.

    CPC classification number: G10L15/08 G10L15/285

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for caching speech recognition scores. In some implementations, one or more values comprising data about an utterance are received. An index value is determined for the one or more values. An acoustic model score for the one or more received values is selected, from a cache of acoustic model scores that were computed before receiving the one or more values, based on the index value. A transcription for the utterance is determined using the selected acoustic model score.

    Abstract translation: 方法,系统和装置,包括编码在计算机存储介质上的用于缓存语音识别分数的计算机程序。 在一些实现中,接收包括关于话语的数据的一个或多个值。 确定一个或多个值的索引值。 基于索引值,从接收到一个或多个值之前计算的声学模型分数的高速缓存中选择一个或多个接收值的声学模型分数。 使用所选择的声学模型得分确定发音的转录。

    RECOGNIZING SPEECH IN THE PRESENCE OF ADDITIONAL AUDIO
    8.
    发明申请
    RECOGNIZING SPEECH IN THE PRESENCE OF ADDITIONAL AUDIO 有权
    在附加音频的存在下识别语音

    公开(公告)号:US20150235637A1

    公开(公告)日:2015-08-20

    申请号:US14181345

    申请日:2014-02-14

    Applicant: Google Inc.

    Abstract: The technology described in this document can be embodied in a computer-implemented method that includes receiving, at a processing system, a first signal including an output of a speaker device and an additional audio signal. The method also includes determining, by the processing system, based at least in part on a model trained to identify the output of the speaker device, that the additional audio signal corresponds to an utterance of a user. The method further includes initiating a reduction in an audio output level of the speaker device based on determining that the additional audio signal corresponds to the utterance of the user.

    Abstract translation: 本文中描述的技术可以以计算机实现的方法来实现,该方法包括在处理系统处接收包括扬声器设备的输出和附加音频信号的第一信号。 该方法还包括至少部分地基于经训练以识别扬声器设备的输出的模型来确定该附加音频信号对应于用户的话语。 该方法还包括基于确定附加音频信号对应于用户的话语来启动扬声器设备的音频输出电平的降低。

    SPEAKER VERIFICATION USING NEURAL NETWORKS
    9.
    发明申请
    SPEAKER VERIFICATION USING NEURAL NETWORKS 有权
    使用神经网络的扬声器验证

    公开(公告)号:US20150127336A1

    公开(公告)日:2015-05-07

    申请号:US14228469

    申请日:2014-03-28

    Applicant: Google Inc.

    CPC classification number: G10L17/18

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for inputting speech data that corresponds to a particular utterance to a neural network; determining an evaluation vector based on output at a hidden layer of the neural network; comparing the evaluation vector with a reference vector that corresponds to a past utterance of a particular speaker; and based on comparing the evaluation vector and the reference vector, determining whether the particular utterance was likely spoken by the particular speaker.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于将对应于特定话语的语音数据输入到神经网络; 基于所述神经网络的隐藏层的输出确定评估向量; 将评估向量与对应于特定说话者的过去发音的参考向量进行比较; 并且基于比较评估向量和参考向量,确定特定发音是否可能由特定说话者说出。

    SPEECH RECOGNITION USING NEURAL NETWORKS
    10.
    发明申请
    SPEECH RECOGNITION USING NEURAL NETWORKS 审中-公开
    使用神经网络的语音识别

    公开(公告)号:US20150039301A1

    公开(公告)日:2015-02-05

    申请号:US13955483

    申请日:2013-07-31

    Applicant: Google Inc.

    CPC classification number: G10L15/16

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech recognition using neural networks. A feature vector that models audio characteristics of a portion of an utterance is received. Data indicative of latent variables of multivariate factor analysis is received. The feature vector and the data indicative of the latent variables is provided as input to a neural network. A candidate transcription for the utterance is determined based on at least an output of the neural network.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于使用神经网络的语音识别。 接收对话音的一部分的音频特征进行建模的特征向量。 收到指示多元因素分析的潜在变量的数据。 特征向量和指示潜变量的数据被提供给神经网络的输入。 基于至少神经网络的输出确定用于话语的候选转录。

Patent Agency Ranking