Patent search ap:("Google Inc.") AND inv:"Ignacio L. Moreno" Page 1

1.

发明申请
USER INTERFACE CUSTOMIZATION BASED ON SPEAKER CHARACTERICS 审中-公开
Title translation: 基于扬声器特性的用户界面自定义

公开(公告)号：US20160342389A1

公开(公告)日：2016-11-24

申请号：US15230891

申请日：2016-08-08

Applicant: Google Inc.

Inventor： Eugene Weinstein , Ignacio L. Moreno

IPC: G06F3/16 , G06F17/21 , G06F3/0481

CPC classification number: G06F3/167 , G06F3/04817 , G06F9/451 , G06F17/214

Abstract: Characteristics of a speaker are estimated using speech processing and machine learning. The characteristics of the speaker are used to automatically customize a user interface of a client device for the speaker.

Abstract translation: 使用语音处理和机器学习来估计扬声器的特性。扬声器的特性用于自动定制扬声器的客户端设备的用户界面。

2.

发明授权
Recognizing speech in the presence of additional audio 有权
Title translation: 在存在额外音频的情况下认识到演讲

公开(公告)号：US09318112B2

公开(公告)日：2016-04-19

申请号：US14181345

申请日：2014-02-14

Applicant: Google Inc.

Inventor： Diego Melendo Casado , Ignacio L. Moreno , Javier Gonzalez-Dominguez

IPC: G10L15/00 , G10L15/22 , G10L17/00

CPC classification number: G10L15/20 , G06F3/165 , G06F3/167 , G10L15/222 , G10L15/265 , G10L17/00 , G10L17/06 , G10L21/034 , G10L25/84 , H03G3/3005

Abstract: The technology described in this document can be embodied in a computer-implemented method that includes receiving, at a processing system, a first signal including an output of a speaker device and an additional audio signal. The method also includes determining, by the processing system, based at least in part on a model trained to identify the output of the speaker device, that the additional audio signal corresponds to an utterance of a user. The method further includes initiating a reduction in an audio output level of the speaker device based on determining that the additional audio signal corresponds to the utterance of the user.

Abstract translation: 本文中描述的技术可以以计算机实现的方法来实现，该方法包括在处理系统处接收包括扬声器设备的输出和附加音频信号的第一信号。该方法还包括至少部分地基于经训练以识别扬声器设备的输出的模型来确定该附加音频信号对应于用户的话语。该方法还包括基于确定附加音频信号对应于用户的话语来启动扬声器设备的音频输出电平的降低。

3.

发明授权
Caching speech recognition scores 有权

公开(公告)号：US09858922B2

公开(公告)日：2018-01-02

申请号：US14311557

申请日：2014-06-23

Applicant: Google Inc.

Inventor： Eugene Weinstein , Sanjiv Kumar , Ignacio L. Moreno , Andrew W. Senior , Nikhil Prasad Bhat

IPC: G10L15/08 , G10L15/28

CPC classification number: G10L15/08 , G10L15/285

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for caching speech recognition scores. In some implementations, one or more values comprising data about an utterance are received. An index value is determined for the one or more values. An acoustic model score for the one or more received values is selected, from a cache of acoustic model scores that were computed before receiving the one or more values, based on the index value. A transcription for the utterance is determined using the selected acoustic model score.

4.

发明申请
USER INTERFACE CUSTOMIZATION BASED ON SPEAKER CHARACTERISTICS 审中-公开
Title translation: 基于扬声器特性的用户界面自定义

公开(公告)号：US20150154002A1

公开(公告)日：2015-06-04

申请号：US14096608

申请日：2013-12-04

Applicant: Google Inc.

Inventor： Eugene Weinstein , Ignacio L. Moreno

IPC: G06F3/16

CPC classification number: G06F3/167 , G06F3/04817 , G06F9/451 , G06F17/214

Abstract: Characteristics of a speaker are estimated using speech processing and machine learning. The characteristics of the speaker are used to automatically customize a user interface of a client device for the speaker.

Abstract translation: 使用语音处理和机器学习来估计扬声器的特性。扬声器的特性用于自动定制扬声器的客户端设备的用户界面。

5.

发明授权
Speaker verification using neural networks 有权
Title translation: 使用神经网络的扬声器验证

公开(公告)号：US09401148B2

公开(公告)日：2016-07-26

申请号：US14228469

申请日：2014-03-28

Applicant: Google Inc.

Inventor： Xin Lei , Erik McDermott , Ehsan Variani , Ignacio L. Moreno

IPC: G10L17/00 , G10L17/18

CPC classification number: G10L17/18

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for inputting speech data that corresponds to a particular utterance to a neural network; determining an evaluation vector based on output at a hidden layer of the neural network; comparing the evaluation vector with a reference vector that corresponds to a past utterance of a particular speaker; and based on comparing the evaluation vector and the reference vector, determining whether the particular utterance was likely spoken by the particular speaker.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于将对应于特定话语的语音数据输入到神经网络; 基于所述神经网络的隐藏层的输出确定评估向量; 将评估向量与对应于特定说话者的过去发音的参考向量进行比较; 并且基于比较评估向量和参考向量，确定特定发音是否可能由特定说话者说出。

6.

发明申请
Language Identification 审中-公开
Title translation: 语言识别

公开(公告)号：US20150364129A1

公开(公告)日：2015-12-17

申请号：US14313490

申请日：2014-06-24

Applicant: Google Inc.

Inventor： Javier Gonzalez-Dominguez , Ignacio L. Moreno , David P. Eustis

IPC: G10L15/00 , G10L15/02

CPC classification number: G10L15/005 , G10L15/183 , G10L15/32

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for language identification. In some implementations, speech data for an utterance is received and provided to (i) a language identification module and (ii) multiple speech recognizers that are each configured to recognize speech in a different language. From the language identification module, language identification scores corresponding to different languages are received, the language identification scores each indicating a likelihood that the utterance is speech in the corresponding language. A language model confidence score that indicates a level of confidence that a language model has in a transcription of the utterance in a language corresponding to the language model is received. A language is selected based on the language identification scores and the language model confidence scores.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于语言识别。在一些实现中，接收用于话语的语音数据并提供给（i）语言识别模块和（ii）多个语音识别器，每个语音识别器被配置为以不同语言识别语音。从语言识别模块接收与不同语言相对应的语言识别分数，语言识别分数各自表示发音是相应语言的语音的可能性。语言模型可信度得分表示语言模型在对应于语言模型的语言的语音转录中的置信水平。基于语言识别分数和语言模型置信度得分选择语言。

7.

发明申请
CACHING SPEECH RECOGNITION SCORES 有权
Title translation: 缓存语音识别码

公开(公告)号：US20150371631A1

公开(公告)日：2015-12-24

申请号：US14311557

申请日：2014-06-23

Applicant: Google Inc.

Inventor： Eugene Weinstein , Sanjiv Kumar , Ignacio L. Moreno , Andrew W. Senior , Nikhil Prasad Bhat

IPC: G10L15/14 , G10L19/038

CPC classification number: G10L15/08 , G10L15/285

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for caching speech recognition scores. In some implementations, one or more values comprising data about an utterance are received. An index value is determined for the one or more values. An acoustic model score for the one or more received values is selected, from a cache of acoustic model scores that were computed before receiving the one or more values, based on the index value. A transcription for the utterance is determined using the selected acoustic model score.

Abstract translation: 方法，系统和装置，包括编码在计算机存储介质上的用于缓存语音识别分数的计算机程序。在一些实现中，接收包括关于话语的数据的一个或多个值。确定一个或多个值的索引值。基于索引值，从接收到一个或多个值之前计算的声学模型分数的高速缓存中选择一个或多个接收值的声学模型分数。使用所选择的声学模型得分确定发音的转录。

8.

发明申请
RECOGNIZING SPEECH IN THE PRESENCE OF ADDITIONAL AUDIO 有权
Title translation: 在附加音频的存在下识别语音

公开(公告)号：US20150235637A1

公开(公告)日：2015-08-20

申请号：US14181345

申请日：2014-02-14

Applicant: Google Inc.

Inventor： Diego Melendo Casado , Ignacio L. Moreno , Javier Gonzalez-Dominguez

IPC: G10L13/02 , G10L15/16 , G10L15/26

CPC classification number: G10L15/20 , G06F3/165 , G06F3/167 , G10L15/222 , G10L15/265 , G10L17/00 , G10L17/06 , G10L21/034 , G10L25/84 , H03G3/3005

Abstract: The technology described in this document can be embodied in a computer-implemented method that includes receiving, at a processing system, a first signal including an output of a speaker device and an additional audio signal. The method also includes determining, by the processing system, based at least in part on a model trained to identify the output of the speaker device, that the additional audio signal corresponds to an utterance of a user. The method further includes initiating a reduction in an audio output level of the speaker device based on determining that the additional audio signal corresponds to the utterance of the user.

Abstract translation: 本文中描述的技术可以以计算机实现的方法来实现，该方法包括在处理系统处接收包括扬声器设备的输出和附加音频信号的第一信号。该方法还包括至少部分地基于经训练以识别扬声器设备的输出的模型来确定该附加音频信号对应于用户的话语。该方法还包括基于确定附加音频信号对应于用户的话语来启动扬声器设备的音频输出电平的降低。

9.

发明申请
SPEAKER VERIFICATION USING NEURAL NETWORKS 有权
Title translation: 使用神经网络的扬声器验证

公开(公告)号：US20150127336A1

公开(公告)日：2015-05-07

申请号：US14228469

申请日：2014-03-28

Applicant: Google Inc.

Inventor： Xin Lei , Erik McDermott , Ehsan Variani , Ignacio L. Moreno

IPC: G10L17/18

CPC classification number: G10L17/18

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for inputting speech data that corresponds to a particular utterance to a neural network; determining an evaluation vector based on output at a hidden layer of the neural network; comparing the evaluation vector with a reference vector that corresponds to a past utterance of a particular speaker; and based on comparing the evaluation vector and the reference vector, determining whether the particular utterance was likely spoken by the particular speaker.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于将对应于特定话语的语音数据输入到神经网络; 基于所述神经网络的隐藏层的输出确定评估向量; 将评估向量与对应于特定说话者的过去发音的参考向量进行比较; 并且基于比较评估向量和参考向量，确定特定发音是否可能由特定说话者说出。

10.

发明申请
SPEECH RECOGNITION USING NEURAL NETWORKS 审中-公开
Title translation: 使用神经网络的语音识别

公开(公告)号：US20150039301A1

公开(公告)日：2015-02-05

申请号：US13955483

申请日：2013-07-31

Applicant: Google Inc.

Inventor： Andrew W. Senior , Ignacio L. Moreno

IPC: G10L15/16

CPC classification number: G10L15/16

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech recognition using neural networks. A feature vector that models audio characteristics of a portion of an utterance is received. Data indicative of latent variables of multivariate factor analysis is received. The feature vector and the data indicative of the latent variables is provided as input to a neural network. A candidate transcription for the utterance is determined based on at least an output of the neural network.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于使用神经网络的语音识别。接收对话音的一部分的音频特征进行建模的特征向量。收到指示多元因素分析的潜在变量的数据。特征向量和指示潜变量的数据被提供给神经网络的输入。基于至少神经网络的输出确定用于话语的候选转录。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification