-
公开(公告)号:US09373329B2
公开(公告)日:2016-06-21
申请号:US14064755
申请日:2013-10-28
Applicant: Google Inc.
Inventor: Brian Strope , Francoise Beaufays , Olivier Siohan
Abstract: The subject matter of this specification can be embodied in, among other things, a method that includes receiving an audio signal and initiating speech recognition tasks by a plurality of speech recognition systems (SRS's). Each SRS is configured to generate a recognition result specifying possible speech included in the audio signal and a confidence value indicating a confidence in a correctness of the speech result. The method also includes completing a portion of the speech recognition tasks including generating one or more recognition results and one or more confidence values for the one or more recognition results, determining whether the one or more confidence values meets a confidence threshold, aborting a remaining portion of the speech recognition tasks for SRS's that have not generated a recognition result, and outputting a final recognition result based on at least one of the generated one or more speech results.
-
公开(公告)号:US09210258B2
公开(公告)日:2015-12-08
申请号:US13934993
申请日:2013-07-03
Applicant: Google Inc.
Inventor: Brian Strope , Francoise Beaufays , Hy Murveit
CPC classification number: H04L61/106 , H04L12/66 , H04L51/28 , H04L51/32 , H04L51/36 , H04M1/7255 , H04M3/42102 , H04W4/16 , H04W8/183
Abstract: In one implementation a computer-implemented method includes generating a group of telephone contacts for a first user, wherein the generating includes identifying a second user as a contact of the first user based upon a determination that the second user has at least a threshold email-based association with the first user; and adding the identified second user to the group of telephone contacts for the first user. The method further includes receiving a first request to connect a first telephone device associated with the first user to a second telephone device associated with the second user. The method also includes identifying a contact identifier of the second telephone device using the generated group of telephone contacts for the first user, and initiating a connection between the first telephone device and the second telephone device using the identified contact identifier.
Abstract translation: 在一个实现中,计算机实现的方法包括为第一用户生成一组电话联系人,其中生成包括基于第二用户至少具有阈值电子邮件地址的确定来将第二用户识别为第一用户的联系人, 与第一个用户的关联; 以及将所识别的第二用户添加到第一用户的电话联系人组。 该方法还包括接收将与第一用户相关联的第一电话设备连接到与第二用户相关联的第二电话设备的第一请求。 该方法还包括使用生成的第一用户的电话联系人识别第二电话设备的联系人标识符,以及使用所识别的联系人标识符来启动第一电话设备和第二电话设备之间的连接。
-
33.
公开(公告)号:US20150006178A1
公开(公告)日:2015-01-01
申请号:US13930495
申请日:2013-06-28
Applicant: Google Inc.
Inventor: Fuchun Peng , Francoise Beaufays , Brian Strope , Xin Lei , Pedro J. Moreno Mengibar , Trevor D. Strohman
IPC: G10L15/18
CPC classification number: G10L15/18 , G09B17/006 , G10L13/08 , G10L15/06
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining pronunciations for particular terms. The methods, systems, and apparatus include actions of obtaining audio samples of speech corresponding to a particular term and obtaining candidate pronunciations for the particular term. Further actions include generating, for each candidate pronunciation for the particular term and audio sample of speech corresponding to the particular term, a score reflecting a level of similarity between of the candidate pronunciation and the audio sample. Additional actions include aggregating the scores for each candidate pronunciation and adding one or more candidate pronunciations for the particular term to a pronunciation lexicon based on the aggregated scores for the candidate pronunciations.
Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于确定特定术语的发音。 方法,系统和装置包括获得与特定术语相对应的语音样本的动作,并获得特定术语的候选发音。 进一步的动作包括针对特定术语的每个候选发音和对应于特定术语的语音样本生成反映候选发音和音频样本之间的相似程度的分数。 附加动作包括聚合每个候选发音的分数,并且基于候选发音的聚合分数,将特定术语的一个或多个候选发音添加到发音词典。
-
公开(公告)号:US20140149119A1
公开(公告)日:2014-05-29
申请号:US13829482
申请日:2013-03-14
Applicant: Google Inc.
Inventor: Hasim Sak , Francoise Beaufays
IPC: G10L13/02
CPC classification number: G06F17/2775 , G10L15/083 , G10L15/187 , G10L15/197 , G10L15/26
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for transcribing utterances into written text are disclosed. The methods, systems, and apparatus include actions of obtaining a lexicon model that maps phones to spoken text and obtaining a language model that assigns probabilities to written text. Further includes generating a transducer that maps the written text to the spoken text, the transducer mapping multiple items of the written text to an item of the spoken text. Additionally, the actions include constructing a decoding network for transcribing utterances into written text, by composing the lexicon model, the inverse of the transducer, and the language model.
Abstract translation: 公开了包括在计算机存储介质上编码的用于将话语转换成书面文本的计算机程序的方法,系统和装置。 方法,系统和装置包括获取将电话映射到口语文本并获得将概率分配给书写文本的语言模型的词典模型的动作。 还包括生成将书写文本映射到口语文本的传感器,换能器将多个文本文本项目映射到口语文本的项目。 此外,这些动作包括通过组合词典模型,换能器的倒数和语言模型来构建用于将话语转录成书写文本的解码网络。
-
公开(公告)号:US20140058728A1
公开(公告)日:2014-02-27
申请号:US14064755
申请日:2013-10-28
Applicant: Google Inc.
Inventor: Brian Strope , Francoise Beaufays , Olivier Siohan
IPC: G10L15/26
Abstract: The subject matter of this specification can be embodied in, among other things, a method that includes receiving an audio signal and initiating speech recognition tasks by a plurality of speech recognition systems (SRS's). Each SRS is configured to generate a recognition result specifying possible speech included in the audio signal and a confidence value indicating a confidence in a correctness of the speech result. The method also includes completing a portion of the speech recognition tasks including generating one or more recognition results and one or more confidence values for the one or more recognition results, determining whether the one or more confidence values meets a confidence threshold, aborting a remaining portion of the speech recognition tasks for SRS's that have not generated a recognition result, and outputting a final recognition result based on at least one of the generated one or more speech results.
Abstract translation: 除了别的以外,本说明书的主题可以体现在包括通过多个语音识别系统(SRS)接收音频信号和发起语音识别任务的方法。 每个SRS被配置为产生指定包括在音频信号中的可能语音的识别结果,以及指示对语音结果的正确性置信度的置信度值。 该方法还包括完成语音识别任务的一部分,包括生成一个或多个识别结果和一个或多个识别结果的一个或多个置信度值,确定一个或多个置信度值是否满足置信阈值,中止其余部分 的没有产生识别结果的SRS的语音识别任务,并且基于所生成的一个或多个语音结果中的至少一个输出最终识别结果。
-
公开(公告)号:US20130238336A1
公开(公告)日:2013-09-12
申请号:US13726954
申请日:2012-12-26
Applicant: GOOGLE INC.
Inventor: Yun-hsuan Sung , Francoise Beaufays , Brian Strope , Hui Lin , Jui-Ting Huang
IPC: G10L15/00
CPC classification number: G10L15/005 , G10L15/183 , G10L15/32
Abstract: Speech recognition systems may perform the following operations: receiving audio; recognizing the audio using language models for different languages to produce recognition candidates for the audio, where the recognition candidates are associated with corresponding recognition scores; identifying a candidate language for the audio; selecting a recognition candidate based on the recognition scores and the candidate language; and outputting data corresponding to the selected recognition candidate as a recognized version of the audio.
Abstract translation: 语音识别系统可以执行以下操作:接收音频; 使用不同语言的语言模型识别音频以产生用于音频的识别候选,其中识别候选与相应的识别分数相关联; 识别音频的候选语言; 基于识别分数和候选语言选择识别候选; 并输出与所选择的识别候选对应的数据作为音频的识别版本。
-
-
-
-
-