-
Publication number: US09842489B2
Publication date: 2017-12-12
Application number: US13766878
Filing date: 2013-02-14
Applicant: Google Inc.
Inventor: Glen Shires
CPC classification number: G08C17/00 , G08C2201/12 , G08C2201/32 , G10L15/30 , G10L15/32 , H04L12/12 , H04L12/282 , Y02D50/40
Abstract: The disclosed subject matter provides a main device and at least one secondary device. The at least one secondary device and the main device may operate in cooperation with one another and other networked components to provide improved performance, such as improved speech and other signal recognition operations. Using the improved recognition results, a higher probability of generating the proper commands to a controllable device is provided.
-
Publication number: US08612211B1
Publication date: 2013-12-17
Application number: US13743838
Filing date: 2013-01-17
Applicant: Google Inc.
Inventor: Glen Shires , Sterling Swigart , Jonathan Zolla , Jason J. Gauci
CPC classification number: H04N7/15 , G06F17/27 , G10L15/1815 , G10L15/26
Abstract: The subject matter of this specification can be embodied in, among other things, a method that includes receiving two or more data sets each representing speech of a corresponding individual attending an internet-based social networking video conference session, decoding the received data sets to produce corresponding text for each individual attending the internet-based social networking video conference, and detecting characteristics of the session from a coalesced transcript produced from the decoded text of the attending individuals for providing context to the internet-based social networking video conference session.
-
Publication number: US20140229184A1
Publication date: 2014-08-14
Application number: US13766878
Filing date: 2013-02-14
Applicant: Google Inc.
Inventor: Glen Shires
IPC: G10L15/26
CPC classification number: G08C17/00 , G08C2201/12 , G08C2201/32 , G10L15/30 , G10L15/32 , H04L12/12 , H04L12/282 , Y02D50/40
Abstract: The disclosed subject matter provides a main device and at least one secondary device. The at least one secondary device and the main device may operate in cooperation with one another and other networked components to provide improved performance, such as improved speech and other signal recognition operations. Using the improved recognition results, a higher probability of generating the proper commands to a controllable device is provided.
-
Publication number: US20180012591A1
Publication date: 2018-01-11
Application number: US15711260
Filing date: 2017-09-21
Applicant: Google Inc.
Inventor: Petar Aleksic , Glen Shires , Michael Buchanan
CPC classification number: G10L15/05 , G06F3/167 , G10L15/04 , G10L15/22 , G10L15/26 , G10L25/78 , G10L2015/088 , G10L2025/783
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data including an utterance, obtaining context data that indicates one or more expected speech recognition results, determining an expected speech recognition result based on the context data, receiving an intermediate speech recognition result generated by a speech recognition engine, comparing the intermediate speech recognition result to the expected speech recognition result for the audio data based on the context data, determining whether the intermediate speech recognition result corresponds to the expected speech recognition result for the audio data based on the context data, and setting an end of speech condition and providing a final speech recognition result in response to determining the intermediate speech recognition result matches the expected speech recognition result, the final speech recognition result including the one or more expected speech recognition results indicated by the context data.
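The early-endpointing idea in this abstract can be illustrated with a minimal sketch. It assumes that intermediate results arrive as plain strings and that "corresponds to" means a case- and whitespace-insensitive exact match; the names `normalize` and `endpoint_on_expected` are hypothetical and do not come from the patent.

```python
from typing import Dict, Iterable, Optional, Set


def normalize(text: str) -> str:
    """Lower-case and collapse whitespace (an assumed matching rule)."""
    return " ".join(text.lower().split())


def endpoint_on_expected(
    intermediate_results: Iterable[str], expected: Set[str]
) -> Optional[str]:
    """Return the expected result as soon as an intermediate result matches it.

    Returning early stands in for "setting an end of speech condition":
    later intermediate results are never read from the stream.
    Returns None if the stream ends without a match.
    """
    # Map normalized form back to the expected result as given in the context data.
    expected_by_norm: Dict[str, str] = {normalize(e): e for e in expected}
    for partial in intermediate_results:
        match = expected_by_norm.get(normalize(partial))
        if match is not None:
            return match
    return None


if __name__ == "__main__":
    # Context data: the application has just asked a yes/no question.
    stream = iter(["ye", "yes", "yes please"])
    print(endpoint_on_expected(stream, {"Yes", "No"}))
```

In this sketch the caller would fall back to its ordinary endpointing (for example a silence timeout) whenever the function returns `None`.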
-
Publication number: US20170069309A1
Publication date: 2017-03-09
Application number: US15192431
Filing date: 2016-06-24
Applicant: Google Inc.
Inventor: Petar Aleksic , Glen Shires , Michael Buchanan
CPC classification number: G10L15/05 , G06F3/167 , G10L15/04 , G10L15/22 , G10L15/26 , G10L25/78 , G10L2015/088 , G10L2025/783
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data including an utterance, obtaining context data that indicates one or more expected speech recognition results, determining an expected speech recognition result based on the context data, receiving an intermediate speech recognition result generated by a speech recognition engine, comparing the intermediate speech recognition result to the expected speech recognition result for the audio data based on the context data, determining whether the intermediate speech recognition result corresponds to the expected speech recognition result for the audio data based on the context data, and setting an end of speech condition and providing a final speech recognition result in response to determining the intermediate speech recognition result matches the expected speech recognition result, the final speech recognition result including the one or more expected speech recognition results indicated by the context data.
-
Publication number: US09420227B1
Publication date: 2016-08-16
Application number: US14078800
Filing date: 2013-11-13
Applicant: Google Inc.
Inventor: Glen Shires , Sterling Swigart , Jonathan Zolla , Jason J. Gauci
CPC classification number: H04N7/15 , G06F17/27 , G10L15/1815 , G10L15/26
Abstract: The subject matter of this specification can be embodied in, among other things, a method that includes receiving two or more data sets each representing speech of a corresponding individual attending an internet-based social networking video conference session, decoding the received data sets to produce corresponding text for each individual attending the internet-based social networking video conference, and detecting characteristics of the session from a coalesced transcript produced from the decoded text of the attending individuals for providing context to the internet-based social networking video conference session.
-
Publication number: US09035996B1
Publication date: 2015-05-19
Application number: US14185879
Filing date: 2014-02-20
Applicant: Google Inc.
Inventor: Glen Shires , Maryam Garrett
CPC classification number: H04N7/152 , H04L12/1831
Abstract: A method of adding a computing device to a multi-device video communication session. A server receives recorded content from a plurality of multi-device video communication sessions and a search request from a computing device. The server identifies a first multi-device video communication session based on the search request. The first multi-device video communication session includes a weighted list of text elements. The server transmits information based on the weighted list of text elements to the computing device, receives a selection from the computing device corresponding to a first text element, and transmits at least a portion of the recorded content from the first multi-device video communication session to the computing device based on the first text element. The server receives an add request for the computing device to be added to the first multi-device video communication session and transmits the add request to the first multi-device video communication session.
-
Publication number: US20170069308A1
Publication date: 2017-03-09
Application number: US14844563
Filing date: 2015-09-03
Applicant: Google Inc.
Inventor: Petar Aleksic , Glen Shires , Michael Buchanan
CPC classification number: G10L15/04 , G06F17/2765 , G10L15/18 , G10L2015/228
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data including an utterance, obtaining context data that indicates one or more expected speech recognition results, determining an expected speech recognition result based on the context data, receiving an intermediate speech recognition result generated by a speech recognition engine, comparing the intermediate speech recognition result to the expected speech recognition result for the audio data based on the context data, determining whether the intermediate speech recognition result corresponds to the expected speech recognition result for the audio data based on the context data, and setting an end of speech condition and providing a final speech recognition result in response to determining the intermediate speech recognition result matches the expected speech recognition result, the final speech recognition result including the one or more expected speech recognition results indicated by the context data.
-
Publication number: US20160364118A1
Publication date: 2016-12-15
Application number: US14739853
Filing date: 2015-06-15
Applicant: Google Inc.
Inventor: Jakob Nicolaus Foerster , Diego Melendo Casado , Glen Shires
IPC: G06F3/0484 , G06F17/28 , G06F3/16 , G06F17/27 , G06F3/0488 , G06F3/041
CPC classification number: G06F3/04842 , G06F3/0237 , G06F3/0416 , G06F3/0488 , G06F3/04883 , G06F3/167 , G06F17/276 , G06F17/289
Abstract: In some implementations, data indicating a touch received on a proximity-sensitive display is received while the proximity-sensitive display is presenting one or more items. In one aspect, the techniques described may involve a process for disambiguating touch selections of hypothesized items, such as text or graphical objects that have been generated based on input data, on a proximity-sensitive display. This process may allow a user to more easily select hypothesized items that the user may wish to correct, by determining whether a touch received through the proximity-sensitive display represents a selection of each hypothesized item based at least on a level of confidence that the hypothesized item accurately represents the input data.
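One way to picture confidence-based touch disambiguation is to give low-confidence items a larger effective touch target. The sketch below is only an illustration under that assumption: the `Item` type, the `select_item` function, the radius formula, and the default `base_radius` are all hypothetical and are not taken from the patent.

```python
import math
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Item:
    text: str
    x: float           # center of the item on the display
    y: float
    confidence: float  # 0.0 to 1.0: how likely the item matches the input


def select_item(
    items: List[Item], touch_x: float, touch_y: float, base_radius: float = 20.0
) -> Optional[Item]:
    """Pick the item a touch most plausibly selects.

    Assumed rule: an item's effective radius grows as its confidence drops
    (base_radius at confidence 1.0, twice that at confidence 0.0), so items
    the user is more likely to want to correct are easier to hit.
    """
    best: Optional[Item] = None
    best_score = math.inf
    for item in items:
        radius = base_radius * (2.0 - item.confidence)
        dist = math.hypot(item.x - touch_x, item.y - touch_y)
        if dist > radius:
            continue  # touch falls outside this item's effective target
        score = dist / radius  # smaller means a more plausible selection
        if score < best_score:
            best, best_score = item, score
    return best


if __name__ == "__main__":
    words = [Item("their", 0, 0, 0.95), Item("whether", 50, 0, 0.30)]
    # The touch lands between the two words, slightly nearer the uncertain one.
    chosen = select_item(words, 27, 0)
    print(chosen.text if chosen else None)
```

Scaling the radius rather than applying a fixed confidence threshold keeps high-confidence items selectable when the touch lands directly on them.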
-