-
Publication number: US20170256256A1
Publication date: 2017-09-07
Application number: US15057453
Filing date: 2016-03-01
Applicant: Google Inc.
Inventor: Bo Wang, Sunil Vemuri, Barnaby John James, Scott B. Huffman, Pravir Kumar Gupta
CPC classification number: G10L15/22, G10L15/1822, G10L15/265, G10L15/30, G10L2015/223, G10L2015/227, G10L2015/228
Abstract: Methods, systems, and apparatus for receiving, by a voice action system, data specifying a new voice action for an application different from the voice action system. A voice action intent for the application is generated based at least on the data, wherein the voice action intent comprises data that, when received by the application, requests that the application perform one or more operations specified for the new voice action. The voice action intent is associated with trigger terms specified for the new voice action. The voice action system is configured to receive an indication of a user utterance obtained by a device having the application installed, and determines that a transcription of the user utterance corresponds to the trigger terms associated with the voice action intent. In response to the determination, the voice action system provides the voice action intent to the device.
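The flow in this abstract (register a new voice action, associate it with trigger terms, then match a transcription and hand the intent to the device) can be sketched roughly as follows. This is only an illustration: the class names, the `register` and `match` methods, and the choice of exact matching after whitespace and case normalization are assumptions, not details taken from the application.

```python
from __future__ import annotations

from dataclasses import dataclass, field


def _normalize(text: str) -> str:
    """Lowercase and collapse whitespace so comparisons are forgiving."""
    return " ".join(text.lower().split())


@dataclass
class VoiceActionIntent:
    """Hypothetical payload asking an application to perform operations."""
    app_id: str
    operations: list[str]
    trigger_terms: list[str] = field(default_factory=list)


class VoiceActionSystem:
    """Toy registry that maps trigger terms to voice action intents."""

    def __init__(self) -> None:
        self._intents: list[VoiceActionIntent] = []

    def register(self, app_id: str, operations: list[str],
                 trigger_terms: list[str]) -> VoiceActionIntent:
        # Generate the intent from the developer-supplied data and
        # associate it with the specified trigger terms.
        intent = VoiceActionIntent(
            app_id, list(operations), [_normalize(t) for t in trigger_terms]
        )
        self._intents.append(intent)
        return intent

    def match(self, transcription: str,
              installed_apps: set[str]) -> VoiceActionIntent | None:
        # Assumption: a transcription "corresponds to" a trigger term when
        # the normalized strings are equal and the app is installed.
        text = _normalize(transcription)
        for intent in self._intents:
            if intent.app_id in installed_apps and text in intent.trigger_terms:
                return intent
        return None
```

A caller would register an action once and then call `match` with each transcription and the set of applications installed on the device; a non-`None` result is the intent to provide to the device.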
-
Publication number: US20170110116A1
Publication date: 2017-04-20
Application number: US15196663
Filing date: 2016-06-29
Applicant: Google Inc.
Inventor: Siddhi Tadpatrikar, Michael Buchanan, Pravir Kumar Gupta
IPC: G10L15/05, G06F17/30, G10L15/065, G10L15/26, G10L25/78
CPC classification number: G10L15/05, G06F17/30746, G10L15/04, G10L15/065, G10L15/07, G10L15/22, G10L15/26, G10L25/78, G10L2025/783
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing are described. In one aspect, a method includes the action of accessing voice query log data that includes voice queries spoken by a particular user. The actions further include based on the voice query log data that includes voice queries spoken by a particular user, determining a pause threshold from the voice query log data that includes voice queries spoken by the particular user. The actions further include receiving, from the particular user, an utterance. The actions further include determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold. The actions further include based on determining that the particular user has stopped speaking for at least a period of time equal to the pause threshold, processing the utterance as a voice query.
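A per-user pause threshold of the kind described here might be derived as in the sketch below. The abstract does not say how the threshold is computed, so the rule used (longest pause observed inside the user's past queries plus a fixed margin), the log layout, and the default values are all assumptions made for illustration.

```python
def pause_threshold_ms(query_log_pauses, default_ms=800.0, margin_ms=100.0):
    """Derive a pause threshold from one user's voice query log.

    query_log_pauses: hypothetical log layout, one list per past query,
    holding the durations (ms) of pauses that occurred inside that query.
    """
    pauses = [p for query in query_log_pauses for p in query]
    if not pauses:
        # No history for this user: fall back to a generic default.
        return default_ms
    # Assumed rule: wait slightly longer than the user's longest
    # mid-query pause before declaring the utterance finished.
    return max(pauses) + margin_ms


def should_endpoint(silence_ms, threshold_ms):
    """True once the user has been silent for at least the threshold."""
    return silence_ms >= threshold_ms
```

Once `should_endpoint` returns true for the current silence, the utterance collected so far would be processed as a voice query.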
-
Publication number: US09542441B1
Publication date: 2017-01-10
Application number: US14935268
Filing date: 2015-11-06
Applicant: Google Inc.
Inventor: Michael Buchanan, Mark Andrew Paskin, Pravir Kumar Gupta
IPC: G06F17/30
CPC classification number: G06F17/3043, G06F17/30477, G06F17/3053, G06F17/30554, G06F17/30646, G06F17/30663, G06F17/30864
Abstract: The specification relates to a method of receiving a first query and a second query. The method analyzes the second query for a presence of anaphora. If anaphora is present, the method analyzes the first query for a presence of an entity that can be associated with the anaphora. If the analysis analyzing the first query returns two or more associated entities, the method forms a third query wherein the anaphora of the second query is replaced with one of the associated entities and forms a fourth query wherein the anaphora is replaced with the other of the associated entities. The third query and the fourth query are sent to a query-ranking engine. The third query and the fourth query receive a ranking and the higher-ranked query is sent to a search engine.
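The query-rewriting steps in this abstract can be sketched as follows. The pronoun list, the substring-based entity lookup against a supplied `known_entities` list, and the `rank` callable standing in for the query-ranking engine are all hypothetical simplifications, not the method as claimed.

```python
import re

# Assumed, deliberately small set of anaphors (subject/object pronouns).
ANAPHORS = {"he", "she", "it", "they", "him", "them"}


def find_anaphor(query):
    """Return the first anaphor found in the query, or None."""
    for token in re.findall(r"[a-z']+", query.lower()):
        if token in ANAPHORS:
            return token
    return None


def candidate_rewrites(first_query, second_query, known_entities):
    """Build one rewritten query per entity found in the first query."""
    anaphor = find_anaphor(second_query)
    if anaphor is None:
        return [second_query]
    entities = [e for e in known_entities if e.lower() in first_query.lower()]
    if not entities:
        return [second_query]
    pattern = re.compile(rf"\b{re.escape(anaphor)}\b", re.IGNORECASE)
    # Replace only the first occurrence of the anaphor with each entity.
    return [pattern.sub(lambda _m, e=e: e, second_query, count=1)
            for e in entities]


def resolve(first_query, second_query, known_entities, rank):
    """Send every candidate to the ranker and keep the best-scoring one."""
    candidates = candidate_rewrites(first_query, second_query, known_entities)
    return max(candidates, key=rank)
```

With two entities in the first query, `candidate_rewrites` yields the "third" and "fourth" queries of the abstract, and `resolve` returns whichever the ranking function scores higher, which is the query that would go to the search engine.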
-
Publication number: US09536006B2
Publication date: 2017-01-03
Application number: US14949308
Filing date: 2015-11-23
Applicant: Google Inc.
Inventor: Tal Cohen, Ziv Bar-Yossef, Igor Tsvetkov, Tomer Kol, Adi Mano, Oren Naim, Nitsan Oz, Pravir Kumar Gupta, Kavi J. Goel
IPC: G06F17/30
CPC classification number: G06F17/30867, G06F17/30528, G06F17/3053, G06F17/30554, G06F17/30864
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing search results. In one aspect, a method includes identifying a plurality of registered publishers for enriched search results and, for each registered publisher, obtaining enrichment information from the registered publisher and associating the enrichment information with a resource provided by the publisher. A query is received. A plurality of responsive resources that are responsive to the query are identified. A first responsive resource is determined to be associated with enrichment information. An enriched search result is provided, the enriched search result identifying the first responsive resource and including the first responsive resource's associated enrichment information.
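As a rough illustration of the two phases in this abstract (collecting enrichment information from registered publishers, then attaching it to responsive resources at query time), consider the sketch below. The feed layout, the dictionary-based result format, and both function names are assumptions made for the example.

```python
def build_enrichment_index(publisher_feeds):
    """Associate each publisher's enrichment info with its resources.

    publisher_feeds: hypothetical layout, mapping a registered publisher
    name to a dict of {resource_url: enrichment_info}.
    """
    index = {}
    for publisher, feed in publisher_feeds.items():
        for url, info in feed.items():
            index[url] = {"publisher": publisher, "enrichment": info}
    return index


def enrich_results(responsive_urls, index):
    """Attach enrichment info to the responsive resources that have it."""
    results = []
    for url in responsive_urls:
        result = {"url": url}
        if url in index:
            result["enrichment"] = index[url]["enrichment"]
        results.append(result)
    return results
```

Resources without registered enrichment information pass through as plain results, so the ordering produced by the search step is left unchanged.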
-
Publication number: US20160350304A1
Publication date: 2016-12-01
Application number: US14808919
Filing date: 2015-07-24
Applicant: Google Inc.
Inventor: Vikram Aggarwal, Pravir Kumar Gupta
CPC classification number: G10L15/1822, G06F3/167, G06F16/3322, G06F16/3323, G10L15/26, G10L2015/223
Abstract: Technology of the disclosure may facilitate user discovery of various voice-based action queries that can be spoken to initiate computer-based actions, such as voice-based action queries that can be provided as spoken input to a computing device to initiate computer-based actions that are particularized to content being viewed or otherwise consumed by the user on the computing device. Some implementations are generally directed to determining, in view of content recently viewed by a user on a computing device, at least one suggested voice-based action query for presentation via the computing device. Some implementations are additionally or alternatively generally directed to receiving at least one suggested voice-based action query at a computing device and providing the suggested voice-based action query as a suggestion in response to input to initiate providing of a voice-based query via the computing device.
-
Publication number: US20160314791A1
Publication date: 2016-10-27
Application number: US14693330
Filing date: 2015-04-22
Applicant: Google Inc.
Inventor: Bo Wang, Sunil Vemuri, Nitin Mangesh Shetti, Pravir Kumar Gupta, Scott B. Huffman, Javier Alejandro Rey, Jeffrey A. Boortz
CPC classification number: G10L15/22, G06F3/167, G10L15/1815, G10L15/19, G10L2015/0638, G10L2015/088, G10L2015/223
Abstract: Methods, systems, and apparatus for receiving data identifying an application and a voice command trigger term, validating the received data, inducting the received data to generate an intent that specifies the application, the voice command trigger term, and one or more other voice command trigger terms that are determined based at least on the voice command trigger term, and storing the intent at a contextual intent database, wherein the contextual intent database comprises one or more other intents.
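The validate, induct, and store sequence in this abstract could look roughly like the sketch below. The abstract does not say how the "other" trigger terms are derived, so the single-word substitution from a small synonym table is purely an assumption, as are the table contents, the dictionary-shaped intent, and the list used as the contextual intent database.

```python
# Hypothetical synonym table used to derive additional trigger terms.
SYNONYMS = {"call": ["phone", "dial"], "cab": ["taxi"]}


def validate(app_id, trigger_term):
    """Reject submissions that lack an application or a trigger term."""
    if not app_id or not trigger_term.strip():
        raise ValueError("app_id and trigger_term are required")


def induct(app_id, trigger_term, intent_db):
    """Generate an intent with expanded triggers and store it."""
    validate(app_id, trigger_term)
    words = trigger_term.lower().split()
    variants = set()
    # Assumed expansion rule: swap one word at a time for a synonym.
    for i, word in enumerate(words):
        for synonym in SYNONYMS.get(word, []):
            variants.add(" ".join(words[:i] + [synonym] + words[i + 1:]))
    intent = {
        "app": app_id,
        "trigger": " ".join(words),
        "other_triggers": sorted(variants),
    }
    intent_db.append(intent)  # the database already holds other intents
    return intent
```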
-
Publication number: US20150310879A1
Publication date: 2015-10-29
Application number: US14681203
Filing date: 2015-04-08
Applicant: Google Inc.
Inventor: Michael Buchanan, Pravir Kumar Gupta, Christopher Bo Tandiono
CPC classification number: G10L15/05, G10L15/04, G10L15/22, G10L15/26, G10L17/06, G10L25/51, G10L25/78, G10L25/87, G10L25/90
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech endpointing based on word comparisons are described. In one aspect, a method includes the actions of obtaining a transcription of an utterance. The actions further include determining, as a first value, a quantity of text samples in a collection of text samples that (i) include terms that match the transcription, and (ii) do not include any additional terms. The actions further include determining, as a second value, a quantity of text samples in the collection of text samples that (i) include terms that match the transcription, and (ii) include one or more additional terms. The actions further include classifying the utterance as a likely incomplete utterance or not a likely incomplete utterance based at least on comparing the first value and the second value.
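The two counts and the comparison described in this abstract can be sketched as below. Two points are assumptions: a sample "includes terms that match the transcription" is read here as the sample starting with the transcription's words, and the decision rule (more extended samples than `ratio` times the exact ones) is just one possible way to compare the first and second values.

```python
def classify_utterance(transcription, text_samples, ratio=1.0):
    """Return (first_value, second_value, likely_incomplete).

    first_value:  samples that match the transcription with no extra terms.
    second_value: samples that match it and continue with more terms.
    """
    terms = transcription.lower().split()
    if not terms:
        return 0, 0, False
    exact = 0
    extended = 0
    for sample in text_samples:
        words = sample.lower().split()
        if words[:len(terms)] != terms:
            continue  # sample does not begin with the transcription
        if len(words) == len(terms):
            exact += 1
        else:
            extended += 1
    # Assumed comparison: incomplete when continuations dominate.
    likely_incomplete = extended > ratio * exact
    return exact, extended, likely_incomplete
```

An endpointer built on this would keep listening when the third element is true and finalize the utterance otherwise.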