-
Publication No.: US11929075B2
Publication date: 2024-03-12
Application No.: US16936935
Filing date: 2020-07-23
Applicant: Google LLC
Inventor: Bo Wang, Sunil Vemuri, Barnaby John James, Pravir Kumar Gupta, Nitin Mangesh Shetti
CPC classification number: G10L15/30, G06F3/167, G10L15/1822, G10L25/72, G10L2015/223, G10L2015/225
Abstract: Methods, systems, and apparatus for receiving, by a voice action system, data specifying trigger terms that trigger an application to perform a voice action and a context that specifies a status of the application when the voice action can be triggered. The voice action system receives data defining a discoverability example for the voice action that comprises one or more of the trigger terms that trigger the application to perform the voice action when a status of the application satisfies the specified context. The voice action system receives a request for discoverability examples for the application from a user device having the application installed, and provides the data defining the discoverability examples to the user device in response to the request. The user device is configured to provide a notification of the one or more of the trigger terms when a status of the application satisfies the specified context.
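The context check described in this abstract can be illustrated with a minimal sketch. It assumes the application status and the required context are both flat key/value mappings; the class name, field names, and sample values are hypothetical and do not come from the patent.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class DiscoverabilityExample:
    """Hypothetical record: a hint that surfaces trigger terms for one voice action."""
    voice_action: str
    trigger_terms: list[str]
    # Context: status values the application must report for the hint to apply.
    required_status: dict[str, str]


def context_satisfied(required: dict[str, str], status: dict[str, str]) -> bool:
    """Return True when every required key has the required value in the app status."""
    return all(status.get(key) == value for key, value in required.items())


def examples_to_notify(
    examples: list[DiscoverabilityExample], app_status: dict[str, str]
) -> list[DiscoverabilityExample]:
    """Keep only the examples whose context matches the current application status."""
    return [ex for ex in examples if context_satisfied(ex.required_status, app_status)]


# Assumed sample data for a media player application.
EXAMPLES = [
    DiscoverabilityExample("skip_ahead", ["skip ahead"], {"activity": "player", "playback": "playing"}),
    DiscoverabilityExample("pause", ["pause"], {"playback": "playing"}),
    DiscoverabilityExample("play", ["play"], {"playback": "paused"}),
]
```

With a status of `{"activity": "player", "playback": "playing"}`, only the "skip ahead" and "pause" hints would be offered to the user.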
-
Publication No.: US20230186915A1
Publication date: 2023-06-15
Application No.: US18107327
Filing date: 2023-02-08
Applicant: GOOGLE LLC
Inventor: Barnaby John James, David Roy Schairer, Amy Lynn Baldwin, Vincent Yanton Mo, Jun Yang, Mark Spates, Lei Zhong
IPC: G10L15/22, G10L15/183, G10L15/30, H04L12/28, H04L41/12
CPC classification number: G10L15/22, G10L15/30, G10L15/183, H04L12/282, H04L41/12, G10L2015/223, G10L2015/227, G10L2015/228, H04L67/125
Abstract: Example aspects of the present disclosure are directed to processing voice commands or utterances. For instance, data indicative of a voice utterance can be received. A device topology representation can be accessed. The device topology representation can define a plurality of smart devices associated with one or more structures. The device topology representation can further define a location of each of the plurality of devices within the associated structures. A transcription of the voice utterance can be determined based at least in part on the device topology representation. One or more selected devices and one or more actions to be performed by the one or more selected devices can be determined based at least in part on the determined transcription and the device topology representation.
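The final step of this abstract, choosing devices and an action from a transcription and a device topology, can be sketched as follows. This is only an illustration: it assumes the transcription is already available (the topology-biased transcription step is not modeled), uses naive substring matching, and all names (`Device`, `ACTIONS`, `resolve`) are hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Device:
    """Hypothetical topology entry: one smart device and where it is located."""
    name: str
    kind: str       # e.g. "light", "thermostat"
    structure: str  # e.g. "home"
    room: str       # location within the structure


# Assumed phrase-to-action table; a real system would use a proper language model.
ACTIONS = {"turn on": "on", "turn off": "off"}


def resolve(transcription: str, topology: list[Device]) -> tuple[list[Device], str | None]:
    """Pick the devices and the action that a transcription refers to."""
    text = transcription.lower()
    action = next((act for phrase, act in ACTIONS.items() if phrase in text), None)
    # Rooms named in the utterance narrow the selection; if none are named, all rooms match.
    rooms = {device.room for device in topology if device.room in text}
    selected = [
        device
        for device in topology
        if device.kind in text and (not rooms or device.room in rooms)
    ]
    return selected, action
```

For a topology with a kitchen light, a bedroom light, and a kitchen thermostat, "Turn on the kitchen lights" resolves to the kitchen light only, while "Turn off the lights" resolves to both lights.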
-
Publication No.: US20190156856A1
Publication date: 2019-05-23
Application No.: US16308570
Filing date: 2016-11-29
Applicant: GOOGLE LLC
Inventor: Barnaby John James
IPC: G10L25/48, H04L29/06, G10L17/00, G10L15/22, G06F21/32, G10L15/30, G06F21/35, G10L15/26, H04W12/06, G10L17/02
Abstract: In some implementations, (i) audio data representing a voice command spoken by a speaker and (ii) a speaker identification result indicating that the voice command was spoken by the speaker are obtained. A voice action is selected based at least on a transcription of the audio data. A service provider corresponding to the selected voice action is selected from among a plurality of different service providers. One or more input data types that the selected service provider uses to perform authentication for the selected voice action are identified. A request to perform the selected voice action and one or more values that correspond to the identified one or more input data types are provided to the service provider.
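The flow in this abstract (select a voice action, select its service provider, identify the input data types that provider needs for authentication, then send the request with matching values) can be sketched as below. The phrase table, provider registry, and data type names are all hypothetical and serve only to make the steps concrete.

```python
from __future__ import annotations

# Assumed mapping from spoken phrases to voice actions.
VOICE_ACTIONS = {
    "what is my balance": "check_balance",
    "transfer": "transfer_money",
}

# Assumed registry: voice action -> (service provider, input data types it requires).
PROVIDERS = {
    "check_balance": ("bank_a", ["speaker_id_confidence"]),
    "transfer_money": ("bank_a", ["speaker_id_confidence", "device_location"]),
}


def select_voice_action(transcription: str) -> str | None:
    """Select a voice action from the transcription by simple phrase matching."""
    text = transcription.lower()
    for phrase, action in VOICE_ACTIONS.items():
        if phrase in text:
            return action
    return None


def build_request(action: str, available_values: dict[str, object]) -> dict[str, object]:
    """Build the request for the provider, carrying only the values it asks for."""
    provider, required_types = PROVIDERS[action]
    missing = [t for t in required_types if t not in available_values]
    if missing:
        raise KeyError(f"missing input data types: {missing}")
    values = {t: available_values[t] for t in required_types}
    return {"provider": provider, "action": action, "values": values}
```

Values that the selected provider does not require are left out of the request, so each provider only receives the data types it uses for authentication.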
-
Publication No.: US10853747B2
Publication date: 2020-12-01
Application No.: US15815353
Filing date: 2017-11-16
Applicant: Google LLC
Inventor: Bo Wang, Lei Zhong, Barnaby John James, Saisuresh Krishnakumaran, Robert Stets, Bogdan Caprita, Valerie Nygaard
IPC: G10L15/22, G06Q10/06, G10L15/08, G06F16/951, G10L13/00
Abstract: An example method includes receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device; identifying, based on the utterance, a task to be performed; determining a capability level of a first party (1P) agent to perform the task; determining capability levels of respective third party (3P) agents of a plurality of 3P agents to perform the task; responsive to determining that the capability level of the 1P agent does not satisfy a threshold capability level, that a capability level of a particular 3P agent of the plurality of 3P agents is a greatest of the determined capability levels, and that the capability level of the particular 3P agent satisfies the threshold capability level, selecting the particular 3P agent to perform the task; and performing one or more actions determined by the selected agent to perform the task.
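The selection rule in this abstract can be written as a short function. This is a sketch under assumptions: capability levels are numbers, "satisfies the threshold" is taken to mean greater than or equal to it, and preferring the 1P agent whenever it satisfies the threshold is an assumed complement to the branch the abstract actually describes. The function and agent names are hypothetical.

```python
from __future__ import annotations


def select_agent(
    first_party_level: float,
    third_party_levels: dict[str, float],
    threshold: float,
) -> str | None:
    """Choose which agent performs the task, or None if no agent qualifies."""
    # Assumption: the 1P agent is used whenever it satisfies the threshold.
    if first_party_level >= threshold:
        return "1P"
    if not third_party_levels:
        return None
    # Branch from the abstract: take the 3P agent with the greatest capability
    # level, and use it only if that level also satisfies the threshold.
    best = max(third_party_levels, key=third_party_levels.get)
    if third_party_levels[best] >= threshold:
        return best
    return None
```

For example, with a threshold of 0.8, a 1P level of 0.3, and 3P levels of 0.85 and 0.9, the agent with level 0.9 is selected.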
-
Publication No.: US10770093B2
Publication date: 2020-09-08
Application No.: US16308570
Filing date: 2016-11-29
Applicant: GOOGLE LLC
Inventor: Barnaby John James
IPC: G06F21/34, G10L25/48, G06F21/32, G06F21/35, G10L15/22, G10L17/00, H04L29/06, G10L15/26, G10L15/30, G10L17/02, H04W12/06, G07C9/37
Abstract: In some implementations, (i) audio data representing a voice command spoken by a speaker and (ii) a speaker identification result indicating that the voice command was spoken by the speaker are obtained. A voice action is selected based at least on a transcription of the audio data. A service provider corresponding to the selected voice action is selected from among a plurality of different service providers. One or more input data types that the selected service provider uses to perform authentication for the selected voice action are identified. A request to perform the selected voice action and one or more values that correspond to the identified one or more input data types are provided to the service provider.
-
Publication No.: US10510129B2
Publication date: 2019-12-17
Application No.: US15782351
Filing date: 2017-10-12
Applicant: GOOGLE LLC
IPC: G06Q50/14, G06Q30/06, G06F16/29, G06F16/583, G01C21/36, G06K9/62, G06T11/60, G06T7/11, G06F16/95, G06T17/05
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for computerized travel services. One of the methods includes identifying photographs using an index of photographs, the photographs being identified from the index as photographs geographically related to a point of interest or destination and having a creation timestamp corresponding to a time of the year; determining for each of the photographs, a relevancy score based at least in part on: selection success data of the photograph for image queries referring to the point of interest or destination, and references to the point of interest or destination in documents associated with the photograph; and selecting a selected photograph from the photographs based at least in part on a respective visual quality score and the respective relevancy scores, the visual quality score representing a degree of visual quality of the respective photographs.
-
Publication No.: US20230269586A1
Publication date: 2023-08-24
Application No.: US18307210
Filing date: 2023-04-26
Applicant: Google LLC
Inventor: Barnaby John James
IPC: H04W12/065, G06F21/32, G06F21/35, G10L15/22, G10L17/00, H04L9/40, H04W12/06, H04W12/30, G10L15/26, G10L15/30, G10L17/02, G10L25/48
CPC classification number: H04W12/065, G06F21/32, G06F21/35, G10L15/22, G10L15/26, G10L15/30, G10L17/00, G10L17/02, G10L25/48, H04L63/0861, H04W12/06, H04W12/30, G07C9/37
Abstract: In some implementations, (i) audio data representing a voice command spoken by a speaker and (ii) a speaker identification result indicating that the voice command was spoken by the speaker are obtained. A voice action is selected based at least on a transcription of the audio data. A service provider corresponding to the selected voice action is selected from among a plurality of different service providers. One or more input data types that the selected service provider uses to perform authentication for the selected voice action are identified. A request to perform the selected voice action and one or more values that correspond to the identified one or more input data types are provided to the service provider.
-
Publication No.: US20200286482A1
Publication date: 2020-09-10
Application No.: US16880567
Filing date: 2020-05-21
Applicant: Google LLC
Inventor: Barnaby John James, David Roy Schairer, Amy Lynn Baldwin, Vincent Yanton Mo, Jun Yang, Mark Spates IV, Lei Zhong
IPC: G10L15/22, G10L15/183, G10L15/30, H04L12/28, H04L12/24
Abstract: Example aspects of the present disclosure are directed to processing voice commands or utterances. For instance, data indicative of a voice utterance can be received. A device topology representation can be accessed. The device topology representation can define a plurality of smart devices associated with one or more structures. The device topology representation can further define a location of each of the plurality of devices within the associated structures. A transcription of the voice utterance can be determined based at least in part on the device topology representation. One or more selected devices and one or more actions to be performed by the one or more selected devices can be determined based at least in part on the determined transcription and the device topology representation.
-
Publication No.: US10741183B2
Publication date: 2020-08-11
Application No.: US16101940
Filing date: 2018-08-13
Applicant: Google LLC
Inventor: Bo Wang, Sunil Vemuri, Barnaby John James, Pravir Kumar Gupta, Nitin Mangesh Shetti
Abstract: Methods, systems, and apparatus for receiving, by a voice action system, data specifying trigger terms that trigger an application to perform a voice action and a context that specifies a status of the application when the voice action can be triggered. The voice action system receives data defining a discoverability example for the voice action that comprises one or more of the trigger terms that trigger the application to perform the voice action when a status of the application satisfies the specified context. The voice action system receives a request for discoverability examples for the application from a user device having the application installed, and provides the data defining the discoverability examples to the user device in response to the request. The user device is configured to provide a notification of the one or more of the trigger terms when a status of the application satisfies the specified context.
-
Publication No.: US10127926B2
Publication date: 2018-11-13
Application No.: US15178895
Filing date: 2016-06-10
Applicant: GOOGLE LLC
Inventor: Barnaby John James
IPC: G10L17/22, G06F21/30, G10L25/48, G10L15/26, G10L15/30, G10L17/02, H04W12/06, G06F21/32, G06F21/35, G10L17/00, H04L29/06, G10L15/22, G07C9/00
Abstract: In some implementations, (i) audio data representing a voice command spoken by a speaker and (ii) a speaker identification result indicating that the voice command was spoken by the speaker are obtained. A voice action is selected based at least on a transcription of the audio data. A service provider corresponding to the selected voice action is selected from among a plurality of different service providers. One or more input data types that the selected service provider uses to perform authentication for the selected voice action are identified. A request to perform the selected voice action and one or more values that correspond to the identified one or more input data types are provided to the service provider.