-
Publication number: US20230274205A1
Publication date: 2023-08-31
Application number: US18135579
Application date: 2023-04-17
Applicant: Google LLC
Inventor: Robert Stets, Valerie Nygaard, Bogdan Caprita, Bradley M. Abrams, Jason Brant Douglas
IPC: G06Q10/0631, G10L15/22
CPC classification number: G06Q10/063112, G10L15/22, G10L2015/223, G06F16/951
Abstract: An example method includes receiving, by one or more processors, a representation of an utterance spoken at a computing device; identifying, by a first computational agent from a plurality of computational agents and based on the utterance, a multi-element task to be performed, wherein the plurality of computational agents includes one or more first party computational agents and a plurality of third-party computational agents; and performing, by the first computational agent, a first sub-set of elements of the multi-element task, wherein performing the first sub-set of elements comprises selecting a second computational agent from the plurality of computational agents to perform a second sub-set of elements of the multi-element task.
-
Publication number: US10853747B2
Publication date: 2020-12-01
Application number: US15815353
Application date: 2017-11-16
Applicant: Google LLC
Inventor: Bo Wang, Lei Zhong, Barnaby John James, Saisuresh Krishnakumaran, Robert Stets, Bogdan Caprita, Valerie Nygaard
IPC: G10L15/22, G06Q10/06, G10L15/08, G06F16/951, G10L13/00
Abstract: An example method includes receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device; identifying, based on the utterance, a task to be performed; determining a capability level of a first party (1P) agent to perform the task; determining capability levels of respective third party (3P) agents of a plurality of 3P agents to perform the task; responsive to determining that the capability level of the 1P agent does not satisfy a threshold capability level, that a capability level of a particular 3P agent of the plurality of 3P agents is a greatest of the determined capability levels, and that the capability level of the particular 3P agent satisfies the threshold capability level, selecting the particular 3P agent to perform the task; and performing one or more actions determined by the selected agent to perform the task.
-
Publication number: US10714086B2
Publication date: 2020-07-14
Application number: US16244780
Application date: 2019-01-10
Applicant: Google LLC
Inventor: Ulas Kirazci, Bo Wang, Steve Chen, Sunil Vemuri, Barnaby James, Valerie Nygaard
Abstract: Some implementations are directed to selective invocation of a particular third-party (3P) agent by an automated assistant to achieve an intended action determined by the automated assistant during a dynamic dialog between the automated assistant and a user. In some of those implementations, the particular 3P agent is invoked with value(s) for parameter(s) that are determined during the dynamic dialog; and/or the particular 3P agent is selected, from a plurality of candidate 3P agents, for invocation based on the determined value(s) for the parameter(s) and/or based on other criteria. In some of those implementations, the automated assistant invokes the particular 3P agent by transmitting, to the particular 3P agent, a 3P invocation request that includes the determined value(s) for the parameter(s).
-
Publication number: US11935530B2
Publication date: 2024-03-19
Application number: US17515901
Application date: 2021-11-01
Applicant: Google LLC
Inventor: April Pufahl, Jared Strawderman, Harry Yu, Adriana Olmos Antillon, Jonathan Livni, Okan Kolak, James Giangola, Nitin Khandelwal, Jason Kearns, Andrew Watson, Joseph Ashear, Valerie Nygaard
CPC classification number: G10L15/22, G06F1/1694, G06F3/167, G06F2203/0381, G10L2015/223, G10L2015/225, H04M2203/253
Abstract: Systems, methods, and apparatus for using a multimodal response in the dynamic generation of client device output that is tailored to a current modality of a client device are disclosed herein. Multimodal client devices can engage in a variety of interactions across the multimodal spectrum including voice only interactions, voice forward interactions, multimodal interactions, visual forward interactions, visual only interactions, etc. A multimodal response can include a core message to be rendered for all interaction types as well as one or more modality dependent components to provide a user with additional information.
-
Publication number: US11663535B2
Publication date: 2023-05-30
Application number: US15815368
Application date: 2017-11-16
Applicant: Google LLC
Inventor: Robert Stets, Valerie Nygaard, Bogdan Caprita, Bradley M. Abrams, Jason Brant Douglas
IPC: G06Q10/0631, G10L15/22, G06F16/951
CPC classification number: G06Q10/063112, G10L15/22, G06F16/951, G10L2015/223
Abstract: An example method includes receiving, by one or more processors, a representation of an utterance spoken at a computing device; identifying, by a first computational agent from a plurality of computational agents and based on the utterance, a multi-element task to be performed, wherein the plurality of computational agents includes one or more first party computational agents and a plurality of third-party computational agents; and performing, by the first computational agent, a first sub-set of elements of the multi-element task, wherein performing the first sub-set of elements comprises selecting a second computational agent from the plurality of computational agents to perform a second sub-set of elements of the multi-element task.
-
Publication number: US10854188B2
Publication date: 2020-12-01
Application number: US16417024
Application date: 2019-05-20
Applicant: Google LLC
Inventor: Valerie Nygaard, Bogdan Caprita, Robert Stets, Saisuresh Krishnakumaran, Jason Brant Douglas
Abstract: An example method includes receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device; selecting, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents; responsive to determining that the selected agent comprises a first party agent, selecting a reserved voice from a plurality of voices; and outputting synthesized audio data using the selected voice to satisfy the utterance.
-
Publication number: US10311856B2
Publication date: 2019-06-04
Application number: US15815375
Application date: 2017-11-16
Applicant: Google LLC
Inventor: Valerie Nygaard, Bogdan Caprita, Robert Stets, Saisuresh Krishnakumaran, Jason Brant Douglas
Abstract: An example method includes receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device; selecting, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents; responsive to determining that the selected agent comprises a first party agent, selecting a reserved voice from a plurality of voices; and outputting synthesized audio data using the selected voice to satisfy the utterance.
-
Publication number: US11763813B2
Publication date: 2023-09-19
Application number: US17243232
Application date: 2021-04-28
Applicant: Google LLC
Inventor: Lior Alon, Rafael Goldfarb, Dekel Auster, Dan Rasin, Michael Andrew Goodman, Trevor Strohman, Nino Tasca, Valerie Nygaard, Jaclyn Konzelmann
CPC classification number: G10L15/22, G06F3/167, G10L15/083, G10L15/1815, G10L15/285, G10L2015/223
Abstract: Implementations described herein relate to reducing latency in automated assistant interactions. In some implementations, a client device can receive audio data that captures a spoken utterance of a user. The audio data can be processed to determine an assistant command to be performed by an automated assistant. The assistant command can be processed, using a latency prediction model, to generate a predicted latency to fulfill the assistant command. Further, the client device (or the automated assistant) can determine, based on the predicted latency, whether to audibly render pre-cached content for presentation to the user prior to audibly rendering content that is responsive to the spoken utterance. The pre-cached content can be tailored to the assistant command and audibly rendered for presentation to the user while the content is being obtained, and the content can be audibly rendered for presentation to the user subsequent to the pre-cached content.
-
Publication number: US11562742B2
Publication date: 2023-01-24
Application number: US17150139
Application date: 2021-01-15
Applicant: Google LLC
Inventor: Ulas Kirazci, Bo Wang, Steve Chen, Sunil Vemuri, Barnaby James, Valerie Nygaard
Abstract: Some implementations are directed to selective invocation of a particular third-party (3P) agent by an automated assistant to achieve an intended action determined by the automated assistant during a dynamic dialog between the automated assistant and a user. In some of those implementations, the particular 3P agent is invoked with value(s) for parameter(s) that are determined during the dynamic dialog; and/or the particular 3P agent is selected, from a plurality of candidate 3P agents, for invocation based on the determined value(s) for the parameter(s) and/or based on other criteria. In some of those implementations, the automated assistant invokes the particular 3P agent by transmitting, to the particular 3P agent, a 3P invocation request that includes the determined value(s) for the parameter(s).
-
Publication number: US11164576B2
Publication date: 2021-11-02
Application number: US16251982
Application date: 2019-01-18
Applicant: Google LLC
Inventor: April Pufahl, Jared Strawderman, Harry Yu, Adriana Olmos Antillon, Jonathan Livni, Okan Kolak, James Giangola, Nitin Khandelwal, Jason Kearns, Andrew Watson, Joseph Ashear, Valerie Nygaard
Abstract: Systems, methods, and apparatus for using a multimodal response in the dynamic generation of client device output that is tailored to a current modality of a client device are disclosed herein. Multimodal client devices can engage in a variety of interactions across the multimodal spectrum including voice only interactions, voice forward interactions, multimodal interactions, visual forward interactions, visual only interactions, etc. A multimodal response can include a core message to be rendered for all interaction types as well as one or more modality dependent components to provide a user with additional information.