-
公开(公告)号:US20200279564A1
公开(公告)日:2020-09-03
申请号:US16877799
申请日:2020-05-19
Applicant: Google LLC
Inventor: Ulas Kirazci , Bo Wang , Steve Chen , Sunil Vemuri , Barnaby James , Valerie Nygaard
Abstract: Some implementations are directed to selective invocation of a particular third-party (3P) agent by an automated assistant to achieve an intended action determined by the automated assistant during a dynamic dialog between the automated assistant and a user. In some of those implementations, the particular 3P agent is invoked with value(s) for parameter(s) that are determined during the dynamic dialog; and/or the particular 3P agent is selected, from a plurality of candidate 3P agents, for invocation based on the determined value(s) for the parameter(s) and/or based on other criteria. In some of those implementations, the automated assistant invokes the particular 3P agent by transmitting, to the particular 3P agent, a 3P invocation request that includes the determined value(s) for the parameter(s).
-
公开(公告)号:US20200258523A1
公开(公告)日:2020-08-13
申请号:US16860679
申请日:2020-04-28
Applicant: Google LLC
Inventor: Gaurav Bhaya , Ulas Kirazci , Bradley Abrams , Adam Coimbra , Ilya Firman , Carey Radebaugh
Abstract: The present disclosure is generally directed to the generation of voice-activated data flows in interconnected network. The voice-activated data flows can include input audio signals that include a request and are detected at a client device. The client device can transmit the input audio signal to a data processing system, where the input audio signal can be parsed and passed to the data processing system of a service provider to fulfill the request in the input audio signal. The present solution is configured to conserve network resources by reducing the number of network transmissions needed to fulfill a request.
-
公开(公告)号:US20200184974A1
公开(公告)日:2020-06-11
申请号:US16791334
申请日:2020-02-14
Applicant: Google LLC
Inventor: Barnaby James , Bo Wang , Sunil Vemuri , David Schairer , Ulas Kirazci , Ertan Dogrultan , Petar Aleksic
Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.
-
公开(公告)号:US12217759B2
公开(公告)日:2025-02-04
申请号:US18434602
申请日:2024-02-06
Applicant: GOOGLE LLC
Inventor: Barnaby James , Bo Wang , Sunil Vemuri , David Schairer , Ulas Kirazci , Ertan Dogrultan , Petar Aleksic
IPC: G10L15/26 , G06F40/205 , G06F40/284 , G06F40/30 , G10L15/18 , G10L15/183 , G10L15/22 , G10L15/30
Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.
-
公开(公告)号:US20240274133A1
公开(公告)日:2024-08-15
申请号:US18434602
申请日:2024-02-06
Applicant: GOOGLE LLC
Inventor: Barnaby James , Bo Wang , Sunil Vemuri , David Schairer , Ulas Kirazci , Ertan Dogrultan , Petar Aleksic
IPC: G10L15/26 , G06F40/205 , G06F40/284 , G06F40/30 , G10L15/18 , G10L15/183 , G10L15/22 , G10L15/30
CPC classification number: G10L15/26 , G06F40/205 , G06F40/284 , G06F40/30 , G10L15/1815 , G10L15/183 , G10L15/22 , G10L15/30 , G10L2015/223 , G10L2015/228
Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.
-
公开(公告)号:US11922209B2
公开(公告)日:2024-03-05
申请号:US17898205
申请日:2022-08-29
Applicant: GOOGLE LLC
Inventor: Jason Douglas , Carey Radebaugh , Ilya Firman , Ulas Kirazci , Luv Kothari
CPC classification number: G06F9/4843 , G10L15/1822 , G10L15/22 , G10L15/30 , G06F2209/482 , G10L2015/223 , G10L2015/228 , G10L15/34 , H04L12/281 , H04L12/2816
Abstract: Systems and methods of invoking functions of agents via digital assistant applications are provided. Each action-inventory can have an address template for an action by an agent. The address template can include a portion having an input variable used to execute the action. A data processing system can parse an input audio signal from a client device to identify a request and a parameter to be executed by the agent. The data processing system can select an action-inventory for the action corresponding to the request. The data processing system can generate, using the address template, an address. The address can include a substring having the parameter used to control execution of the action. The data processing system can direct an action data structure including the address to the agent to cause the agent to execute the action and to provide output for presentation.
-
37.
公开(公告)号:US20230343336A1
公开(公告)日:2023-10-26
申请号:US18217326
申请日:2023-06-30
Applicant: GOOGLE LLC
Inventor: Ulas Kirazci , Adam Coimbra , Abraham Lee , Wei Dong , Thushan Amarasiriwardena
IPC: G10L15/22 , G06F9/448 , G06F3/16 , G10L13/027
CPC classification number: G10L15/22 , G06F9/4498 , G06F3/167 , G10L13/027 , G10L2015/223 , G10L2015/228
Abstract: Techniques are described herein for multi-modal interaction between users, automated assistants, and other computing services. In various implementations, a user may engage with the automated assistant in order to further engage with a third party computing service. In some implementations, the user may advance through dialog state machines associated with third party computing service using both verbal input modalities and input modalities other than verbal modalities, such as visual/tactile modalities.
-
公开(公告)号:US20230260517A1
公开(公告)日:2023-08-17
申请号:US18125606
申请日:2023-03-23
Applicant: GOOGLE LLC
Inventor: Barnaby James , Bo Wang , Sunil Vemuri , David Schairer , Ulas Kirazci , Ertan Dogrultan , Petar Aleksic
IPC: G10L15/26 , G10L15/22 , G06F40/284 , G06F40/205 , G06F40/30 , G10L15/183 , G10L15/18 , G10L15/30
CPC classification number: G10L15/26 , G06F40/30 , G06F40/205 , G06F40/284 , G10L15/22 , G10L15/30 , G10L15/183 , G10L15/1815 , G10L2015/223 , G10L2015/228
Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.
-
公开(公告)号:US20220413901A1
公开(公告)日:2022-12-29
申请号:US17898205
申请日:2022-08-29
Applicant: GOOGLE LLC
Inventor: Jason Douglas , Carey Radebaugh , Ilya Firman , Ulas Kirazci , Luv Kothari
Abstract: Systems and methods of invoking functions of agents via digital assistant applications are provided. Each action-inventory can have an address template for an action by an agent. The address template can include a portion having an input variable used to execute the action. A data processing system can parse an input audio signal from a client device to identify a request and a parameter to be executed by the agent. The data processing system can select an action-inventory for the action corresponding to the request. The data processing system can generate, using the address template, an address. The address can include a substring having the parameter used to control execution of the action. The data processing system can direct an action data structure including the address to the agent to cause the agent to execute the action and to provide output for presentation.
-
公开(公告)号:US11514907B2
公开(公告)日:2022-11-29
申请号:US16860679
申请日:2020-04-28
Applicant: Google LLC
Inventor: Gaurav Bhaya , Ulas Kirazci , Bradley Abrams , Adam Coimbra , Ilya Firman , Carey Radebaugh
IPC: G10L15/22 , G10L15/18 , G10L15/30 , G06F40/216 , G06F40/30 , G06F16/901 , G06F40/35
Abstract: The present disclosure is generally directed to the generation of voice-activated data flows in interconnected network. The voice-activated data flows can include input audio signals that include a request and are detected at a client device. The client device can transmit the input audio signal to a data processing system, where the input audio signal can be parsed and passed to the data processing system of a service provider to fulfill the request in the input audio signal. The present solution is configured to conserve network resources by reducing the number of network transmissions needed to fulfill a request.
-
-
-
-
-
-
-
-
-