-
公开(公告)号:US20180190274A1
公开(公告)日:2018-07-05
申请号:US15394872
申请日:2016-12-30
Applicant: Google Inc.
Inventor: Ulas Kirazci , Bo Wang , Steve Chen , Sunil Vemuri , Barnaby James , Valerie Nygaard
CPC classification number: G10L15/22 , G06F3/167 , G06F17/2785 , G06F17/30867 , G10L15/1815 , G10L15/1822 , G10L15/30 , G10L2015/223
Abstract: Some implementations are directed to selective invocation of a particular third-party (3P) agent by an automated assistant to achieve an intended action determined by the automated assistant during a dynamic dialog between the automated assistant and a user. In some of those implementations, the particular 3P agent is invoked with value(s) for parameter(s) that are determined during the dynamic dialog; and/or the particular 3P agent is selected, from a plurality of candidate 3P agents, for invocation based on the determined value(s) for the parameter(s) and/or based on other criteria. In some of those implementations, the automated assistant invokes the particular 3P agent by transmitting, to the particular 3P agent, a 3P invocation request that includes the determined value(s) for the parameter(s).
-
公开(公告)号:US20180166074A1
公开(公告)日:2018-06-14
申请号:US15378920
申请日:2016-12-14
Applicant: Google Inc.
Inventor: Vikram Aggarwal , Barnaby James
IPC: G10L15/22 , G10L15/02 , G10L21/007
CPC classification number: G10L15/22 , G10L15/02 , G10L15/08 , G10L21/007 , G10L25/51 , G10L2015/088 , G10L2015/223
Abstract: Methods, apparatus, and computer readable media are described related to recording, organizing, and making audio files available for consumption by voice-activated products. In various implementations, in response to receiving an input from a first user indicating that the first user intends to record audio content, audio content may be captured and stored. Input may be received from the first user indicating at least one identifier for the audio content. The stored audio content may be associated with the at least one identifier. A voice input may be received from a subsequent user. In response to determining that the voice input has particular characteristics, speech recognition may be biased in respect of the voice input towards recognition of the at least one identifier. In response to recognizing, based on the biased speech recognition, presence of the at least one identifier in the voice input, the stored audio content may be played.
-
公开(公告)号:US20180096681A1
公开(公告)日:2018-04-05
申请号:US15284473
申请日:2016-10-03
Applicant: Google Inc.
Inventor: Yuzhao Ni , Bo Wang , Barnaby James , Pravir Gupta , David Schairer
CPC classification number: G10L15/22 , G06F3/167 , G06F17/289 , G06F17/30401 , G06F17/30672 , G10L15/063 , G10L15/1815 , G10L15/1822 , G10L15/183 , G10L2015/088 , G10L2015/223 , G10L2015/225
Abstract: In various implementations, upon receiving a given voice command from a user, a voice-based trigger may be selected from a library of voice-based triggers previously used across a population of users. The library may include association(s) between each voice-based trigger and responsive action(s) previously performed in response to the voice-based trigger. The selecting may be based on a measure of similarity between the given voice command and the selected voice-based trigger. One or more responsive actions associated with the selected voice-based trigger in the library may be determined. Based on the one or more responsive actions, current responsive action(s) may be performed by the client device. Feedback associated with performance of the current responsive action(s) may be received from the user and used to alter a strength of an association between the selected voice-based trigger and the one or more responsive actions.
-
公开(公告)号:US10600418B2
公开(公告)日:2020-03-24
申请号:US15372188
申请日:2016-12-07
Applicant: Google Inc.
Inventor: Barnaby James , Bo Wang , Sunil Vemuri , David Schairer , Ulas Kirazci , Ertan Dogrultan , Petar Aleksic
Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.
-
公开(公告)号:US10297254B2
公开(公告)日:2019-05-21
申请号:US15284473
申请日:2016-10-03
Applicant: Google Inc.
Inventor: Yuzhao Ni , Bo Wang , Barnaby James , Pravir Gupta , David Schairer
IPC: G10L15/00 , G06F17/30 , G06F17/28 , G10L15/22 , G10L15/06 , G10L15/18 , G06F3/16 , G10L15/08 , G06F16/242 , G06F16/33
Abstract: In various implementations, upon receiving a given voice command from a user, a voice-based trigger may be selected from a library of voice-based triggers previously used across a population of users. The library may include association(s) between each voice-based trigger and responsive action(s) previously performed in response to the voice-based trigger. The selecting may be based on a measure of similarity between the given voice command and the selected voice-based trigger. One or more responsive actions associated with the selected voice-based trigger in the library may be determined. Based on the one or more responsive actions, current responsive action(s) may be performed by the client device. Feedback associated with performance of the current responsive action(s) may be received from the user and used to alter a strength of an association between the selected voice-based trigger and the one or more responsive actions, wherein the altering includes incrementing or decrementing a count corresponding to the strength of association based on the feedback being positive or negative respectively.
-
公开(公告)号:US20190122657A1
公开(公告)日:2019-04-25
申请号:US15372188
申请日:2016-12-07
Applicant: Google Inc.
Inventor: Barnaby James , Bo Wang , Sunil Vemuri , David Schairer , Ulas Kirazci , Ertan Dogrultan , Petar Aleksic
Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.
-
公开(公告)号:US10224031B2
公开(公告)日:2019-03-05
申请号:US15394872
申请日:2016-12-30
Applicant: Google Inc.
Inventor: Ulas Kirazci , Bo Wang , Steve Chen , Sunil Vemuri , Barnaby James , Valerie Nygaard
Abstract: Some implementations are directed to selective invocation of a particular third-party (3P) agent by an automated assistant to achieve an intended action determined by the automated assistant during a dynamic dialog between the automated assistant and a user. In some of those implementations, the particular 3P agent is invoked with value(s) for parameter(s) that are determined during the dynamic dialog; and/or the particular 3P agent is selected, from a plurality of candidate 3P agents, for invocation based on the determined value(s) for the parameter(s) and/or based on other criteria. In some of those implementations, the automated assistant invokes the particular 3P agent by transmitting, to the particular 3P agent, a 3P invocation request that includes the determined value(s) for the parameter(s).
-
-
-
-
-
-