-
公开(公告)号:US12272356B2
公开(公告)日:2025-04-08
申请号:US17842577
申请日:2022-06-16
Applicant: Google LLC
Inventor: Diego Melendo Casado , Jaclyn Konzelmann
Abstract: Techniques are described herein for enabling the use of “dynamic” or “context-specific” hot words for an automated assistant. In various implementations, an automated assistant may be operated at least in part on a computing device. Audio data captured by a microphone may be monitored for default hot word(s). Detection of one or more of the default hot words may trigger transition of the automated assistant from a limited hot word listening state into a speech recognition state. Transition of the computing device into a given state may be detected, and in response, the audio data captured by the microphone may be monitored for context-specific hot word(s), in addition to or instead of the default hot word(s). Detection of the context-specific hot word(s) may trigger the automated assistant to perform a responsive action associated with the given state, without requiring detection of default hot word(s).
-
公开(公告)号:US11785295B2
公开(公告)日:2023-10-10
申请号:US17588467
申请日:2022-01-31
Applicant: GOOGLE LLC
Inventor: Jaclyn Konzelmann , Tuan Nguyen , Vinay Bettadapura , Andrew Gallagher , Utsav Prabhu , Caroline Pantofaru
IPC: H04N21/442 , G06T7/70 , H04N21/258 , H04N21/41 , H04W12/64
CPC classification number: H04N21/44218 , G06T7/70 , H04N21/25875 , H04N21/25891 , H04N21/4126 , H04W12/64 , G06T2207/30196
Abstract: Implementations relate to an automated assistant that provides and manages output from one or more elements of output hardware of a computing device. The automated assistant manages dynamic adjustment of access permissions to the computing device according to, for example, a detected presence of one or more users. An active-user queue can be established each time a unique user enters a viewing window of a camera of the computing device when, up to that point, no user was considered active. Multiple image frames can be captured via the camera and processed to determine whether an initial user remains in the viewing window and/or whether another user has entered the viewing window. The initial user can be considered active as long as they are exclusively detected in the viewing window. Restricted content associated with the user may be rendered by the computing device whilst the user is active.
-
公开(公告)号:US20230316814A1
公开(公告)日:2023-10-05
申请号:US18204785
申请日:2023-06-01
Applicant: GOOGLE LLC
Inventor: Diego Melendo Casado , Tuan Nguyen , Jaclyn Konzelmann , Gustavo Moura , Tanya Kraljic
CPC classification number: G06V40/50 , G10L17/04 , G10L17/00 , G06V40/70 , G06V40/161
Abstract: Techniques are described herein for dialog-based enrollment of individual users for single- and/or multi-modal recognition by an automated assistant, as well as determining how to respond to a particular user's request based on the particular user being enrolled and/or recognized. Rather than requiring operation of a graphical user interface for individual enrollment, dialog-based enrollment enables users to enroll themselves (or others) by way of a human-to-computer dialog with the automated assistant.
-
公开(公告)号:US20220148339A1
公开(公告)日:2022-05-12
申请号:US17580334
申请日:2022-01-20
Applicant: Google LLC
Inventor: Diego Melendo Casado , Tuan Nguyen , Jaclyn Konzelmann , Gustavo Moura , Tanya Kraljic
Abstract: Techniques are described herein for dialog-based enrollment of individual users for single- and/or multi-modal recognition by an automated assistant, as well as determining how to respond to a particular user's request based on the particular user being enrolled and/or recognized. Rather than requiring operation of a graphical user interface for individual enrollment, dialog-based enrollment enables users to enroll themselves (or others) by way of a human-to-computer dialog with the automated assistant.
-
公开(公告)号:US20200349966A1
公开(公告)日:2020-11-05
申请号:US16622771
申请日:2019-05-02
Applicant: Google LLC
Inventor: Jaclyn Konzelmann , Kenneth Mixter , Sourish Chaudhuri , Tuan Nguyen , Hideaki Matsui , Caroline Pantofaru , Vinay Bettadapura
Abstract: Hot-word free adaptation of one or more function(s) of an automated assistant. Sensor data, from one or more sensor components of an assistant device that provides an automated assistant interface (graphical and/or audible), is processed to determine occurrence and/or confidence metric(s) of various attributes of a user that is proximal to the assistant device. Whether to adapt each of one or more of the function(s) of the automated assistant is based on the occurrence and/or the confidence of one or more of the various attributes. For example, certain processing of at least some of the sensor data can be initiated, such as initiating previously dormant local processing of at least some of the sensor data and/or initiating transmission of at least some of the audio data to remote automated assistant component(s).
-
公开(公告)号:US20230410803A1
公开(公告)日:2023-12-21
申请号:US18228948
申请日:2023-08-01
Applicant: GOOGLE LLC
Inventor: Lior Alon , Rafael Goldfarb , Dekel Auster , Dan Rasin , Michael Andrew Goodman , Trevor Strohman , Nino Tasca , Valerie Nygaard , Jaclyn Konzelmann
CPC classification number: G10L15/22 , G06F3/167 , G10L2015/223 , G10L15/1815 , G10L15/285 , G10L15/083
Abstract: Implementations described herein relate to reducing latency in automated assistant interactions. In some implementations, a client device can receive audio data that captures a spoken utterance of a user. The audio data can be processed to determine an assistant command to be performed by an automated assistant. The assistant command can be processed, using a latency prediction model, to generate a predicted latency to fulfill the assistant command. Further, the client device (or the automated assistant) can determine, based on the predicted latency, whether to audibly render pre-cached content for presentation to the user prior to audibly rendering content that is responsive to the spoken utterance. The pre-cached content can be tailored to the assistant command and audibly rendered for presentation to the user while the content is being obtained, and the content can be audibly rendered for presentation to the user subsequent to the pre-cached content.
-
公开(公告)号:US11688417B2
公开(公告)日:2023-06-27
申请号:US16622771
申请日:2019-05-02
Applicant: Google LLC
Inventor: Jaclyn Konzelmann , Kenneth Mixter , Sourish Chaudhuri , Tuan Nguyen , Hideaki Matsui , Caroline Pantofaru , Vinay Bettadapura
Abstract: Hot-word free adaptation of one or more function(s) of an automated assistant. Sensor data, from one or more sensor components of an assistant device that provides an automated assistant interface (graphical and/or audible), is processed to determine occurrence and/or confidence metric(s) of various attributes of a user that is proximal to the assistant device. Whether to adapt each of one or more of the function(s) of the automated assistant is based on the occurrence and/or the confidence of one or more of the various attributes. For example, certain processing of at least some of the sensor data can be initiated, such as initiating previously dormant local processing of at least some of the sensor data and/or initiating transmission of at least some of the audio data to remote automated assistant component(s).
-
8.
公开(公告)号:US20210304764A1
公开(公告)日:2021-09-30
申请号:US17346797
申请日:2021-06-14
Applicant: Google LLC
Inventor: Raunaq Shah , Jaclyn Konzelmann , Lisa Takehana , Ruxandra Davies , Adrian Diaconu
Abstract: Implementations set forth herein relate to employing dynamic regulations for governing responsiveness of multiple automated assistant devices, and specifically the responsiveness an automated assistant to a given spoken utterance that has been acknowledged by two or more of the assistant devices. The dynamic regulations can be context-dependent and adapted over time in order that the automated assistant can accommodate assistant interaction preferences that may vary from user to user. For instance, a spoken utterance such as “stop,” may be intended to affect different assistant actions based on a context in which the user provided the spoken utterance. The context can refer to a location of the user relative to other rooms in a home, a time of day, a user providing the spoken utterance, an arrangement of the assistant devices within a home, and/or a state of each device in the home.
-
9.
公开(公告)号:US20200302925A1
公开(公告)日:2020-09-24
申请号:US16343934
申请日:2018-08-23
Applicant: Google LLC
Inventor: Raunaq Shah , Jaclyn Konzelmann , Lisa Takehana , Ruxandra Davies , Adrian Diaconu
Abstract: Implementations set forth herein relate to employing dynamic regulations for governing responsiveness of multiple automated assistant devices, and specifically the responsiveness an automated assistant to a given spoken utterance that has been acknowledged by two or more of the assistant devices. The dynamic regulations can be context-dependent and adapted over time in order that the automated assistant can accommodate assistant interaction preferences that may vary from user to user. For instance, a spoken utterance such as “stop,” may be intended to affect different assistant actions based on a context in which the user provided the spoken utterance. The context can refer to a location of the user relative to other rooms in a home, a time of day, a user providing the spoken utterance, an arrangement of the assistant devices within a home, and/or a state of each device in the home.
-
公开(公告)号:US20250126323A1
公开(公告)日:2025-04-17
申请号:US18982816
申请日:2024-12-16
Applicant: GOOGLE LLC
Inventor: Jaclyn Konzelmann , Tuan Nguyen , Vinay Bettadapura , Andrew Gallagher , Utsav Prabhu , Caroline Pantofaru
IPC: H04N21/442 , G06T7/70 , H04N21/258 , H04N21/41 , H04W12/64
Abstract: Implementations relate to an automated assistant that provides and manages output from one or more elements of output hardware of a computing device. The automated assistant manages dynamic adjustment of access permissions to the computing device according to, for example, a detected presence of one or more users. An active-user queue can be established each time a unique user enters a viewing window of a camera of the computing device when, up to that point, no user was considered active. Multiple image frames can be captured via the camera and processed to determine whether an initial user remains in the viewing window and/or whether another user has entered the viewing window. The initial user can be considered active as long as they are exclusively detected in the viewing window. Restricted content associated with the user may be rendered by the computing device whilst the user is active.
-
-
-
-
-
-
-
-
-