-
Publication No.: US12277259B2
Publication date: 2025-04-15
Application No.: US18375876
Filing date: 2023-10-02
Applicant: GOOGLE LLC
Inventor: Tuan Nguyen, Kenneth Mixter, Yuan Yuan
Abstract: Methods, apparatus, systems, and computer-readable media are provided for generating and/or adapting automated assistant content according to a distance of a user relative to an automated assistant interface that renders the automated assistant content. For instance, the automated assistant can provide data for a client device to render. The client device can request additional data when the user relocates closer to, or further from, the client device. In some implementations, a request for additional data can identify a distance between the user and the client device. In this way, the additional data can be generated or selected according to the distance in the request. Other implementations can allow an automated assistant to determine an active user from a group of users in an environment, and determine a distance between the active user and the client device in order that any rendered content can be tailored for the active user.
-
Publication No.: US11842737B2
Publication date: 2023-12-12
Application No.: US17211409
Filing date: 2021-03-24
Applicant: Google LLC
Inventor: Tuan Nguyen, Yuan Yuan
CPC classification number: G10L15/24, G06F3/013, G06F18/251, G06N3/04, G10L15/16, G10L2015/228
Abstract: Techniques are described herein for detecting and/or enrolling (or commissioning) new “hot commands” that are usable to cause an automated assistant to perform responsive action(s) without having to be first explicitly invoked. In various implementations, an automated assistant may be transitioned from a limited listening state into a full speech recognition state in response to a trigger event. While in the full speech recognition state, the automated assistant may receive and perform speech recognition processing on a spoken command from a user to generate a textual command. The textual command may be determined to satisfy a frequency threshold in a corpus of textual commands. Consequently, data indicative of the textual command may be enrolled as a hot command. Subsequent utterance of another textual command that is semantically consistent with the textual command may trigger performance of a responsive action by the automated assistant, without requiring explicit invocation.
-
Publication No.: US20230230583A1
Publication date: 2023-07-20
Application No.: US17579131
Filing date: 2022-01-19
Applicant: GOOGLE LLC
Inventor: Tuan Nguyen, Gabriel Leblanc, Tzu-Chan Chuang, Qiong Huang, William A. Truong, Yixing Cai, Alexey Galata, Yuan Yuan
IPC: G10L15/08, G10L15/065, G10L15/22, G06V40/16, G06F21/32
CPC classification number: G10L15/083, G10L15/065, G10L15/22, G06V40/161, G06F21/32, G10L2015/088, G10L2015/0636
Abstract: Hot word free adaptation of one or more function(s) of an automated assistant, responsive to determining, based on gaze measure(s) and/or active speech measure(s), that a user is engaging with the automated assistant. Implementations relate to various techniques for mitigating false positive and/or false negative occurrences of hot word free adaptation through utilization of personalized parameter(s) for at least some user(s) of an assistant device. The personalized parameter(s) are utilized in determining whether condition(s) are satisfied, where those condition(s), if satisfied, indicate that the user is engaging in hot word free interaction with the automated assistant and result in adaptation of function(s) of the automated assistant.
-
Publication No.: US11688417B2
Publication date: 2023-06-27
Application No.: US16622771
Filing date: 2019-05-02
Applicant: Google LLC
Inventor: Jaclyn Konzelmann, Kenneth Mixter, Sourish Chaudhuri, Tuan Nguyen, Hideaki Matsui, Caroline Pantofaru, Vinay Bettadapura
Abstract: Hot-word free adaptation of one or more function(s) of an automated assistant. Sensor data, from one or more sensor components of an assistant device that provides an automated assistant interface (graphical and/or audible), is processed to determine occurrence and/or confidence metric(s) of various attributes of a user that is proximal to the assistant device. Whether to adapt each of one or more of the function(s) of the automated assistant is based on the occurrence and/or the confidence of one or more of the various attributes. For example, certain processing of at least some of the sensor data can be initiated, such as initiating previously dormant local processing of at least some of the sensor data and/or initiating transmission of at least some of the audio data to remote automated assistant component(s).
-
Publication No.: US10890969B2
Publication date: 2021-01-12
Application No.: US16606529
Filing date: 2018-05-04
Applicant: Google LLC
Inventor: Yuan Yuan, Kenneth Mixter, Tuan Nguyen
Abstract: Invoking one or more previously dormant functions of an automated assistant in response to detecting, based on processing of vision data from one or more vision components: (1) a particular gesture (e.g., of one or more “invocation gestures”) of a user; and/or (2) detecting that a gaze of the user is directed at an assistant device that provides an automated assistant interface (graphical and/or audible) of the automated assistant. For example, the previously dormant function(s) can be invoked in response to detecting the particular gesture, detecting that the gaze of the user is directed at an assistant device for at least a threshold amount of time, and optionally that the particular gesture and the directed gaze of the user co-occur or occur within a threshold temporal proximity of one another.
-
Publication No.: US20200167597A1
Publication date: 2020-05-28
Application No.: US16618532
Filing date: 2018-05-04
Applicant: Google LLC
Inventor: Tuan Nguyen, Kenneth Mixter, Yuan Yuan
Abstract: Methods, apparatus, systems, and computer-readable media are provided for generating and/or adapting automated assistant content according to a distance of a user relative to an automated assistant interface that renders the automated assistant content. For instance, the automated assistant can provide data for a client device to render. The client device can request additional data when the user relocates closer to, or further from, the client device. In some implementations, a request for additional data can identify a distance between the user and the client device. In this way, the additional data can be generated or selected according to the distance in the request. Other implementations can allow an automated assistant to determine an active user from a group of users in an environment, and determine a distance between the active user and the client device in order that any rendered content can be tailored for the active user.
-
Publication No.: US12190848B2
Publication date: 2025-01-07
Application No.: US18500795
Filing date: 2023-11-02
Applicant: Google LLC
Inventor: Andrew Fergus Simpson, Ying Zhang, Tuan Nguyen, Ryan Ki Sing Chung, Christopher Joseph Findeisen, Chintan Trehan, Rajat Kumar Paharia
Abstract: Systems and methods for adjusting light emitted from a display of a device are provided. The adjusting includes obtaining, from light of an environment detected by at least one sensor, a measured color of light of the environment, and obtaining, from light of the environment detected by at least one sensor, a measured brightness of light of the environment. In response to the obtaining the measured color and the measured brightness of light, a color of light emitted from the display is adjusted from an initial color to a target color. A brightness of light emitted from the display is adjusted from an initial brightness to a target brightness.
-
Publication No.: US20250005293A1
Publication date: 2025-01-02
Application No.: US18217313
Filing date: 2023-06-30
Applicant: GOOGLE LLC
Inventor: Tuan Nguyen, Sergei Volnov, William A. Truong, Yunfan Ye, Sana Mithani, Neel Joshi, Alexey Galata, Tzu-Chan Chuang, Liang-yu Chen, Qiong Huang, Krunal Shah, Sai Aditya Chitturu
Abstract: Implementations relate to leveraging large language model(s) (LLMs) and vision language model(s) (VLMs) to facilitate human-to-computer dialogs. In various implementations, one or more digital images may be processed using one or more VLMs to generate VLM output indicative of a state of an environment. An LLM prompt may be assembled based on the VLM output and a natural language input. The LLM prompt may be processed using one or more LLMs to generate content that is responsive to the natural language input. The content that is responsive to the natural language input may subsequently be rendered at one or more output devices.
-
Publication No.: US20240055003A1
Publication date: 2024-02-15
Application No.: US18383314
Filing date: 2023-10-24
Applicant: GOOGLE LLC
Inventor: Tuan Nguyen, Yuan Yuan
CPC classification number: G10L15/24, G06N3/04, G06F3/013, G10L15/16, G06F18/251, G10L2015/228
Abstract: Techniques are described herein for detecting and/or enrolling (or commissioning) new “hot commands” that are useable to cause an automated assistant to perform responsive action(s) without having to be first explicitly invoked. In various implementations, an automated assistant may be transitioned from a limited listening state into a full speech recognition state in response to a trigger event. While in the full speech recognition state, the automated assistant may receive and perform speech recognition processing on a spoken command from a user to generate a textual command. The textual command may be determined to satisfy a frequency threshold in a corpus of textual commands. Consequently, data indicative of the textual command may be enrolled as a hot command. Subsequent utterance of another textual command that is semantically consistent with the textual command may trigger performance of a responsive action by the automated assistant, without requiring explicit invocation.
-
Publication No.: US11785295B2
Publication date: 2023-10-10
Application No.: US17588467
Filing date: 2022-01-31
Applicant: GOOGLE LLC
Inventor: Jaclyn Konzelmann, Tuan Nguyen, Vinay Bettadapura, Andrew Gallagher, Utsav Prabhu, Caroline Pantofaru
IPC: H04N21/442, G06T7/70, H04N21/258, H04N21/41, H04W12/64
CPC classification number: H04N21/44218, G06T7/70, H04N21/25875, H04N21/25891, H04N21/4126, H04W12/64, G06T2207/30196
Abstract: Implementations relate to an automated assistant that provides and manages output from one or more elements of output hardware of a computing device. The automated assistant manages dynamic adjustment of access permissions to the computing device according to, for example, a detected presence of one or more users. An active-user queue can be established each time a unique user enters a viewing window of a camera of the computing device when, up to that point, no user was considered active. Multiple image frames can be captured via the camera and processed to determine whether an initial user remains in the viewing window and/or whether another user has entered the viewing window. The initial user can be considered active as long as they are exclusively detected in the viewing window. Restricted content associated with the user may be rendered by the computing device whilst the user is active.
-