Generating and/or adapting automated assistant content according to a distance between user(s) and an automated assistant interface

    公开(公告)号:US12277259B2

    公开(公告)日:2025-04-15

    申请号:US18375876

    申请日:2023-10-02

    Applicant: GOOGLE LLC

    Abstract: Methods, apparatus, systems, and computer-readable media are provided for generating and/or adapting automated assistant content according to a distance of a user relative to an automated assistant interface that renders the automated assistant content. For instance, the automated assistant can provide data for a client device to render. The client device can request additional data when the user relocates closer to, or further from, the client device. In some implementations, a request for additional data can identify a distance between the user and the client device. In this way, the additional data can be generated or selected according to the distance in the request. Other implementations can allow an automated assistant to determine an active user from a group of users in an environment, and determine a distance between the active user and the client device in order that any rendered content can be tailored for the active user.

    Automated assistant interaction prediction using fusion of visual and audio input

    公开(公告)号:US11842737B2

    公开(公告)日:2023-12-12

    申请号:US17211409

    申请日:2021-03-24

    Applicant: Google LLC

    Abstract: Techniques are described herein for detecting and/or enrolling (or commissioning) new “hot commands” that are usable to cause an automated assistant to perform responsive action(s) without having to be first explicitly invoked. In various implementations, an automated assistant may be transitioned from a limited listening state into a full speech recognition state in response to a trigger event. While in the full speech recognition state, the automated assistant may receive and perform speech recognition processing on a spoken command from a user to generate a textual command. The textual command may be determined to satisfy a frequency threshold in a corpus of textual commands. Consequently, data indicative of the textual command may be enrolled as a hot command. Subsequent utterance of another textual command that is semantically consistent with the textual command may trigger performance of a responsive action by the automated assistant, without requiring explicit invocation.

    Invoking automated assistant function(s) based on detected gesture and gaze

    公开(公告)号:US10890969B2

    公开(公告)日:2021-01-12

    申请号:US16606529

    申请日:2018-05-04

    Applicant: Google LLC

    Abstract: Invoking one or more previously dormant functions of an automated assistant in response to detecting, based on processing of vision data from one or more vision components: (1) a particular gesture (e.g., of one or more “invocation gestures”) of a user; and/or (2) detecting that a gaze of the user is directed at an assistant device that provides an automated assistant interface (graphical and/or audible) of the automated assistant. For example, the previously dormant function(s) can be invoked in response to detecting the particular gesture, detecting that the gaze of the user is directed at an assistant device for at least a threshold amount of time, and optionally that the particular gesture and the directed gaze of the user co-occur or occur within a threshold temporal proximity of one another.

    GENERATING AND/OR ADAPTING AUTOMATED ASSISTANT CONTENT ACCORDING TO A DISTANCE BETWEEN USER(S) AND AN AUTOMATED ASSISTANT INTERFACE

    公开(公告)号:US20200167597A1

    公开(公告)日:2020-05-28

    申请号:US16618532

    申请日:2018-05-04

    Applicant: Google LLC

    Abstract: Methods, apparatus, systems, and computer-readable media are provided for generating and/or adapting automated assistant content according to a distance of a user relative to an automated assistant interface that renders the automated assistant content. For instance, the automated assistant can provide data for a client device to render. The client device can request additional data when the user relocates closer to, or further from, the client device. In some implementations, a request for additional data can identify a distance between the user and the client device. In this way, the additional data can be generated or selected according to the distance in the request. Other implementations can allow an automated assistant to determine an active user from a group of users in an environment, and determine a distance between the active user and the client device in order that any rendered content can be tailored for the active user.

    AUTOMATED ASSISTANT INTERACTION PREDICTION USING FUSION OF VISUAL AND AUDIO INPUT

    公开(公告)号:US20240055003A1

    公开(公告)日:2024-02-15

    申请号:US18383314

    申请日:2023-10-24

    Applicant: GOOGLE LLC

    Abstract: Techniques are described herein for detecting and/or enrolling (or commissioning) new “hot commands” that are useable to cause an automated assistant to perform responsive action(s) without having to be first explicitly invoked. In various implementations, an automated assistant may be transitioned from a limited listening state into a full speech recognition state in response to a trigger event. While in the full speech recognition state, the automated assistant may receive and perform speech recognition processing on a spoken command from a user to generate a textual command. The textual command may be determined to satisfy a frequency threshold in a corpus of textual commands. Consequently, data indicative of the textual command may be enrolled as a hot command. Subsequent utterance of another textual command that is semantically consistent with the textual command may trigger performance of a responsive action by the automated assistant, without requiring explicit invocation.

Patent Agency Ranking