User authentication, for assistant action, using data from other device(s) in a shared environment

    Publication No.: US12154576B2

    Publication date: 2024-11-26

    Application No.: US17573431

    Application date: 2022-01-11

    Applicant: GOOGLE LLC

    Abstract: Implementations set forth herein relate to an automated assistant that can solicit other devices for data that can assist with user authentication. User authentication can be streamlined for certain requests by removing a requirement that all authentication be performed at a single device and/or by a single application. For instance, the automated assistant can rely on data from other devices, which can indicate a degree to which a user is predicted to be present at a location of an assistant-enabled device. The automated assistant can process this data to make a determination regarding whether the user should be authenticated in response to an assistant input and/or pre-emptively before the user provides an assistant input. In some implementations, the automated assistant can perform one or more factors of authentication and utilize the data to verify the user in lieu of performing one or more other factors of authentication.
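    The idea of using presence data from other devices in lieu of a second authentication factor can be sketched roughly as follows. This is a minimal illustration, not the patented method: the `PresenceSignal` type, the noisy-OR combination rule, the 0.9 threshold, and the factor names are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class PresenceSignal:
    """Hypothetical report from another device in the shared environment."""
    device_id: str
    user_id: str
    presence_score: float  # assumed to lie in [0, 1]


def predicted_presence(signals, user_id):
    """Combine per-device scores into one presence estimate for `user_id`.

    Assumption: scores are treated as independent probabilities and merged
    with a noisy-OR; the abstract does not say how the data is combined.
    """
    miss = 1.0
    for s in signals:
        if s.user_id == user_id:
            miss *= 1.0 - s.presence_score
    return 1.0 - miss


def factors_required(signals, user_id, threshold=0.9):
    """Return the authentication factors still needed for this request."""
    if predicted_presence(signals, user_id) >= threshold:
        # Presence data stands in for the second factor.
        return ["voice_match"]
    return ["voice_match", "pin"]
```

    With two devices reporting scores of 0.8 and 0.7 for the same user, the combined estimate exceeds the assumed threshold and only the first factor remains.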

Accelerometer-based endpointing measure(s) and/or gaze-based endpointing measure(s) for speech processing

    Publication No.: US12154561B2

    Publication date: 2024-11-26

    Application No.: US17554280

    Application date: 2021-12-17

    Applicant: GOOGLE LLC

    Abstract: An overall endpointing measure can be generated based on an audio-based endpointing measure and (1) an accelerometer-based endpointing measure and/or (2) a gaze-based endpointing measure. The overall endpointing measure can be used in determining whether a candidate endpoint is an actual endpoint. Various implementations include generating the audio-based endpointing measure by processing an audio data stream, capturing a spoken utterance of a user, using an audio model. Various implementations additionally or alternatively include generating the accelerometer-based endpointing measure by processing a stream of accelerometer data using an accelerometer model. Various implementations additionally or alternatively include processing an image data stream using a gaze model to generate the gaze-based endpointing measure.
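    One way to picture the overall endpointing measure is as a weighted combination of whichever per-modality measures are available. The sketch below assumes each measure is already a score in [0, 1] produced by its model; the weights, the weighted-average rule, and the 0.5 threshold are illustrative assumptions, not details from the abstract.

```python
def overall_endpointing_measure(audio, accel=None, gaze=None,
                                weights=(0.6, 0.2, 0.2)):
    """Weighted average over the measures that are present.

    `audio` is required; `accel` and `gaze` are optional, mirroring the
    "and/or" wording. The weights are hypothetical.
    """
    pairs = [(audio, weights[0])]
    if accel is not None:
        pairs.append((accel, weights[1]))
    if gaze is not None:
        pairs.append((gaze, weights[2]))
    total_weight = sum(w for _, w in pairs)
    return sum(m * w for m, w in pairs) / total_weight


def is_actual_endpoint(audio, accel=None, gaze=None, threshold=0.5):
    """Decide whether a candidate endpoint is treated as an actual endpoint."""
    return overall_endpointing_measure(audio, accel, gaze) >= threshold
```

    Under these assumptions, an ambiguous audio score of 0.4 alone does not trigger an endpoint, but the same score combined with strong accelerometer and gaze scores does.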

Generating and updating a custom automated assistant based on a domain-specific resource

    Publication No.: US20240386886A1

    Publication date: 2024-11-21

    Application No.: US18197573

    Application date: 2023-05-15

    Applicant: GOOGLE LLC

    Abstract: Implementations herein relate to customizing an automated assistant using domain-specific resources. One or more resources are processed to generate a natural language representation of the contents of the resources. The natural language representation is utilized to customize an automated assistant for interactions with a user. Various implementations include priming and fine-tuning large language models that are utilized to implement the automated assistant. Various implementations are directed to biasing speech recognition based on terms identified in the resources. Various implementations are directed to customizing the tone of the automated assistant based on information included in the resources.

    Handling contradictory queries on a shared device

    Publication No.: US12147470B2

    Publication date: 2024-11-19

    Application No.: US17938455

    Application date: 2022-10-06

    Applicant: Google LLC

    Abstract: A method for handling contradictory queries on a shared device includes receiving a first query issued by a first user, the first query specifying a first long-standing operation for a digital assistant to perform, and while the digital assistant is performing the first long-standing operation, receiving a second query, the second query specifying a second long-standing operation for the digital assistant to perform. The method also includes determining that the second query was issued by another user different than the first user and determining, using a query resolver, that performing the second long-standing operation would conflict with the first long-standing operation. The method further includes identifying one or more compromise operations for the digital assistant to perform, and instructing the digital assistant to perform a selected compromise operation among the identified one or more compromise operations.
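    The flow described in this abstract (detect a different user, check for a conflict, pick a compromise) might look something like the sketch below. The `Query` fields, the conflict test, and the midpoint compromise are hypothetical stand-ins chosen for illustration; the abstract does not define how the query resolver or the compromise selection actually work.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Query:
    user: str
    operation: str  # long-standing operation, e.g. "set_volume"
    target: str     # hypothetical field: the shared resource it occupies
    value: int      # hypothetical field: the requested setting


def conflicts(active, incoming):
    """Stand-in for the query resolver: same resource, different setting."""
    return active.target == incoming.target and active.value != incoming.value


def compromise_operations(active, incoming):
    """Candidate compromises; the midpoint rule is an illustrative assumption."""
    midpoint = (active.value + incoming.value) // 2
    return [
        Query("shared", active.operation, active.target, midpoint),
        active,  # fallback: leave the first operation untouched
    ]


def resolve(active, incoming):
    """Return the operation the assistant should perform next."""
    if incoming.user == active.user or not conflicts(active, incoming):
        return incoming
    # Different user and conflicting request: select the first compromise.
    return compromise_operations(active, incoming)[0]
```

    For example, if one user has the speaker at volume 8 and a second user asks for volume 2, this sketch settles on volume 5, while a follow-up request from the original user is simply honored.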

    Multimodal intent understanding for automated assistant

    Publication No.: US12094454B2

    Publication date: 2024-09-17

    Application No.: US17568920

    Application date: 2022-01-05

    Applicant: GOOGLE LLC

    CPC classification number: G10L15/08 G06N20/00 G10L15/22 G10L2021/02163

    Abstract: Implementations described herein include detecting a stream of audio data that captures a spoken utterance of the user and that captures ambient noise occurring within a threshold time period of the spoken utterance being spoken by the user. Implementations further include processing a portion of the audio data that includes the ambient noise to determine ambient noise classification(s), processing a portion of the audio data that includes the spoken utterance to generate a transcription, processing both the transcription and the ambient noise classification(s) with a machine learning model to generate a user intent and parameter(s) for the user intent, and performing one or more automated assistant actions based on the user intent and using the parameter(s).
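    To illustrate the data flow only, the sketch below shows how an ambient noise classification can disambiguate an otherwise underspecified transcription. A hand-written rule table stands in for the machine learning model, and the intent names, noise labels, and parameters are all hypothetical; nothing here reflects the actual model described in the abstract.

```python
def infer_intent(transcription, noise_classes):
    """Map (transcription, ambient noise classes) to (intent, parameters).

    Placeholder for the machine learning model: a few fixed rules that show
    how the noise classification supplies the missing parameter.
    """
    text = transcription.lower()
    if "turn it down" in text:
        if "music" in noise_classes:
            return "lower_volume", {"target": "music_player"}
        if "television" in noise_classes:
            return "lower_volume", {"target": "tv"}
        # No ambient cue: the target remains unresolved.
        return "lower_volume", {}
    return "unknown", {}


def handle_utterance(transcription, noise_classes):
    """Produce a description of the assistant action to perform."""
    intent, params = infer_intent(transcription, noise_classes)
    return {"action": intent, "parameters": params}
```

    Here the utterance "Turn it down" resolves to a different target depending on whether music or television audio was detected around the time it was spoken.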

    Voice filtering other speakers from calls and audio messages

    Publication No.: US12087297B2

    Publication date: 2024-09-10

    Application No.: US17930822

    Application date: 2022-09-09

    Applicant: Google LLC

    Abstract: A method includes receiving a first instance of raw audio data corresponding to a voice-based command and receiving a second instance of the raw audio data corresponding to an utterance of audible contents for an audio-based communication spoken by a user. When a voice filtering recognition routine determines to activate voice filtering for at least the voice of the user, the method also includes obtaining a respective speaker embedding of the user and processing, using the respective speaker embedding, the second instance of the raw audio data to generate enhanced audio data for the audio-based communication that isolates the utterance of the audible contents spoken by the user and excludes at least a portion of the one or more additional sounds that are not spoken by the user. The method also includes executing.
