Hotword-Based Speaker Recognition
    Invention publication

    Publication number: US20230145324A1

    Publication date: 2023-05-11

    Application number: US18149255

    Filing date: 2023-01-03

    Applicant: Google LLC

    Inventor: Matthew Sharifi

    CPC classification number: G10L17/00 G10L17/24 G10L15/22 G10L2015/088

    Abstract: Systems, methods performed by data processing apparatus, and computer storage media encoded with computer programs, for: receiving an utterance from a user in a multi-user environment, each user having an associated set of available resources; determining that the received utterance includes at least one predetermined word; comparing speaker identification features of the uttered predetermined word with speaker identification features of each of a plurality of previous utterances of the predetermined word, the plurality of previous predetermined word utterances corresponding to different known users in the multi-user environment; attempting to identify the user associated with the uttered predetermined word as matching one of the known users in the multi-user environment; and, based on a result of the attempt to identify, selectively providing the user with access to one or more resources associated with a corresponding known user.
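
    The comparison step in this abstract can be illustrated with a minimal sketch. It assumes each hotword utterance has already been reduced to a fixed-length feature vector, and it uses cosine similarity with a fixed threshold; the abstract names neither, so the metric, the threshold value, and all function names here are hypothetical.

```python
import math
from typing import Dict, List, Optional, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def identify_speaker(
    features: Sequence[float],
    enrolled: Dict[str, Sequence[float]],
    threshold: float = 0.8,  # assumed value, not taken from the patent
) -> Optional[str]:
    """Return the known user whose enrolled hotword features match best.

    Returns None when no enrolled user reaches the threshold, i.e. the
    attempt to identify the speaker fails.
    """
    best_user: Optional[str] = None
    best_score = threshold
    for user, enrolled_features in enrolled.items():
        score = cosine_similarity(features, enrolled_features)
        if score >= best_score:
            best_user, best_score = user, score
    return best_user


def resources_for(
    features: Sequence[float],
    enrolled: Dict[str, Sequence[float]],
    resources: Dict[str, List[str]],
    threshold: float = 0.8,
) -> List[str]:
    """Selectively expose the matched user's resources; nothing on a failed match."""
    user = identify_speaker(features, enrolled, threshold)
    if user is None:
        return []
    return resources.get(user, [])
```

    Returning an empty list on a failed match is one possible reading of "selectively providing" access; a real system might instead fall back to a guest profile.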

    Distilling to a Target Device Based on Observed Query Patterns

    Publication number: US20230111618A1

    Publication date: 2023-04-13

    Application number: US17644427

    Filing date: 2021-12-15

    Applicant: Google LLC

    Abstract: A method includes receiving user queries directed toward a cloud-based assistant service. For each received user query directed toward the cloud-based assistant service, the method also includes extracting one or more attributes from the user query and logging the user query into one or more of a plurality of category buckets based on the one or more attributes extracted from the user query. The method also includes determining when at least one of the plurality of category buckets includes a threshold number of the user queries logged into the at least one category bucket, and when the at least one of the plurality of category buckets includes the threshold number of the user queries, generating a distilled model of the cloud-based assistant service. The distilled model of the cloud-based assistant service is configured to execute on one or more target client devices.
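
    The bucket-and-threshold logic in this abstract can be sketched as follows. The sketch assumes attributes arrive as name/value pairs and that each pair defines one category bucket; the class name, the attribute format, and the "fire once when the count first reaches the threshold" behavior are illustrative assumptions. Model distillation itself is not shown; the method only reports which buckets would trigger it.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Bucket = Tuple[str, str]  # hypothetical bucket key: (attribute name, attribute value)


class QueryBucketLogger:
    """Logs queries into category buckets and reports buckets that hit a threshold."""

    def __init__(self, threshold: int) -> None:
        self.threshold = threshold
        self.buckets: Dict[Bucket, List[str]] = defaultdict(list)

    def log(self, query: str, attributes: Dict[str, str]) -> List[Bucket]:
        """Log one query under every bucket its attributes map to.

        Returns the buckets whose size reached the threshold with this query,
        which is the point where a distilled model would be generated.
        """
        ready: List[Bucket] = []
        for name, value in attributes.items():
            bucket = (name, value)
            self.buckets[bucket].append(query)
            # Fire exactly once, when the count first equals the threshold.
            if len(self.buckets[bucket]) == self.threshold:
                ready.append(bucket)
        return ready
```

    For example, with a threshold of 2, the second query logged with the attribute `domain=music` would return `[("domain", "music")]`, and later queries in the same bucket would return nothing further.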

    Automated assistant for facilitating communications through dissimilar messaging features of different applications

    Publication number: US11568870B2

    Publication date: 2023-01-31

    Application number: US17110046

    Filing date: 2020-12-02

    Applicant: Google LLC

    Abstract: Implementations relate to an automated assistant that can respond to communications received via a third party application and/or other third party communication modality. The automated assistant can determine that the user is participating in multiple different conversations via multiple different third party communication services. In some implementations, conversations can be processed to identify particular features of the conversations. When the automated assistant is invoked to provide input to a conversation, the automated assistant can compare the input to the identified conversation features in order to select the particular conversation that is most relevant to the input. In this way, the automated assistant can assist with any of multiple disparate conversations that are each occurring via a different third party application.
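
    As a rough illustration of comparing an input against conversation features, the sketch below assumes each conversation is summarized as a set of keywords and scores relevance by Jaccard overlap with the input's tokens. The abstract does not say how features are represented or compared, so the keyword representation, the metric, and the function names are all hypothetical.

```python
import re
from typing import Dict, Optional, Set


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and split it into a set of word tokens."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


def select_conversation(
    user_input: str, conversations: Dict[str, Set[str]]
) -> Optional[str]:
    """Pick the conversation whose features overlap most with the input.

    `conversations` maps a conversation id (for example, the third-party
    application it lives in) to its previously identified keyword features.
    Returns None when nothing overlaps at all.
    """
    tokens = tokenize(user_input)
    best_id: Optional[str] = None
    best_score = 0.0
    for conv_id, features in conversations.items():
        score = jaccard(tokens, features)
        if score > best_score:
            best_id, best_score = conv_id, score
    return best_id
```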

    Voice Filtering Other Speakers From Calls And Audio Messages

    Publication number: US20230005480A1

    Publication date: 2023-01-05

    Application number: US17930822

    Filing date: 2022-09-09

    Applicant: Google LLC

    Abstract: A method includes receiving a first instance of raw audio data corresponding to a voice-based command and receiving a second instance of the raw audio data corresponding to an utterance of audible contents for an audio-based communication spoken by a user. When a voice filtering recognition routine determines to activate voice filtering for at least the voice of the user, the method also includes obtaining a respective speaker embedding of the user and processing, using the respective speaker embedding, the second instance of the raw audio data to generate enhanced audio data for the audio-based communication that isolates the utterance of the audible contents spoken by the user and excludes at least a portion of the one or more additional sounds that are not spoken by the user. The method also includes executing.

    Automated assistant adaptation of a response to an utterance and/or of processing of the utterance, based on determined interaction measure

    Publication number: US20220392449A1

    Publication date: 2022-12-08

    Application number: US17892803

    Filing date: 2022-08-22

    Applicant: Google LLC

    Abstract: Implementations set forth herein relate to an automated assistant that provides a response for certain user queries based on a level of interaction of the user with respect to the automated assistant. Interaction can be characterized by sensor data, which can be processed using one or more trained machine learning models in order to identify parameters for generating a response. In this way, the response can be limited to preserve computational resources and/or ensure that the response is more readily understood given the amount of interaction exhibited by the user. In some instances, a response that embodies information that is supplemental to an otherwise suitable response can be provided when a user is exhibiting a particular level of interaction. In other instances, such supplemental information can be withheld when the user is not exhibiting that particular level of interaction, at least in order to preserve computational resources.

    Proximity-Based Controls On A Second Device

    Publication number: US20220342537A1

    Publication date: 2022-10-27

    Application number: US17811973

    Filing date: 2022-07-12

    Applicant: Google LLC

    Abstract: A method includes obtaining proximity information for each of a plurality of assistant-enabled devices within an environment of a user device. Each assistant-enabled device is controllable by an assistant application to perform a respective set of available actions associated with the assistant-enabled device. For each assistant-enabled device, the method also includes determining a proximity score based on the proximity information indicating a proximity estimation of the corresponding assistant-enabled device relative to the user device. The method further includes generating, using the proximity scores determined for the assistant-enabled devices, a ranked list of candidate assistant-enabled devices, and for each corresponding assistant-enabled device in the ranked list, displaying, in a graphical user interface (GUI), a respective set of controls for performing the respective set of actions associated with the corresponding assistant-enabled device.
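
    The score-then-rank flow in this abstract can be sketched as below. The sketch assumes the proximity information is an estimated distance in meters and maps it to a score with 1 / (1 + distance); the abstract specifies neither the signal nor the scoring function, so both are assumptions, as are the `Device` type and function names. Rendering in a GUI is reduced to returning the ordered list of controls.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Device:
    """Hypothetical record for one assistant-enabled device."""
    name: str
    distance_m: float  # assumed proximity information: estimated distance in meters
    actions: List[str] = field(default_factory=list)


def proximity_score(distance_m: float) -> float:
    """Map a distance to a score in (0, 1]; closer devices score higher."""
    return 1.0 / (1.0 + max(distance_m, 0.0))


def ranked_controls(devices: List[Device]) -> List[Tuple[str, List[str]]]:
    """Rank devices by proximity score and pair each with its set of controls.

    The returned order is the order in which a GUI would display the
    per-device controls, nearest device first.
    """
    ranked = sorted(
        devices, key=lambda d: proximity_score(d.distance_m), reverse=True
    )
    return [(d.name, list(d.actions)) for d in ranked]
```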

    Automated assistant adaptation of a response to an utterance and/or of processing of the utterance, based on determined interaction measure

    Publication number: US11455996B2

    Publication date: 2022-09-27

    Application number: US16947513

    Filing date: 2020-08-04

    Applicant: Google LLC

    Abstract: Implementations set forth herein relate to an automated assistant that provides a response for certain user queries based on a level of interaction of the user with respect to the automated assistant. Interaction can be characterized by sensor data, which can be processed using one or more trained machine learning models in order to identify parameters for generating a response. In this way, the response can be limited to preserve computational resources and/or ensure that the response is more readily understood given the amount of interaction exhibited by the user. In some instances, a response that embodies information that is supplemental to an otherwise suitable response can be provided when a user is exhibiting a particular level of interaction. In other instances, such supplemental information can be withheld when the user is not exhibiting that particular level of interaction, at least in order to preserve computational resources.

    Inferring assistant action(s) based on ambient sensing by assistant device(s)

    Publication number: US20220272055A1

    Publication date: 2022-08-25

    Application number: US17185611

    Filing date: 2021-02-25

    Applicant: Google LLC

    Abstract: Implementations can determine an ambient state that reflects a state of a user and/or an environment of the user based on an instance of sensor data. The ambient state can be processed, using an ambient sensing machine learning (ML) model, to generate suggested action(s) that are suggested to be performed, on behalf of the user, by an automated assistant. In some implementations, a corresponding representation of the suggested action(s) can be provided for presentation to the user, and the suggested action(s) can be performed by the automated assistant in response to a user selection of the suggested action(s). In additional or alternative implementations, the suggested action(s) can be automatically performed by the automated assistant. Implementations can additionally or alternatively generate training instances for training the ambient sensing ML model based on interactions with the automated assistant.
