-
公开(公告)号:US11935519B2
公开(公告)日:2024-03-19
申请号:US16949151
申请日:2020-10-15
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
Abstract: Implementations can receive, at a computing device, audio data corresponding to a spoken utterance of a user, process the audio data to generate, for one or more parts of the spoken utterance, a plurality of speech hypotheses, select a given one of the speech hypotheses, cause the given one of the speech hypotheses to be incorporated as a portion of a transcription associated with the software application, and store the plurality of speech hypotheses. In some implementations, the plurality of speech hypotheses can be loaded at an additional computing device when the transcription is accessed at the additional computing device. In additional or alternative implementations, the plurality of speech hypotheses can be loaded into memory of the computing device when the software application is reactivated and/or when a subsequent dialog session associated with the transcription is initiated.
-
公开(公告)号:US11934895B2
公开(公告)日:2024-03-19
申请号:US18123870
申请日:2023-03-20
Applicant: GOOGLE LLC
Inventor: Sandro Feuz , Victor Carbune
IPC: G06F9/54 , H04L51/224
CPC classification number: G06F9/542 , H04L51/224
Abstract: Implementations set forth herein relate to intervening notifications provided by an application for mitigating computationally wasteful application launching behavior that is exhibited by some users. A state of a module of a target application can be identified by emulating user inputs previously provided by the user to the target application. In this way, the state of the module can be determined without visibly launching the target application. When the state of the module is determined to satisfy criteria for providing a notification to the user, the application can render a notification for the user. The application can provide intervening notifications for a variety of different target applications in order to reduce a frequency at which the user launches and closes applications to check for variations in target application content.
-
163.
公开(公告)号:US20240087568A1
公开(公告)日:2024-03-14
申请号:US18514730
申请日:2023-11-20
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Matthew Sharifi
CPC classification number: G10L15/22 , G10L15/1815 , H04L51/56 , G10L2015/223
Abstract: Implementations relate to an automated assistant that can respond to communications received via a third party application and/or other third party communication modality. The automated assistant can determine that the user is participating in multiple different conversations via multiple different third party communication services. In some implementations, conversations can be processed to identify particular features of the conversations. When the automated assistant is invoked to provide input to a conversation, the automated assistant can compare the input to the identified conversation features in order to select the particular conversation that is most relevant to the input. In this way, the automated assistant can assist with any of multiple disparate conversations that are each occurring via a different third party application.
-
公开(公告)号:US20240055002A1
公开(公告)日:2024-02-15
申请号:US18384764
申请日:2023-10-27
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
Abstract: Techniques are described herein for identifying a failed hotword attempt. A method includes: receiving first audio data; processing the first audio data to generate a first predicted output; determining that the first predicted output satisfies a secondary threshold but does not satisfy a primary threshold; receiving second audio data; processing the second audio data to generate a second predicted output; determining that the second predicted output satisfies the secondary threshold but does not satisfy the primary threshold; in response to the first predicted output and the second predicted output satisfying the secondary threshold but not satisfying the primary threshold, and in response to the first spoken utterance and the second spoken utterance satisfying one or more temporal criteria relative to one another, identifying a failed hotword attempt; and in response to identifying the failed hotword attempt, providing a hint that is responsive to the failed hotword attempt.
-
165.
公开(公告)号:US20240046935A1
公开(公告)日:2024-02-08
申请号:US18382735
申请日:2023-10-23
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
Abstract: Implementations are directed to biasing speaker authentication on a per-user basis and on a device-by-device basis and/or contextual feature(s) basis. In some of those implementations, in performing speaker authentication based on a spoken utterance, different biasing parameters are determined for each of multiple different registered users of an assistant device at which the spoken utterance was detected. In those implementations, each of the biasing parameters can be used to make it more likely or less likely (in dependence of the biasing parameter) that a corresponding registered user will be verified using the speaker authentication. Through utilization of biasing parameter(s) in performing speaker authentication, accuracy and/or robustness of speaker authentication can be increased.
-
公开(公告)号:US20240005924A1
公开(公告)日:2024-01-04
申请号:US18369610
申请日:2023-09-18
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G10L15/22 , G10L13/027 , H04L9/32
CPC classification number: G10L15/22 , H04L9/3231 , G10L13/027
Abstract: Implementations set forth herein relate to an automated assistant that can operate in a transient personalization mode, and/or assist a separate automated assistant with providing output according to a transient personalization mode. The transient personalization mode can allow a guest user of an assistant enabled-device to receive personalized responses from the assistant-enabled device—despite not being signed into the assistant-enabled device. A host automated assistant of the assistant-enabled device can securely communicate with a guest user's automated assistant through a backend process. In this way, input queries from the guest user to the host automated assistant can be personalized according to the guest automated assistant—without the guest user directly engaging with their own personal device.
-
公开(公告)号:US11860668B2
公开(公告)日:2024-01-02
申请号:US17581390
申请日:2022-01-21
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Daniel Keysers , Thomas Deselaers
IPC: G06F3/04817 , G06F3/0481 , G06F3/0482 , G06F9/451 , G06Q50/12 , G06V40/10 , G06Q30/0601
CPC classification number: G06F3/04817 , G06F3/0481 , G06F3/0482 , G06F9/451 , G06Q30/0601 , G06Q50/12 , G06V40/10
Abstract: Systems and methods enable a computing system to recognize a sequence of repeated actions and offer to automatically repeat any such recognized actions. An example method includes determining a current sequence of user actions is similar to previous sequence of user actions, determining whether the previous sequence is reproducible and, when reproducible, initiating display of a prompt that requests approval for completing the current sequence based on the previous sequence and, responsive to receiving an indication of approval, completing the previous sequence. Another example method included determining that a first current sequence of user interactions is complete and is not similar to any saved sequence of user interactions, saving the first current sequence as a previous sequence, identifying a second sequence as satisfying a similarity threshold with the previous sequence, and initiating display of a prompt that requests approval for saving the previous sequence as a shortcut.
-
公开(公告)号:US20230419951A1
公开(公告)日:2023-12-28
申请号:US18367859
申请日:2023-09-13
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
CPC classification number: G10L15/01 , G01S3/8006 , G10L15/08 , G10L15/32 , H04R29/006 , G10L2015/088
Abstract: Implementations can detect respective audio data that captures an acoustic event at multiple assistant devices in an ecosystem that includes a plurality of assistant devices, process the respective audio data locally at each of the multiple assistant devices to generate respective measures that are associated with the acoustic event using respective event detection models, process the respective measures to determine whether the detected acoustic event is an actual acoustic event, and cause an action associated with the actional acoustic event to be performed in response to determining that the detected acoustic event is the actual acoustic event. In some implementations, the multiple assistant devices that detected the respective audio data are anticipated to detect the respective audio data that captures the actual acoustic event based on a plurality of historical acoustic events being detected at each of the multiple assistant devices.
-
公开(公告)号:US20230409973A1
公开(公告)日:2023-12-21
申请号:US18241707
申请日:2023-09-01
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06N20/00 , G10L15/22 , G06F3/16 , G06F3/0488 , G06F18/22 , G06F18/214
CPC classification number: G06N20/00 , G10L15/22 , G06F3/167 , G06F3/0488 , G06F18/22 , G06F18/214 , G10L2015/223
Abstract: Implementations relate to an automated assistant that can automate repeatedly performed procedures. The automation can involve communicating with different users, organizations, and/or other automated assistants. The automated assistant, with prior permission from respective user(s), can detect repeated performance of a particular series of manually initiated computational actions. Based on this determination, the automated assistant can determine automated assistant computational action(s) that can be performed by the automated assistant in order to reduce latency in performing a procedure, reduce quantity and/or size of transmissions in performing the procedure, and/or reduce an amount of client device resources required for performing the procedure. Such actions can include communicating with an additional automated assistant that may be associated with another user and/or organization. In these and other manners, manually initiated computational actions that include electronic communications amongst users can be converted to backend operations amongst instances of automated assistants to achieve technical benefits.
-
公开(公告)号:US11830486B2
公开(公告)日:2023-11-28
申请号:US17081645
申请日:2020-10-27
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
Abstract: Techniques are described herein for identifying a failed hotword attempt. A method includes: receiving first audio data; processing the first audio data to generate a first predicted output; determining that the first predicted output satisfies a secondary threshold but does not satisfy a primary threshold; receiving second audio data; processing the second audio data to generate a second predicted output; determining that the second predicted output satisfies the secondary threshold but does not satisfy the primary threshold; in response to the first predicted output and the second predicted output satisfying the secondary threshold but not satisfying the primary threshold, and in response to the first spoken utterance and the second spoken utterance satisfying one or more temporal criteria relative to one another, identifying a failed hotword attempt; and in response to identifying the failed hotword attempt, providing a hint that is responsive to the failed hotword attempt.
-
-
-
-
-
-
-
-
-