-
31.
公开(公告)号:US20180107650A1
公开(公告)日:2018-04-19
申请号:US15295192
申请日:2016-10-17
Applicant: Google LLC
Inventor: Marcos Calvo , Victor Carbune , Henry Rowley , Thomas Deselaers
IPC: G06F17/27 , G06F3/0488 , G06F3/0482
CPC classification number: G06F17/2765 , G06F3/0482 , G06F3/04883 , G06F17/275 , G06K9/00402
Abstract: A first handwriting input is received comprising strokes corresponding to a set of first characters comprising one or more first characters forming a first language model unit. A set of candidate first characters and a set of candidate first language model units with corresponding probability scores are determined based on an analysis of the one or more sets of candidate first characters using the first language model and a corresponding first character recognition model. When no first probability score satisfies a threshold, one or more sets of candidate second characters and a set of candidate second language model units are determined based on an analysis of the first handwriting input using a second language model and a corresponding second character recognition model. A first candidate list is then output comprising at least one of the set of candidate second language model units.
-
32.
公开(公告)号:US12227261B2
公开(公告)日:2025-02-18
申请号:US18378080
申请日:2023-10-09
Applicant: GOOGLE LLC
Inventor: Sandro Feuz , Thomas Deselaers
Abstract: Implementations set forth herein relate to generating a pre-call analysis for one or more users that are receiving and/or initializing a call with one or more other users, and/or prioritizing pre-call content according to whether security-related value was gleaned from provisioning certain pre-call content. One or more machine learning models can be employed for determining the pre-call content to be cached and/or presented prior to a user accepting a call from another user. Feedback provided before, during, and/or after the call can be used as a basis from which to prioritize certain content and/or sources of content when generating pre-call content for a subsequent call. Other information, such as contextual data (e.g., calendar entries, available peripheral devices, location, etc.) corresponding to the previous call and/or the subsequent call, can also be used as a basis from which to provide a pre-call analysis.
-
公开(公告)号:US20240344833A1
公开(公告)日:2024-10-17
申请号:US18754107
申请日:2024-06-25
Applicant: GOOGLE LLC
Inventor: Thomas Deselaers , Sandro Feuz
IPC: G01C21/34 , G01C21/36 , G06N5/04 , G06N20/00 , H04L51/046
CPC classification number: G01C21/3438 , G01C21/3691 , G06N5/04 , G06N20/00 , H04L51/046 , G01C21/362
Abstract: A navigation service determines that a first user intends to navigate to a shared destination from a first location, at a first time, and that a second user intends to navigate to the shared destination from a second location, at a second time within a threshold interval of the first time. The navigation service notifies the first user using an electronic notification that the second user intends to navigate to the shared destination, receives from the first user an electronic request to coordinate navigation to the shared destination with the second user, and in response to receiving the electronic request, provides navigation directions to the shared destination to a device associated with the first user in view of a progress of the second user toward the shared destination.
-
34.
公开(公告)号:US20240249719A1
公开(公告)日:2024-07-25
申请号:US17058895
申请日:2020-11-10
Applicant: Google LLC
Inventor: Thomas Deselaers , Sandro Feuz
CPC classification number: G10L15/22 , G06F3/167 , G06F9/547 , G10L15/063 , G10L2015/223 , G10L2015/228
Abstract: Implementations relate to an automated assistant that is capable of interacting with non-assistant applications that do not have functionality explicitly provided for interfacing with certain automated assistants. Application data, such as annotation data and/or GUI data, associated with a non-assistant application, can be processed to map such data into an embedding space. An assistant input command can then be processed and mapped to the same embedding space, and a distance from the assistant input command embedding and the non-assistant application data embedding can be determined. When the distance between the assistant input command embedding and the non-assistant application data embedding satisfies threshold(s), the automated assistant can generate instruction(s), for the non-assistant application, that correspond to the non-assistant application data. For instance, the instruction(s) can simulate user input(s) that cause the non-assistant application to perform one or more operations characterized by, or otherwise associated with, the non-assistant application data.
-
公开(公告)号:US11995119B2
公开(公告)日:2024-05-28
申请号:US17142974
申请日:2021-01-06
Applicant: Google LLC
Inventor: Sandro Feuz , Thomas Deselaers
IPC: G06F16/53 , G06F16/41 , G06F16/438 , G06F16/73 , G06F16/951 , G06F21/32 , G06F21/60 , G06F21/62 , G06N3/045 , G06N20/00 , G06V40/16
CPC classification number: G06F16/53 , G06F16/41 , G06F16/438 , G06F16/73 , G06F21/602 , G06F21/6245 , G06N20/00 , G06V40/172
Abstract: This document describes techniques and systems that enable privacy controls for sharing embeddings for searching and indexing media content. A set of images of a user's face are obtained and a machine-learned model is applied to the set of images to generate a user-specific dataset of face embeddings for the user. Media content stored in a media storage is indexed by applying the machine-learned model to the media content to provide indexed media information identifying one or more faces shown in the media content. Access to the indexed media information by another user querying the media content for images or videos depicting the user is controlled based on a digital key shared by the user with the other user, where the digital key is associated with the user-specific dataset and the user-specific dataset is usable to identify the images or videos depicting the user.
-
公开(公告)号:US11860668B2
公开(公告)日:2024-01-02
申请号:US17581390
申请日:2022-01-21
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Daniel Keysers , Thomas Deselaers
IPC: G06F3/04817 , G06F3/0481 , G06F3/0482 , G06F9/451 , G06Q50/12 , G06V40/10 , G06Q30/0601
CPC classification number: G06F3/04817 , G06F3/0481 , G06F3/0482 , G06F9/451 , G06Q30/0601 , G06Q50/12 , G06V40/10
Abstract: Systems and methods enable a computing system to recognize a sequence of repeated actions and offer to automatically repeat any such recognized actions. An example method includes determining a current sequence of user actions is similar to previous sequence of user actions, determining whether the previous sequence is reproducible and, when reproducible, initiating display of a prompt that requests approval for completing the current sequence based on the previous sequence and, responsive to receiving an indication of approval, completing the previous sequence. Another example method included determining that a first current sequence of user interactions is complete and is not similar to any saved sequence of user interactions, saving the first current sequence as a previous sequence, identifying a second sequence as satisfying a similarity threshold with the previous sequence, and initiating display of a prompt that requests approval for saving the previous sequence as a shortcut.
-
公开(公告)号:US11756537B2
公开(公告)日:2023-09-12
申请号:US17962636
申请日:2022-10-10
Applicant: GOOGLE LLC
Inventor: Pedro Gonnet Anders , Victor Carbune , Daniel Keysers , Thomas Deselaers , Sandro Feuz
CPC classification number: G10L15/1815 , G10L15/19 , G10L15/22 , G10L2015/223
Abstract: Techniques are described herein for enabling an automated assistant to adjust its behavior depending on a detected age range and/or “vocabulary level” of a user who is engaging with the automated assistant. In various implementations, data indicative of a user's utterance may be used to estimate one or more of the user's age range and/or vocabulary level. The estimated age range/vocabulary level may be used to influence various aspects of a data processing pipeline employed by an automated assistant. In various implementations, aspects of the data processing pipeline that may be influenced by the user's age range/vocabulary level may include one or more of automated assistant invocation, speech-to-text (“STT”) processing, intent matching, intent resolution (or fulfillment), natural language generation, and/or text-to-speech (“TTS”) processing. In some implementations, one or more tolerance thresholds associated with one or more of these aspects, such as grammatical tolerances, vocabularic tolerances, etc., may be adjusted.
-
公开(公告)号:US11683320B2
公开(公告)日:2023-06-20
申请号:US17237573
申请日:2021-04-22
Applicant: Google LLC
Inventor: Victor Carbune , Thomas Deselaers , Sandro Feuz
IPC: H04L9/40 , G10L15/22 , H04L67/12 , H04L67/30 , G06F16/635 , G06F3/16 , G10L17/00 , G06F21/32 , G06F9/50 , G10L15/30
CPC classification number: H04L63/107 , G06F3/167 , G06F9/5055 , G06F16/635 , G06F21/32 , G10L15/22 , G10L17/00 , H04L63/0861 , H04L67/12 , H04L67/30 , G06F2221/2111 , G10L15/30 , G10L2015/226 , G10L2015/227
Abstract: The present disclosure is generally directed to a data processing system for customizing content in a voice activated computer network environment. With user consent, the data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, increasing the accuracy of the voice identification process used in the generation of customized content. The present solution can make accurate identifications while generating fewer audio identification models, which are computationally intensive to generate.
-
公开(公告)号:US11521618B2
公开(公告)日:2022-12-06
申请号:US16716654
申请日:2019-12-17
Applicant: Google LLC
Inventor: Victor Carbune , Pedro Gonnet Anders , Thomas Deselaers , Sandro Feuz
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for collaboration between multiple voice controlled devices are disclosed. In one aspect, a method includes the actions of identifying, by a first computing device, a second computing device that is configured to respond to a particular, predefined hotword; receiving audio data that corresponds to an utterance; receiving a transcription of additional audio data outputted by the second computing device in response to the utterance; based on the transcription of the additional audio data and based on the utterance, generating a transcription that corresponds to a response to the additional audio data; and providing, for output, the transcription that corresponds to the response.
-
公开(公告)号:US11495217B2
公开(公告)日:2022-11-08
申请号:US16728582
申请日:2019-12-27
Applicant: Google LLC
Inventor: Pedro Gonnet Anders , Victor Carbune , Daniel Keysers , Thomas Deselaers , Sandro Feuz
Abstract: Techniques are described herein for enabling an automated assistant to adjust its behavior depending on a detected age range and/or “vocabulary level” of a user who is engaging with the automated assistant. In various implementations, data indicative of a user's utterance may be used to estimate one or more of the user's age range and/or vocabulary level. The estimated age range/vocabulary level may be used to influence various aspects of a data processing pipeline employed by an automated assistant. In various implementations, aspects of the data processing pipeline that may be influenced by the user's age range/vocabulary level may include one or more of automated assistant invocation, speech-to-text (“STT”) processing, intent matching, intent resolution (or fulfillment), natural language generation, and/or text-to-speech (“TTS”) processing. In some implementations, one or more tolerance thresholds associated with one or more of these aspects, such as grammatical tolerances, vocabularic tolerances, etc., may be adjusted.
-
-
-
-
-
-
-
-
-