-
31.
公开(公告)号:US12118998B2
公开(公告)日:2024-10-15
申请号:US18231112
申请日:2023-08-07
Applicant: GOOGLE LLC
Inventor: Mugurel Ionut Andreica , Vladimir Vuskovic , Joseph Lange , Sharon Stovezky , Marcin Nowak-Przygodzki
CPC classification number: G10L15/22 , G06N3/08 , G10L15/02 , G10L2015/223
Abstract: Implementations are set forth herein for creating an order of execution for actions that were requested by a user, via a spoken utterance to an automated assistant. The order of execution for the requested actions can be based on how each requested action can, or is predicted to, affect other requested actions. In some implementations, an order of execution for a series of actions can be determined based on an output of a machine learning model, such as a model that has been trained according to supervised learning. A particular order of execution can be selected to mitigate waste of processing, memory, and network resources—at least relative to other possible orders of execution. Using interaction data that characterizes past performances of automated assistants, certain orders of execution can be adapted over time, thereby allowing the automated assistant to learn from past interactions with one or more users.
-
公开(公告)号:US12073832B2
公开(公告)日:2024-08-27
申请号:US17588451
申请日:2022-01-31
Applicant: GOOGLE LLC
Inventor: Gleb Skobeltsyn , Olga Kapralova , Konstantin Shagin , Vladimir Vuskovic , Yufei Zhao , Bradley Nelson , Alessio Macrì , Abraham Lee
CPC classification number: G10L15/22 , G06F3/167 , G10L15/18 , G10L15/28 , G10L2015/223
Abstract: Implementations described herein relate to providing suggestions, via a display modality, for completing a spoken utterance for an automated assistant, in order to reduce a frequency and/or a length of time that the user will participate in a current and/or subsequent dialog session with the automated assistant. A user request can be compiled from content of an ongoing spoken utterance and content of any selected suggestion elements. When a currently compiled portion of the user request (from content of a selected suggestion(s) and an incomplete spoken utterance) is capable of being performed via the automated assistant, any actions corresponding to the currently compiled portion of the user request can be performed via the automated assistant. Furthermore, any further content resulting from performance of the actions, along with any discernible context, can be used for providing further suggestions.
-
33.
公开(公告)号:US20240161743A1
公开(公告)日:2024-05-16
申请号:US18406752
申请日:2024-01-08
Applicant: GOOGLE LLC
Inventor: Michael Fink , Vladimir Vuskovic , Shimon Or Salant , Deborah Cohen , Asaf Revach , David Kogan , Andrew Callahan , Richard Borovoy , Andrew Richardson , Eran Ofek , Idan Szpektor , Jonathan Berant , Yossi Matias
CPC classification number: G10L15/22 , G06F40/284 , G06N3/08 , G10L15/063 , G10L15/1815 , G10L15/20 , G10L2015/223
Abstract: Generating expanded responses that guide continuance of a human-to computer dialog that is facilitated by a client device and that is between at least one user and an automated assistant. The expanded responses are generated by the automated assistant in response to user interface input provided by the user via the client device, and are caused to be rendered to the user via the client device, as a response, by the automated assistant, to the user interface input of the user. An expanded response is generated based on at least one entity of interest determined based on the user interface input, and is generated to incorporate content related to one or more additional entities that are related to the entity of interest, but that are not explicitly referenced by the user interface input.
-
公开(公告)号:US11942082B2
公开(公告)日:2024-03-26
申请号:US17825778
申请日:2022-05-26
Applicant: GOOGLE LLC
Inventor: James Kuczmarski , Vibhor Jain , Amarnag Subramanya , Nimesh Ranjan , Melvin Jose Johnson Premkumar , Vladimir Vuskovic , Luna Dai , Daisuke Ikeda , Nihal Sandeep Balani , Jinna Lei , Mengmeng Niu , Hongjie Chai , Wangqing Yuan
IPC: G06F40/47 , G06F16/33 , G06F16/332 , G06F18/22 , G06F40/58 , G06N20/00 , G10L15/00 , G10L15/183 , G10L15/22 , H04L51/02
CPC classification number: G10L15/183 , G06F16/3329 , G06F16/3337 , G06F18/22 , G06F40/47 , G06F40/58 , G06N20/00 , G10L15/005 , G10L15/22 , H04L51/02
Abstract: Techniques described herein relate to facilitating end-to-end multilingual communications with automated assistants. In various implementations, speech recognition output may be generated based on voice input in a first language. A first language intent may be identified based on the speech recognition output and fulfilled in order to generate a first natural language output candidate in the first language. At least part of the speech recognition output may be translated to a second language to generate an at least partial translation, which may then be used to identify a second language intent that is fulfilled to generate a second natural language output candidate in the second language. Scores may be determined for the first and second natural language output candidates, and based on the scores, a natural language output may be selected for presentation.
-
公开(公告)号:US11930241B2
公开(公告)日:2024-03-12
申请号:US16259946
申请日:2019-01-28
Applicant: Google LLC
Inventor: Vladimir Vuskovic , Dhruv Bakshi , Amaury Forgeot d'Arc , Christoph Poropatits
IPC: H04N21/431 , G06F16/48 , G06F18/214 , G06V20/40 , G11B27/031 , H04N5/262 , H04N7/12 , H04N21/234 , H04N21/2343 , H04N21/258 , H04N21/2668 , H04N21/2743 , H04N21/435 , H04N21/44 , H04N21/4722 , H04N21/475 , H04N21/81 , H04N21/84 , H04N21/854
CPC classification number: H04N21/4312 , G06F16/48 , G06F18/214 , G06V20/41 , G11B27/031 , H04N5/262 , H04N7/12 , H04N21/23424 , H04N21/23439 , H04N21/25825 , H04N21/2668 , H04N21/2743 , H04N21/4355 , H04N21/44016 , H04N21/4722 , H04N21/4756 , H04N21/812 , H04N21/8126 , H04N21/84 , H04N21/85403
Abstract: Systems and methods for optimizing videos are disclosed. A method of the present disclosure includes analyzing a video using a plurality of rules to determine one or more optimizations for the video, the one or more optimizations reflecting a subset of properties to improve viewership statistics of the video by automatically modifying characteristics of the video based on the subset of properties that is indicative of improved viewership statistics pertaining to a plurality of users of a media hosting service, wherein the characteristics comprise at least one characteristic, which when automatically modified, results in a change to the video that correlates to improved viewership statistics pertaining to the plurality of users. The method further includes causing the characteristics of the video to be modified to implement the one or more optimizations, and causing the video with the modified characteristics to be presented to at least a subset of the plurality of users.
-
公开(公告)号:US11929069B2
公开(公告)日:2024-03-12
申请号:US17411532
申请日:2021-08-25
Applicant: Google LLC
Inventor: Vladimir Vuskovic , Stephan Wenger , Zineb Ait Bahajji , Martin Baeuml , Alexandru Dovlecel , Gleb Skobeltsyn
IPC: G10L15/22 , G06F40/295 , G06F40/35 , G06F40/56 , G10L15/18
CPC classification number: G10L15/22 , G06F40/295 , G06F40/35 , G06F40/56 , G10L15/1815 , G10L15/222 , G10L2015/227
Abstract: Methods, apparatus, and computer readable media are described related to automated assistants that proactively incorporate, into human-to-computer dialog sessions, unsolicited content of potential interest to a user. In various implementations, based on content of an existing human-to-computer dialog session between a user and an automated assistant, an entity mentioned by the user or automated assistant may be identified. Fact(s)s related to the entity or to another entity that is related to the entity may be identified based on entity data contained in database(s). For each of the fact(s), a corresponding measure of potential interest to the user may be determined. Unsolicited natural language content may then be generated that includes one or more of the facts selected based on the corresponding measure(s) of potential interest. The automated assistant may then incorporate the unsolicited content into the existing human-to-computer dialog session or a subsequent human-to-computer dialog session.
-
公开(公告)号:US11875788B2
公开(公告)日:2024-01-16
申请号:US17211488
申请日:2021-03-24
Applicant: Google LLC
Inventor: James Kuczmarski , Vibhor Jain , Amarnag Subramanya , Nimesh Ranjan , Melvin Jose Johnson Premkumar , Vladimir Vuskovic , Luna Dai , Daisuke Ikeda , Nihal Sandeep Balani , Jinna Lei , Mengmeng Niu
IPC: G06F40/00 , G10L15/183 , G10L15/00 , G10L15/22 , G06F16/33 , G06N20/00 , G06F16/332 , G06F40/47 , G06F40/58 , H04L51/02 , G06F18/22
CPC classification number: G10L15/183 , G06F16/3329 , G06F16/3337 , G06F18/22 , G06F40/47 , G06F40/58 , G06N20/00 , G10L15/005 , G10L15/22 , H04L51/02
Abstract: Techniques described herein relate to facilitating end-to-end multilingual communications with automated assistants. In various implementations, speech recognition output may be generated based on voice input in a first language. A first language intent may be identified based on the speech recognition output and fulfilled in order to generate a first natural language output candidate in the first language. At least part of the speech recognition output may be translated to a second language to generate an at least partial translation, which may then be used to identify a second language intent that is fulfilled to generate a second natural language output candidate in the second language. Scores may be determined for the first and second natural language output candidates, and based on the scores, a natural language output may be selected for presentation.
-
公开(公告)号:US11552814B2
公开(公告)日:2023-01-10
申请号:US16927373
申请日:2020-07-13
Applicant: Google LLC
Inventor: Vladimir Vuskovic , Yariv Adan
IPC: H04L12/18 , H04L51/02 , G06F16/00 , G06F40/40 , G06Q10/00 , H04L51/216 , H04L67/75 , H04L65/4038 , G06Q10/06
Abstract: Techniques are described herein for automated assistants that proactively provide content to participant(s) of multi-participant message exchange threads (e.g., group chats, audio and/or video calls in which oral messages are transcribed for analysis, etc.) based on signals such as individual participant profiles associated with participant(s). In various implementations, automated assistant(s) that may not be explicitly invoked may analyze content of a message exchange thread involving multiple human participants and/or document(s) associated with the message exchange thread. Based on the analyzing, the automated assistant(s) may identify topic(s) pertinent to the message exchange thread. Based on individual participant profiles associated with the participants, the automated assistant(s) may identify shared interest(s) of the participants. The automated assistant(s) may then select new content based both on the pertinent topic(s) and the shared interest(s) of the participants and proactively provide the new content to one or more of the participants.
-
公开(公告)号:US11521037B2
公开(公告)日:2022-12-06
申请号:US17330892
申请日:2021-05-26
Applicant: GOOGLE LLC
Inventor: Yariv Adan , Vladimir Vuskovic , Behshad Behzadi
Abstract: An example method includes receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device; identifying, based on the utterance, a task to be performed by the computational assistant; responsive to determining, by the computational assistant, that complete performance of the task will take more than a threshold amount of time, outputting, for playback by one or more speakers operably connected to the computing device, synthesized voice data that informs a user of the computing device that complete performance of the task will not be immediate; and performing, by the computational assistant, the task.
-
公开(公告)号:US20220284198A1
公开(公告)日:2022-09-08
申请号:US17825778
申请日:2022-05-26
Applicant: GOOGLE LLC
Inventor: James Kuczmarski , Vibhor Jain , Amarnag Subramanya , Nimesh Ranjan , Melvin Jose Johnson Premkumar , Vladimir Vuskovic , Luna Dai , Daisuke Ikeda , Nihal Sandeep Balani , Jinna Lei , Mengmeng Niu , Hongjie Chai , Wangqing Yuan
Abstract: Techniques described herein relate to facilitating end-to-end multilingual communications with automated assistants. In various implementations, speech recognition output may be generated based on voice input in a first language. A first language intent may be identified based on the speech recognition output and fulfilled in order to generate a first natural language output candidate in the first language. At least part of the speech recognition output may be translated to a second language to generate an at least partial translation, which may then be used to identify a second language intent that is fulfilled to generate a second natural language output candidate in the second language. Scores may be determined for the first and second natural language output candidates, and based on the scores, a natural language output may be selected for presentation.
-
-
-
-
-
-
-
-
-