-
Publication No.: US12223944B2
Publication date: 2025-02-11
Application No.: US17744440
Filing date: 2022-05-13
Applicant: GOOGLE LLC
Inventor: Martin Baeuml, Thushan Amarasiriwardena, Roberto Pieraccini, Gianluca Martini
IPC: G06F40/56, G06F3/16, G06F16/332, G06F40/169, G06T7/20, G06V20/40, G06V40/20, G10L13/02, G10L13/033, G10L13/08, G10L13/10, G10L15/06, G10L15/18, G10L15/183, G10L15/22, G10L25/57, H04N5/04
Abstract: Implementations relate to dynamically adapting a given assistant output based on a given persona, from among a plurality of disparate personas, assigned to an automated assistant. In some implementations, the given assistant output can be generated and subsequently adapted based on the given persona assigned to the automated assistant. In other implementations, the given assistant output can be generated specific to the given persona and without having to subsequently adapt the given assistant output to the given persona. Notably, the given assistant output can include a stream of textual content to be synthesized for audible presentation to the user, and a stream of visual cues utilized in controlling a display of a client device and/or in controlling a visualized representation of the automated assistant. Various implementations utilize large language models (LLMs), or output previously generated utilizing LLMs, to reflect the given persona in the given assistant output.
-
Publication No.: US20240304184A1
Publication date: 2024-09-12
Application No.: US18120216
Filing date: 2023-03-10
Applicant: GOOGLE LLC
Inventor: Roberto Pieraccini, Wangqing Yuan, Martin Baeuml
CPC classification number: G10L15/197, G06F40/35, G10L15/063, G10L15/1807, G10L15/1815, G10L15/22, G10L15/30
Abstract: As part of an ongoing dialog between a user and an automated assistant, processor(s) can receive a natural language (NL) based input from the user during a turn of the ongoing dialog, obtain style signal(s) for the turn, and determine, based on the style signal(s), a NL based response style that is not specified in the NL based input. Further, the processor(s) can process, using a large language model (LLM), the NL based input and a NL based response style tag for the NL based response style to generate LLM output, determine, based on the LLM output, a NL based response in the NL based response style, and cause the NL based response to be rendered. In some implementations, a LLM behavior controller is utilized to determine the NL based response style, whereas in other implementations, the LLM is fine-tuned to determine the NL based response style.
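The flow in this abstract (style signals, then a response style, then a style tag passed to the LLM together with the input) can be sketched roughly as follows. This is only an illustration: the signal names (`driving`, `hour`), the style labels, the `[style: ...]` tag format, and the `llm` callable are all hypothetical and do not come from the patent.

```python
from typing import Any, Callable, Dict


def choose_style(signals: Dict[str, Any]) -> str:
    """Map per-turn style signals to a response style (hypothetical rules)."""
    if signals.get("driving"):
        return "concise"
    if signals.get("hour", 12) >= 22:
        return "calm"
    return "neutral"


def respond(nl_input: str, signals: Dict[str, Any],
            llm: Callable[[str], str]) -> str:
    """Prepend a style tag to the user input and hand both to the LLM."""
    style = choose_style(signals)
    # Hypothetical tag format; the abstract does not specify one.
    prompt = f"[style: {style}] {nl_input}"
    return llm(prompt)


def echo_llm(prompt: str) -> str:
    """Stand-in for a real LLM call: returns the prompt unchanged."""
    return prompt


tagged = respond("what's the weather", {"driving": True}, echo_llm)
```

Here the rule-based `choose_style` plays the role the abstract assigns to an LLM behavior controller; the abstract's alternative, a fine-tuned LLM that determines the style itself, is not shown.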
-
Publication No.: US20230377571A1
Publication date: 2023-11-23
Application No.: US18230581
Filing date: 2023-08-04
Applicant: GOOGLE LLC
Inventor: Vladimir Vuskovic, Stephan Wenger, Zineb Ait Bahajji, Martin Baeuml, Alexandru Dovlecel, Gleb Skobeltsyn
IPC: G10L15/22, G06F40/35, G06F40/56, G06F40/295, G10L15/18
CPC classification number: G10L15/22, G06F40/35, G06F40/56, G06F40/295, G10L15/1815, G10L15/222, G10L2015/227
Abstract: Methods, apparatus, and computer readable media are described related to automated assistants that proactively incorporate, into human-to-computer dialog sessions, unsolicited content of potential interest to a user. In various implementations, based on content of an existing human-to-computer dialog session between a user and an automated assistant, an entity mentioned by the user or automated assistant may be identified. Fact(s) related to the entity or to another entity that is related to the entity may be identified based on entity data contained in database(s). For each of the fact(s), a corresponding measure of potential interest to the user may be determined. Unsolicited natural language content may then be generated that includes one or more of the facts selected based on the corresponding measure(s) of potential interest. The automated assistant may then incorporate the unsolicited content into the existing human-to-computer dialog session or a subsequent human-to-computer dialog session.
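The selection step described in this abstract (score each fact, keep the most interesting ones, phrase them as unsolicited content) might look like the minimal sketch below. The `Fact` type, the numeric `interest` field, the threshold, and the "By the way" phrasing are assumptions made for illustration; the abstract does not say how the measure of potential interest is computed.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Fact:
    entity: str
    text: str
    interest: float  # hypothetical measure of potential interest, 0..1


def select_facts(facts: List[Fact], threshold: float = 0.5,
                 max_facts: int = 1) -> List[Fact]:
    """Keep facts at or above the threshold, most interesting first."""
    eligible = [f for f in facts if f.interest >= threshold]
    eligible.sort(key=lambda f: f.interest, reverse=True)
    return eligible[:max_facts]


def unsolicited_content(facts: List[Fact]) -> str:
    """Turn the selected facts into one natural language remark."""
    if not facts:
        return ""
    return "By the way, " + " ".join(f.text for f in facts)


facts = [
    Fact("Mozart", "Mozart wrote his first symphony at age eight.", 0.9),
    Fact("Mozart", "Mozart was born in Salzburg.", 0.3),
    Fact("Salzburg", "Salzburg hosts a music festival every summer.", 0.6),
]
chosen = select_facts(facts, threshold=0.5, max_facts=2)
remark = unsolicited_content(chosen)
```

The third fact belongs to a related entity rather than the mentioned one, which mirrors the abstract's "another entity that is related to the entity".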
-
Publication No.: US20210383809A1
Publication date: 2021-12-09
Application No.: US17411532
Filing date: 2021-08-25
Applicant: Google LLC
Inventor: Vladimir Vuskovic, Stephan Wenger, Zineb Ait Bahajji, Martin Baeuml, Alexandru Dovlecel, Gleb Skobeltsyn
IPC: G10L15/22, G06F40/35, G06F40/56, G06F40/295, G10L15/18
Abstract: Methods, apparatus, and computer readable media are described related to automated assistants that proactively incorporate, into human-to-computer dialog sessions, unsolicited content of potential interest to a user. In various implementations, based on content of an existing human-to-computer dialog session between a user and an automated assistant, an entity mentioned by the user or automated assistant may be identified. Fact(s) related to the entity or to another entity that is related to the entity may be identified based on entity data contained in database(s). For each of the fact(s), a corresponding measure of potential interest to the user may be determined. Unsolicited natural language content may then be generated that includes one or more of the facts selected based on the corresponding measure(s) of potential interest. The automated assistant may then incorporate the unsolicited content into the existing human-to-computer dialog session or a subsequent human-to-computer dialog session.
-
Publication No.: US20180322880A1
Publication date: 2018-11-08
Application No.: US15825919
Filing date: 2017-11-29
Applicant: Google LLC
Inventor: Vladimir Vuskovic, Stephan Wenger, Zineb Ait Bahajji, Martin Baeuml, Alexandru Dovlecel, Gleb Skobeltsyn
Abstract: Methods, apparatus, and computer readable media are described related to automated assistants that proactively incorporate, into human-to-computer dialog sessions, unsolicited content of potential interest to a user. In various implementations, based on content of an existing human-to-computer dialog session between a user and an automated assistant, an entity mentioned by the user or automated assistant may be identified. Fact(s) related to the entity or to another entity that is related to the entity may be identified based on entity data contained in database(s). For each of the fact(s), a corresponding measure of potential interest to the user may be determined. Unsolicited natural language content may then be generated that includes one or more of the facts selected based on the corresponding measure(s) of potential interest. The automated assistant may then incorporate the unsolicited content into the existing human-to-computer dialog session or a subsequent human-to-computer dialog session.
-
Publication No.: US10026398B2
Publication date: 2018-07-17
Application No.: US15205505
Filing date: 2016-07-08
Applicant: Google LLC
Inventor: Behshad Behzadi, Dmitry Osmakov, Martin Baeuml, Gleb Skobeltsyn
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for predicting follow-up queries to an initial transcription of an utterance. In some implementations, one or more follow-up queries that are pre-associated with a transcription of an initial utterance of a user are identified. A new or modified language model in which a respective probability associated with one or more of the follow-up queries is increased with respect to an initial language model is obtained. Subsequent audio data corresponding to a subsequent utterance of the user is then received. The subsequent audio data is processed using the new or modified language model to generate a transcription of the subsequent utterance. The transcription of the subsequent utterance is then provided for output to the user.
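The core idea of this abstract, raising the probability of predicted follow-up queries before decoding the next utterance, can be illustrated with a toy sketch. It assumes a language model reduced to a dictionary of whole-query probabilities and a recognizer that rescores a short candidate list; the function names, the boost factor, and the scores are hypothetical and are not taken from the patent.

```python
from typing import Dict, List, Tuple

FLOOR = 1e-6  # hypothetical probability for queries the model has not seen


def boost_follow_ups(initial_lm: Dict[str, float], follow_ups: List[str],
                     boost: float = 5.0) -> Dict[str, float]:
    """Return a modified model with follow-up queries boosted, renormalized."""
    boosted = dict(initial_lm)
    for query in follow_ups:
        boosted[query] = boosted.get(query, FLOOR) * boost
    total = sum(boosted.values())
    return {query: p / total for query, p in boosted.items()}


def transcribe(candidates: List[Tuple[str, float]],
               lm: Dict[str, float]) -> str:
    """Pick the candidate maximizing acoustic score times LM probability."""
    return max(candidates, key=lambda c: c[1] * lm.get(c[0], FLOOR))[0]


initial_lm = {
    "weather in paris": 0.5,
    "whether in paris": 0.4,
    "how about tomorrow": 0.05,
    "how a boat tomorrow": 0.05,
}
# Follow-ups pre-associated with the initial transcription "weather in paris".
modified_lm = boost_follow_ups(initial_lm, ["how about tomorrow"])

# (text, acoustic score) pairs for the subsequent utterance.
candidates = [("how a boat tomorrow", 0.55), ("how about tomorrow", 0.45)]
before = transcribe(candidates, initial_lm)
after = transcribe(candidates, modified_lm)
```

With the initial model the acoustically stronger but implausible candidate wins; with the modified model the predicted follow-up is selected. A production recognizer would bias an n-gram or neural model during decoding rather than rescore whole-query probabilities.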