-
公开(公告)号:US12283270B2
公开(公告)日:2025-04-22
申请号:US17541098
申请日:2021-12-02
Applicant: GOOGLE LLC
Inventor: Asaf Aharoni , Yaniv Leviathan , Eyal Segalis , Gal Elidan , Sasha Goldshtein , Tomer Amiaz , Deborah Cohen
Abstract: Implementations are directed to providing a voice bot development platform that enables a third-party developer to train a voice bot based on training instance(s). The training instance(s) can each include training input and training output. The training input can include a portion of a corresponding conversation and a prior context of the corresponding conversation. The training output can include a corresponding ground truth response to the portion of the corresponding conversation. Subsequent to training, the voice bot can be deployed for conducting conversations on behalf of a third-party. In some implementations, the voice bot is further trained based on a corresponding feature emphasis input that attentions the voice bot to a particular feature of the portion of the corresponding conversation. In some additional or alternative implementations, the voice bot is further trained to interact with third-party system(s) via remote procedure calls (RPCs).
-
公开(公告)号:US20250095632A1
公开(公告)日:2025-03-20
申请号:US18802567
申请日:2024-08-13
Applicant: GOOGLE LLC
Inventor: Sasha Goldshtein , Yoav Tzur , Gal Moshitch , ChiTa Tsai , Sharon Sultan
Abstract: Implementations are directed to providing a voice wrapper to an existing first-party text-based chatbot to enable the existing first-party text-based chatbot to engage in corresponding voice-based conversations. The voice wrapper can include a plurality of components. For instance, the voice wrapper can include a plurality of input components for utilization in responding to a spoken utterance, and in lieu of the existing first-party text-based chatbot, and/or to modify input to be provided to the existing first-party text-based chatbot in responding to the spoken utterance. Also, for instance, the voice wrapper can include a plurality of output components for utilization in responding to the spoken utterance, to reduce perceived latency of the existing first-party text-based chatbot, and/or to modify output generated by the existing first-party text-based chatbot in responding to the spoken utterance.
-
3.
公开(公告)号:US20240146668A1
公开(公告)日:2024-05-02
申请号:US18403401
申请日:2024-01-03
Applicant: GOOGLE LLC
Inventor: Asaf Aharoni , Eyal Segalis , Ofer Ron , Sasha Goldshtein , Tomer Amiaz , Razvan Mathias , Yaniv Leviathan
CPC classification number: H04L51/02 , G06N20/00 , G10L15/063 , G10L15/10 , G10L15/22
Abstract: Implementations are directed to updating a trained voice bot that is deployed for conducting conversations on behalf of a third-party. A third-party developer can interact with a voice bot development system that enables the third-party developer to train, update, validate, and monitor performance of the trained voice bot. In various implementations, the trained voice bot can be updated by updating a corpus of training instances that was initially utilized to train the voice bot, and updating the trained voice bot based on the updated corpus. In some implementations, the corpus of training instances may be updated in response to identifying occurrence(s) of behavioral error(s) of the trained voice bot while the conversations are being conducted on behalf of the third-party. In additional or alternative implementations, the corpus of training instances may be updated in response to determining the trained voice bot does not include a desired behavior.
-
公开(公告)号:US11804211B2
公开(公告)日:2023-10-31
申请号:US17112418
申请日:2020-12-04
Applicant: Google LLC
Inventor: Asaf Aharoni , Yaniv Leviathan , Eyal Segalis , Gal Elidan , Sasha Goldshtein , Tomer Amiaz , Deborah Cohen
CPC classification number: G10L15/063 , G06N20/00 , G10L15/02 , G10L15/04 , G10L15/22 , H04L67/133 , H04M3/493 , G10L2015/0635
Abstract: Implementations are directed to providing a voice bot development platform that enables a third-party developer to train a voice bot based on training instance(s). The training instance(s) can each include training input and training output. The training input can include a portion of a corresponding conversation and a prior context of the corresponding conversation. The training output can include a corresponding ground truth response to the portion of the corresponding conversation. Subsequent to training, the voice bot can be deployed for conducting conversations on behalf of a third-party. In some implementations, the voice bot is further trained based on a corresponding feature emphasis input that attentions the voice bot to a particular feature of the portion of the corresponding conversation. In some additional or alternative implementations, the voice bot is further trained to interact with third-party system(s) via remote procedure calls (RPCs).
-
公开(公告)号:US20220180858A1
公开(公告)日:2022-06-09
申请号:US17541098
申请日:2021-12-02
Applicant: GOOGLE LLC
Inventor: Asaf Aharoni , Yaniv LEVIATHAN , Eyal SEGALIS , Gal ELIDAN , Sasha Goldshtein , Tomer Amiaz , Deborah Cohen
Abstract: Implementations are directed to providing a voice bot development platform that enables a third-party developer to train a voice bot based on training instance(s). The training instance(s) can each include training input and training output. The training input can include a portion of a corresponding conversation and a prior context of the corresponding conversation. The training output can include a corresponding ground truth response to the portion of the corresponding conversation. Subsequent to training, the voice bot can be deployed for conducting conversations on behalf of a third-party. In some implementations, the voice bot is further trained based on a corresponding feature emphasis input that attentions the voice bot to a particular feature of the portion of the corresponding conversation. In some additional or alternative implementations, the voice bot is further trained to interact with third-party system(s) via remote procedure calls (RPCs).
-
公开(公告)号:US20250133036A1
公开(公告)日:2025-04-24
申请号:US18385692
申请日:2023-10-31
Applicant: GOOGLE LLC
Inventor: Sasha Goldshtein , Yoav Tzur
IPC: H04L51/02 , G06F40/205 , G06F40/30
Abstract: Implementations are directed to simulating automated telephone call(s) to be performed by an automated assistant. Processor(s) can receive user input and determine, based on the user input, that the user input includes: a request to cause the automated assistant to initiate an automated telephone call with an entity, and a task to be performed during the automated telephone call. In some implementations, the processor(s) can cause a simulation of the automated telephone call to be performed to simulate the task, and, based on a result of the simulation, determine whether to initiate the automated telephone call or to refrain from initiating the automated telephone call. In additional or alternative implementations, the processor(s) can determine whether to cause the simulation of the automated telephone call to be performed based on a type of the entity, a type of the task, and/or whether a prior simulation of the task has been performed.
-
公开(公告)号:US20250046305A1
公开(公告)日:2025-02-06
申请号:US18228411
申请日:2023-07-31
Applicant: GOOGLE LLC
Inventor: Sasha Goldshtein
Abstract: Implementations are directed to generating voice-based chatbot policy override(s) and/or utilizing voice-based chatbot policy override(s) in conjunction with existing voice-based chatbot(s). The voice-based chatbot policy override(s) can correspond to, for example, machine learning (ML) model(s) that supplement functionality of the existing voice-based chatbot(s). Notably, the voice-based chatbot policy override(s) are associated with rule(s) (e.g., by virtue of training the ML model(s) that correspond to the voice-based chatbot policy override(s)) for when the voice-based chatbot policy override(s) should be utilized in lieu of the existing voice-based chatbot(s) in responding to spoken utterance(s) of human user(s) engaged in corresponding conversation(s) with the voice-based chatbot policy override(s). Nonetheless, from a perspective of the human user(s), it appears as if they are still engaging in the corresponding conversations with the existing voice-based chatbot(s). Thus, the functionality of the existing voice-based chatbot(s) can be supplemented without having to re-train the existing voice-based chatbot(s).
-
公开(公告)号:US20220405478A1
公开(公告)日:2022-12-22
申请号:US17774460
申请日:2019-11-04
Applicant: Google LLC
Inventor: Tal Cohen , Tal Snir , Sivan Eiger , Zahi Akiva , Gadi Ben Amram , Ran Dahan , Sasha Goldshtein , Yossi Matias , Shoji Ogura
IPC: G06F40/295 , G10L15/197 , G06V20/40 , G06V40/10 , G06F3/0488 , G06V40/20 , G06F16/783
Abstract: Implementations are provided for automatically mining corpus(es) of electronic video files for video clips that contain spoken utterances that are suitable usage examples to accompany or compliment dictionary definitions. These video clips may then be associated with target n-grams in a searchable database, such as a database underlying an online dictionary. In various implementations, a set of candidate video clips in which a target n-gram is uttered in a target context may be identified from a corpus of electronic video files. For each candidate video clip of the set, pre-existing manual subtitles associated with the candidate video clip may be compared to text generated based on speech recognition processing of an audio portion of the candidate video clip. Based at least in part on the comparing, a measure of suitability as a dictionary usage example may be calculated for the candidate video clip.
-
公开(公告)号:US20250097168A1
公开(公告)日:2025-03-20
申请号:US18370239
申请日:2023-09-19
Applicant: GOOGLE LLC
Inventor: Sasha Goldshtein , Yoav Tzur , Shlomo Fruchter , Gal Moshitch , ChiTa Tsai , Sharon Sultan
Abstract: Implementations are directed to providing a voice wrapper to an existing third-party text-based chatbot to enable the existing third-party text-based chatbot to engage in corresponding voice-based conversations. The voice wrapper can include a plurality of components. For instance, the voice wrapper can include a plurality of input components for utilization in responding to a spoken utterance, and in lieu of the existing third-party text-based chatbot, and/or to modify input to be provided to the existing third-party text-based chatbot in responding to the spoken utterance. Also, for instance, the voice wrapper can include a plurality of output components for utilization in responding to the spoken utterance, to reduce perceived latency of the existing third-party text-based chatbot, and/or to modify output generated by the existing third-party text-based chatbot in responding to the spoken utterance.
-
公开(公告)号:US12197868B2
公开(公告)日:2025-01-14
申请号:US17774460
申请日:2019-11-04
Applicant: Google LLC
Inventor: Tal Cohen , Tal Snir , Sivan Eiger , Zahi Akiva , Gadi Ben Amram , Ran Dahan , Sasha Goldshtein , Yossi Matias , Shoji Ogura
IPC: G10L15/20 , G06F3/0488 , G06F16/783 , G06F40/295 , G06V20/40 , G06V40/10 , G06V40/20 , G10L15/197
Abstract: Implementations are provided for automatically mining corpus(es) of electronic video files for video clips that contain spoken utterances that are suitable usage examples to accompany or compliment dictionary definitions. These video clips may then be associated with target n-grams in a searchable database, such as a database underlying an online dictionary. In various implementations, a set of candidate video clips in which a target n-gram is uttered in a target context may be identified from a corpus of electronic video files. For each candidate video clip of the set, pre-existing manual subtitles associated with the candidate video clip may be compared to text generated based on speech recognition processing of an audio portion of the candidate video clip. Based at least in part on the comparing, a measure of suitability as a dictionary usage example may be calculated for the candidate video clip.
-
-
-
-
-
-
-
-
-