Example-based voice bot development techniques

    Publication No.: US12283270B2

    Publication date: 2025-04-22

    Application No.: US17541098

    Filing date: 2021-12-02

    Applicant: GOOGLE LLC

    Abstract: Implementations are directed to providing a voice bot development platform that enables a third-party developer to train a voice bot based on training instance(s). The training instance(s) can each include training input and training output. The training input can include a portion of a corresponding conversation and a prior context of the corresponding conversation. The training output can include a corresponding ground truth response to the portion of the corresponding conversation. Subsequent to training, the voice bot can be deployed for conducting conversations on behalf of a third-party. In some implementations, the voice bot is further trained based on a corresponding feature emphasis input that attentions the voice bot to a particular feature of the portion of the corresponding conversation. In some additional or alternative implementations, the voice bot is further trained to interact with third-party system(s) via remote procedure calls (RPCs).

    VOICE WRAPPER(S) FOR EXISTING FIRST-PARTY TEXT-BASED CHATBOT(S)

    Publication No.: US20250095632A1

    Publication date: 2025-03-20

    Application No.: US18802567

    Filing date: 2024-08-13

    Applicant: GOOGLE LLC

    Abstract: Implementations are directed to providing a voice wrapper to an existing first-party text-based chatbot to enable the existing first-party text-based chatbot to engage in corresponding voice-based conversations. The voice wrapper can include a plurality of components. For instance, the voice wrapper can include a plurality of input components for utilization in responding to a spoken utterance, and in lieu of the existing first-party text-based chatbot, and/or to modify input to be provided to the existing first-party text-based chatbot in responding to the spoken utterance. Also, for instance, the voice wrapper can include a plurality of output components for utilization in responding to the spoken utterance, to reduce perceived latency of the existing first-party text-based chatbot, and/or to modify output generated by the existing first-party text-based chatbot in responding to the spoken utterance.

    EXAMPLE-BASED VOICE BOT DEVELOPMENT TECHNIQUES

    Publication No.: US20220180858A1

    Publication date: 2022-06-09

    Application No.: US17541098

    Filing date: 2021-12-02

    Applicant: GOOGLE LLC

    Abstract: Implementations are directed to providing a voice bot development platform that enables a third-party developer to train a voice bot based on training instance(s). The training instance(s) can each include training input and training output. The training input can include a portion of a corresponding conversation and a prior context of the corresponding conversation. The training output can include a corresponding ground truth response to the portion of the corresponding conversation. Subsequent to training, the voice bot can be deployed for conducting conversations on behalf of a third-party. In some implementations, the voice bot is further trained based on a corresponding feature emphasis input that attentions the voice bot to a particular feature of the portion of the corresponding conversation. In some additional or alternative implementations, the voice bot is further trained to interact with third-party system(s) via remote procedure calls (RPCs).

    SIMULATION OF AUTOMATED TELEPHONE CALL(S)

    Publication No.: US20250133036A1

    Publication date: 2025-04-24

    Application No.: US18385692

    Filing date: 2023-10-31

    Applicant: GOOGLE LLC

    Abstract: Implementations are directed to simulating automated telephone call(s) to be performed by an automated assistant. Processor(s) can receive user input and determine, based on the user input, that the user input includes: a request to cause the automated assistant to initiate an automated telephone call with an entity, and a task to be performed during the automated telephone call. In some implementations, the processor(s) can cause a simulation of the automated telephone call to be performed to simulate the task, and, based on a result of the simulation, determine whether to initiate the automated telephone call or to refrain from initiating the automated telephone call. In additional or alternative implementations, the processor(s) can determine whether to cause the simulation of the automated telephone call to be performed based on a type of the entity, a type of the task, and/or whether a prior simulation of the task has been performed.

    VOICE-BASED CHATBOT POLICY OVERRIDE(S) FOR EXISTING VOICE-BASED CHATBOT(S)

    Publication No.: US20250046305A1

    Publication date: 2025-02-06

    Application No.: US18228411

    Filing date: 2023-07-31

    Applicant: GOOGLE LLC

    Inventor: Sasha Goldshtein

    Abstract: Implementations are directed to generating voice-based chatbot policy override(s) and/or utilizing voice-based chatbot policy override(s) in conjunction with existing voice-based chatbot(s). The voice-based chatbot policy override(s) can correspond to, for example, machine learning (ML) model(s) that supplement functionality of the existing voice-based chatbot(s). Notably, the voice-based chatbot policy override(s) are associated with rule(s) (e.g., by virtue of training the ML model(s) that correspond to the voice-based chatbot policy override(s)) for when the voice-based chatbot policy override(s) should be utilized in lieu of the existing voice-based chatbot(s) in responding to spoken utterance(s) of human user(s) engaged in corresponding conversation(s) with the voice-based chatbot policy override(s). Nonetheless, from a perspective of the human user(s), it appears as if they are still engaging in the corresponding conversations with the existing voice-based chatbot(s). Thus, the functionality of the existing voice-based chatbot(s) can be supplemented without having to re-train the existing voice-based chatbot(s).

    Using Video Clips as Dictionary Usage Examples

    Publication No.: US20220405478A1

    Publication date: 2022-12-22

    Application No.: US17774460

    Filing date: 2019-11-04

    Applicant: Google LLC

    Abstract: Implementations are provided for automatically mining corpus(es) of electronic video files for video clips that contain spoken utterances that are suitable usage examples to accompany or complement dictionary definitions. These video clips may then be associated with target n-grams in a searchable database, such as a database underlying an online dictionary. In various implementations, a set of candidate video clips in which a target n-gram is uttered in a target context may be identified from a corpus of electronic video files. For each candidate video clip of the set, pre-existing manual subtitles associated with the candidate video clip may be compared to text generated based on speech recognition processing of an audio portion of the candidate video clip. Based at least in part on the comparing, a measure of suitability as a dictionary usage example may be calculated for the candidate video clip.

    VOICE WRAPPER(S) FOR EXISTING THIRD-PARTY TEXT-BASED CHATBOT(S)

    Publication No.: US20250097168A1

    Publication date: 2025-03-20

    Application No.: US18370239

    Filing date: 2023-09-19

    Applicant: GOOGLE LLC

    Abstract: Implementations are directed to providing a voice wrapper to an existing third-party text-based chatbot to enable the existing third-party text-based chatbot to engage in corresponding voice-based conversations. The voice wrapper can include a plurality of components. For instance, the voice wrapper can include a plurality of input components for utilization in responding to a spoken utterance, and in lieu of the existing third-party text-based chatbot, and/or to modify input to be provided to the existing third-party text-based chatbot in responding to the spoken utterance. Also, for instance, the voice wrapper can include a plurality of output components for utilization in responding to the spoken utterance, to reduce perceived latency of the existing third-party text-based chatbot, and/or to modify output generated by the existing third-party text-based chatbot in responding to the spoken utterance.

    Using video clips as dictionary usage examples

    Publication No.: US12197868B2

    Publication date: 2025-01-14

    Application No.: US17774460

    Filing date: 2019-11-04

    Applicant: Google LLC

    Abstract: Implementations are provided for automatically mining corpus(es) of electronic video files for video clips that contain spoken utterances that are suitable usage examples to accompany or complement dictionary definitions. These video clips may then be associated with target n-grams in a searchable database, such as a database underlying an online dictionary. In various implementations, a set of candidate video clips in which a target n-gram is uttered in a target context may be identified from a corpus of electronic video files. For each candidate video clip of the set, pre-existing manual subtitles associated with the candidate video clip may be compared to text generated based on speech recognition processing of an audio portion of the candidate video clip. Based at least in part on the comparing, a measure of suitability as a dictionary usage example may be calculated for the candidate video clip.
