REAL-TIME NATURAL LANGUAGE PROCESSING AND FULFILLMENT

    公开(公告)号:US20240161737A1

    公开(公告)日:2024-05-16

    申请号:US18055821

    申请日:2022-11-15

    申请人: SoundHound, Inc.

    摘要: A system and method of real-time feedback confirmation to solicit a virtual assistant response from an evolving semantic state of at least a portion of an utterance. A user accesses a virtual assistant on an electronic device having the system and/or method configured to capture a command, a question, and/or a fulfillment request from audio such as, the speech emitted from the speaking user. The speech may be intercepted by a speech engine configured to transcribe the speech into text that is matched with the fragment pattern's regular expression to generate a fragment and/or the speech may be processed with a machine learning model to identify fragments. The fragments are identified by a domain handler configured to update a data structure of the current semantic state of the utterance in real-time on an interface of an electronic device.

    ALTERNATE RESPONSE GENERATION
    72.
    发明公开

    公开(公告)号:US20240153525A1

    公开(公告)日:2024-05-09

    申请号:US18507715

    申请日:2023-11-13

    摘要: Techniques for performing conversation recovery of a system/user exchange are described. In response to determining that an action responsive to a user input cannot be performed, a system may determine a topic to recommend to a user. The topic may be unrelated to the original substance of the user input. The system may have access to various data representing a context in which a user provides an input to the system. The system may use these inputs and various data at runtime to make a determination regarding whether a user should be recommended a topic, as well as what that topic should be. The system may cause a question be output to the user, with the question asking the user about the topic, for example whether the user would like a song played, whether the user would like to hear information about a particular individual (e.g., artist), whether the user would like to know about a particular skill (e.g., a skill having a significantly high popularity among users of the system), or whether the user would like to know about some other topic. If the user responds affirmatively to the recommended topic, the system may pass the user experience off to an appropriate component of the system (e.g., one that is configured to perform an action related to the topic). If the user responds negatively, does not respond at all, or the system is unsure whether the user's response was affirmative or negative, the system may cease interaction with the user, thereby enabling the user to interact with the system as the user desires.

    Multi-phrase responding in full duplex voice conversation

    公开(公告)号:US11979360B2

    公开(公告)日:2024-05-07

    申请号:US17278632

    申请日:2018-10-25

    发明人: Li Zhou

    IPC分类号: G10L15/18 G10L25/78 H04L51/02

    CPC分类号: H04L51/02 G10L15/18 G10L25/78

    摘要: The present disclosure provides method and apparatus for responding in a voice conversation by an electronic conversational agent. A voice input may be received in an audio upstream. In response to the voice input, a primary response and at least one supplementary response may be generated. A primary voice output may be generated based on the primary response. At least one supplementary voice output may be generated based on the at least one supplementary response. The primary voice output and the at least one supplementary voice output may be provided in an audio downstream, wherein the at least one supplementary voice output is provided during a time period adjacent to the primary voice output in the audio downstream.

    METHOD FOR PROVIDING SPEECH BUBBLE IN VIDEO CONFERENCE AND SYSTEM THEREOF

    公开(公告)号:US20240146878A1

    公开(公告)日:2024-05-02

    申请号:US18384219

    申请日:2023-10-26

    发明人: Ji Seon CHOI

    IPC分类号: H04N7/15 G10L15/18 G10L15/26

    摘要: Provided is a method for providing a speech bubble in a video conference. The method is performed by a user terminal and includes: receiving a first speech text converted from a voice signal of a first conference participant participating in a video conference into text; determining whether to activate a cartoon mode; displaying, based on determining to activate the cartoon mode, a conference screen including a first participant object and a first speech bubble, wherein the first participant object indicates the first conference participant and the first speech bubble is generated using the first speech text; and displaying, in response to a user input to select the first speech bubble, a sequence of speech texts of the video conference, the sequence including a speech text corresponding to the first speech bubble.

    AUTOMATED CONVERSATIONAL MENU OVERLAY WITH IVR IN CONTACT CENTERS

    公开(公告)号:US20240146842A1

    公开(公告)日:2024-05-02

    申请号:US17975833

    申请日:2022-10-28

    IPC分类号: H04M3/493 G10L15/18 H04M3/51

    摘要: At a contact center that operates to provide customer service to callers by receiving calls from the callers and directing the callers to desired endpoints that deliver information to the callers based upon an expressed intent of each caller associated with each call, a current call by the contact center is processed utilizing an Interactive Voice Response (IVR) model or a Conversational Menu (CM) model. The IVR model processes the current call by directing the current caller to a selected endpoint via navigation through a specific navigational path of nodes in a hierarchical IVR tree of nodes. The CM model processes the call by analyzing a verbal utterance of the current caller at an initiation of the current call and directing the caller to the selected endpoint so as to bypass nodes within the specific navigational path of nodes of the hierarchical IVR tree of nodes.