Abstract:
Disclosed is an AI avatar coaching system based on a free speech emotion analysis for acting for CS managers. The AI avatar coaching system includes: an AI avatar coach server generating an AI avatar coach video for practical counseling training, and providing the generated AI avatar coach video; an educated/inexperienced counselor terminal receiving and outputting the AI avatar coach video provided from the AI avatar coach server; a purchase customer terminal performing a voice call for a counseling inquiry of a purchase customer; a counselor terminal performing the voice call for a counselor to perform counseling processing for the counseling inquiry of the purchase customer; and an omni channel customer/company consulting service server setting a voice call session for the voice call between the purchase customer terminal and the counselor terminal, and transmitting a report for the counseling inquiry and the counseling processing, in order to act for counseling services for multiple selling company customers. With this AI avatar coaching system, a counseling video of an experienced counselor is simulated as an avatar video and provided to educated/inexperienced counselors, so that the counselors learn counseling and response methods and are trained effectively through specific practical cases.
Abstract:
A bridge for using a non-voice-based user interface, such as a text chat interface, with a voice-enabled interactive voice response system which, during a non-voice-based communication session with a client user device, receives from the client user device a non-voice entry entered by a client user into the communication session; identifies one or more elements in the non-voice entry constrained by one or more allowed responses by the voice-enabled interactive voice response system; maps the one or more elements to one or more of the allowed responses; and passes the mapped one or more identified elements to the voice-enabled interactive voice response system as an input via emulation of a voice recognition analysis response.
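The mapping step in this abstract can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: the `allowed` table of responses and synonyms, the function names, and the shape of the emulated recognition result are hypothetical and are not taken from the disclosure.

```python
from __future__ import annotations

import re


def map_entry_to_allowed(entry: str, allowed: dict[str, list[str]]) -> list[str]:
    """Return the allowed IVR responses whose synonyms appear in a text entry.

    `allowed` maps each response the IVR would accept at this prompt to a
    list of single-word synonyms (a hypothetical structure).
    """
    # Split the typed entry into lowercase word tokens.
    tokens = set(re.findall(r"[a-z0-9']+", entry.lower()))
    matches = []
    for response, synonyms in allowed.items():
        if any(s in tokens for s in synonyms):
            matches.append(response)
    return matches


def emulate_recognition_result(response: str) -> dict:
    """Wrap a mapped response in a recognizer-style result (assumed format)."""
    return {"utterance": response, "confidence": 1.0, "source": "text"}


if __name__ == "__main__":
    allowed = {
        "billing": ["billing", "bill", "invoice"],
        "support": ["support", "help"],
    }
    for r in map_entry_to_allowed("I need help with my bill", allowed):
        print(emulate_recognition_result(r))
```

Exact token matching is the simplest possible choice here; a real bridge would likely need fuzzier matching, which the abstract does not specify.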
Abstract:
A method and apparatus for processing caller experiences is disclosed. One example method may include determining a call event type occurring during a call and assigning a weight to the call event type via a processing device. The method may also include calculating a caller experience metric value representing a caller's current call status responsive to determining the at least one call event type, the caller experience metric being a function of the current event type weight and a discounting variable that discounts a value of past events. The method may also include comparing the caller experience metric to a predefined threshold value and determining whether to perform at least one of transferring the call to a live agent and switching from a current caller modality to a different caller modality.
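One plausible reading of the metric described above is an exponentially discounted running sum: the past metric is scaled down by the discounting variable and the weight of the current event is added. The sketch below assumes that form. The event names, weights, discount value, and threshold are all hypothetical, since the abstract gives no concrete values or formula.

```python
from dataclasses import dataclass

# Hypothetical event weights; negative values represent a worsening experience.
EVENT_WEIGHTS = {"no_input": -1.0, "no_match": -1.5, "success": 1.0}


@dataclass
class CallerExperience:
    discount: float = 0.5    # assumed discounting variable in [0, 1]
    threshold: float = -2.0  # assumed predefined threshold
    metric: float = 0.0

    def record(self, event_type: str) -> float:
        """Discount the past metric, then add the current event's weight."""
        self.metric = self.discount * self.metric + EVENT_WEIGHTS[event_type]
        return self.metric

    def should_escalate(self) -> bool:
        """True when the metric has fallen to or below the threshold."""
        return self.metric <= self.threshold


if __name__ == "__main__":
    ce = CallerExperience()
    for event in ["no_match", "no_match"]:
        print(event, ce.record(event), ce.should_escalate())
```

When `should_escalate()` returns true, the caller of this class would choose between transferring to a live agent and switching modality; that decision policy is left out because the abstract does not describe it.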
Abstract:
A method and a system for voice transmission control. The method comprises: receiving, by a voice answering device, a voice command and transmitting the voice command to a sound control server through a network data transmission channel; recognizing, by the sound control server, the voice command, generating corresponding second VXML control information based on a recognition result, and transmitting the second VXML control information to the voice answering device through the network data transmission channel; and performing, by the voice answering device, an operation according to the received second VXML control information. With this method, the architecture and workflow of the communication system can be simplified, and the difficulty of design thereof can be reduced.
Abstract:
Methods, systems, and devices for cross-linking events and persons using anonymized voice fingerprint identifiers (IDs) and call metadata are described. The method can include retrieving, from a centralized database, call metadata associated with a caller index ID. The method can include determining call metadata characteristics for the call metadata. The method can include matching the call metadata characteristics of the call metadata with characteristics associated with a personality type of a psychological behavioral model. The method can include generating a caller profile that comprises personality type information for the personality type. The method can include associating the caller profile with the caller index ID. The method can include storing the caller profile in an entry in the centralized database associated with the caller index ID.
Abstract:
A facility and method for analyzing and classifying calls without transcription via keyword spotting is disclosed. The facility uses a group of calls having known outcomes to generate one or more domain- or entity-specific grammars containing keywords and related information that are indicative of a particular outcome. The facility monitors telephone calls by determining the domain or entity associated with the call, loading the appropriate grammar or grammars associated with the determined domain or entity, and tracking keywords contained in the loaded grammar or grammars that are spoken during the monitored call, along with additional information. The facility performs a statistical analysis on the tracked keywords and additional information to determine a classification for the monitored telephone call.
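The load-grammar, track-keywords, classify flow can be sketched as follows. This is a deliberately simplified stand-in: the abstract says only that a statistical analysis is performed, so the weighted keyword sum, the `GRAMMARS` table, the domain name, the weights, and the cutoff are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical domain-specific grammar: keyword -> weight toward a "sale" outcome.
# In the described facility, such weights would be derived from calls with
# known outcomes; here they are hard-coded.
GRAMMARS = {
    "auto_dealer": {"test drive": 2.0, "financing": 1.5, "not interested": -3.0},
}


def classify_call(domain, spotted_keywords, cutoff=1.0):
    """Score the spotted keywords against the domain grammar and classify."""
    grammar = GRAMMARS[domain]
    # Count only keywords that belong to the loaded grammar.
    counts = Counter(k for k in spotted_keywords if k in grammar)
    score = sum(grammar[k] * n for k, n in counts.items())
    label = "sale_likely" if score >= cutoff else "sale_unlikely"
    return label, score


if __name__ == "__main__":
    print(classify_call("auto_dealer", ["test drive", "financing", "weather"]))
```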
Abstract:
A method for controlling a mute function in a telephone device, and the telephone device and a computer program product that implements the method. The method includes: while the mute function is active, voice recognition software in the telephone device processes sound detected by a microphone in the telephone device to recognize and identify one or more specific words as having been spoken by a specific person and not by another person, and in response, the telephone device activates an alarm in the telephone device to communicate that the mute function is active.
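The decision logic of this method reduces to three conditions: mute is active, the word was spoken by the specific person, and the word is one of the specific words. The sketch below assumes that speech recognition and speaker identification have already run and produced a `RecognizedWord`; that type, the function name, and the parameters are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RecognizedWord:
    speaker_id: str  # result of an assumed speaker-identification step
    word: str        # result of an assumed speech-recognition step


def mute_alarm_needed(mute_active, event, owner_id, trigger_words):
    """Return True if the device should signal that mute is still active."""
    if not mute_active:
        return False
    # Alarm only when the specific person, not someone else, says a trigger word.
    return event.speaker_id == owner_id and event.word.lower() in trigger_words


if __name__ == "__main__":
    triggers = {"hello", "yes"}
    print(mute_alarm_needed(True, RecognizedWord("alice", "Hello"), "alice", triggers))
```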
Abstract:
The present invention relates to a system for retrieving information from a network such as the Internet. A user creates a user-defined record in a database that identifies an information source, such as a web site, containing information of interest to the user. This record identifies the location of the information source and also contains a recognition grammar based upon a speech command assigned by the user. Upon receiving the speech command from the user that is described within the recognition grammar, a network interface system accesses the information source and retrieves the information requested by the user.
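The user-defined record described above pairs a source location with a recognition grammar. A minimal sketch of that data structure and its lookup is shown below; the `UserRecord` type, its field names, and the example URL are hypothetical, the grammar is reduced to a set of exact phrases, and the network retrieval step is omitted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UserRecord:
    location: str       # where the information source lives, e.g. a URL
    grammar: frozenset  # spoken phrases the user assigned to this source


def resolve_command(records, spoken):
    """Return the location of the first record whose grammar matches, else None."""
    # Normalize case and whitespace in the recognized command text.
    phrase = " ".join(spoken.lower().split())
    for record in records:
        if phrase in record.grammar:
            return record.location
    return None


if __name__ == "__main__":
    records = [UserRecord("https://example.com/weather", frozenset({"my weather"}))]
    print(resolve_command(records, "  My   Weather "))
```

A network interface system would then fetch the returned location; the sketch stops at resolving the command so that it stays self-contained.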
Abstract:
Apparatus and method for sharing state information using a web-enabled system and a phone service system are disclosed. In some embodiments, a presence module is used to identify a currently accessed web page to an agent during an on-line session. In some embodiments, documents are delivered to a user through a web browser concurrent with an audio message delivered by phone. Concurrent delivery of documents configured to accept an electronic signature is disclosed.