Abstract:
Example methods, apparatus and articles to manage routing in networks are disclosed. A disclosed example method includes identifying a first network element associated with a problematic network element; identifying a first maximum transmission unit of the first network element and a second maximum transmission unit of the problematic network element; determining whether the first and second maximum transmission units are different; and when the first and second maximum transmission units are different, identifying a value of a greater one of the first and second maximum transmission units and configuring the first network element and the problematic network element to each have the identified value.
Abstract:
Structure of conversations between users and agents and/or systems is discovered and interactively displayed to analysts, thereby better supporting development of automated conversation handling systems for different domains. A corpus of prior dialogs of users with agents (without preexisting semantic labels indicating purposes for different parts of the dialogs) is taken as input, and embeddings are generated for textual units (e.g., rounds) of the dialogs. The embeddings are used to cluster the textual units, and the clusters and their relationships are visualized within a user interface that analysts may use to explore and fine-tune the structure of the conversations.
Abstract:
Within an environment in which users converse at least partly with human agents to accomplish a desired task, a server assists the agents by identifying workflows that are most applicable to the current conversation. Workflow selection functionality identifies one or more candidate workflows based on techniques such as user intent inference, conversation state tracking, or search, according to various embodiments. The identified candidate workflows are either automatically selected on behalf of the agent, or are presented to the agent for manual selection.
Abstract:
A computer-implemented method for providing agent assisted transcriptions of user utterances. A user utterance is received in response to a prompt provided to the user at a remote client device. An automatic transcription is generated from the utterance using a language model based upon an application or context, and presented to a human agent. The agent reviews the transcription and may replace at least a portion of the transcription with a corrected transcription. As the agent inputs the corrected transcription, accelerants are presented to the user comprising suggested texted to be inputted. The accelerants may be determined based upon an agent input, an application or context of the transcription, the portion of the transcription being replaced, or any combination thereof. In some cases, the user provides textual input, to which the agent transcribes an intent associated with the input with the aid of one or more accelerants.
Abstract:
An interactive response system combines human intelligence (HI) subsystems with artificial intelligence (AI) subsystems to facilitate overall capability of multi-channel user interfaces. The system permits imperfect AI subsystems to nonetheless lessen the burden on HI subsystems. A combined AI and HI proxy is used to implement an interactive omnichannel system, and the proxy dynamically determines how many AI and HI subsystems are to perform recognition for any particular utterance, based on factors such as confidence thresholds of the AI recognition and availability of HI resources. Furthermore the system uses information from prior recognitions to automatically build, test, predict confidence, and maintain AI models and HI models for system recognition improvements.
Abstract:
A natural language processing system has a hierarchy of user intents related to a domain of interest, the hierarchy having specific intents corresponding to leaf nodes of the hierarchy, and more general intents corresponding to ancestor nodes of the leaf nodes. The system also has a trained understanding model that can classify natural language utterances according to user intent. When the understanding model cannot determine with sufficient confidence that a natural language utterance corresponds to one of the specific intents, the natural language processing system traverses the hierarchy of intents to find a more general user intent that is related to the most applicable specific intent of the utterance and for which there is sufficient confidence. The general intent can then be used to prompt the user with questions applicable to the general intent to obtain the missing information needed for a specific intent.
Abstract:
A masking system prevents a human agent from receiving sensitive personal information (SPI) provided by a caller during caller-agent communication. The masking system includes components for detecting the SPI, including automated speech recognition and natural language processing systems. When the caller communicates with the agent, e.g., via a phone call, the masking system processes the incoming caller audio. When the masking system detects SPI in the caller audio stream or when the masking system determines a high likelihood that incoming caller audio will include SPI, the caller audio is masked such that it cannot be heard by the agent. The masking system collects the SPI from the caller audio and sends it to the organization associated with the agent for processing the caller's request or transaction without giving the agent access to caller SPI.
Abstract:
Disclosed herein are methods for presenting speech from a selected text that is on a computing device. This method includes presenting text on a touch-sensitive display and having that text size within a threshold level so that the computing device can accurately determine the intent of the user when the user touches the touch screen. Once the user touch has been received, the computing device identifies and interprets the portion of text that is to be selected, and subsequently presents the text audibly to the user.
Abstract:
Disclosed herein are systems, methods, and computer-readable storage media for selecting a speech recognition model in a standardized speech recognition infrastructure. The system receives speech from a user, and if a user-specific supervised speech model associated with the user is available, retrieves the supervised speech model. If the user-specific supervised speech model is unavailable and if an unsupervised speech model is available, the system retrieves the unsupervised speech model. If the user-specific supervised speech model and the unsupervised speech model are unavailable, the system retrieves a generic speech model associated with the user. Next the system recognizes the received speech from the user with the retrieved model. In one embodiment, the system trains a speech recognition model in a standardized speech recognition infrastructure. In another embodiment, the system handshakes with a remote application in a standardized speech recognition infrastructure.
Abstract:
Disclosed herein are systems, computer-implemented methods, and computer-readable media for recognizing speech. The method includes receiving speech from a user, perceiving at least one speech dialect in the received speech, selecting at least one grammar from a plurality of optimized dialect grammars based on at least one score associated with the perceived speech dialect and the perceived at least one speech dialect, and recognizing the received speech with the selected at least one grammar. Selecting at least one grammar can be further based on a user profile. Multiple grammars can be blended. Predefined parameters can include pronunciation differences, vocabulary, and sentence structure. Optimized dialect grammars can be domain specific. The method can further include recognizing initial received speech with a generic grammar until an optimized dialect grammar is selected. Selecting at least one grammar from a plurality of optimized dialect grammars can be based on a certainty threshold.