Abstract:
Embodiments of a progressive feature selection method that selects features in multiple rounds are described. In one embodiment, the method splits the feature space into tractable sub-spaces so that a feature selection algorithm can be run on each sub-space. In a merge-split operation, the subsets of features that the algorithm selects from the different sub-spaces are merged into subsequent sets of features. Instead of regenerating the mapping table for each subsequent set from scratch, a new mapping table is created from the previous round's tables by collecting the entries that correspond to the selected features. The feature selection algorithm is then run again on each of the subsequent feature sets, and new features are selected from each. This select-merge-split process is repeated on successively smaller numbers of feature sets until a single final set of features is selected.
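The rounds described in this abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the mapping table is assumed to be a plain dictionary from feature name to a precomputed score, and the names `progressive_select` and `keep_top_half` and the fixed sub-space size are hypothetical choices made only for illustration.

```python
from typing import Callable, Dict, List

Table = Dict[str, float]  # assumed shape: feature name -> precomputed entry


def keep_top_half(sub_table: Table) -> List[str]:
    """Toy selector (an assumption): keep the higher-scoring half."""
    ranked = sorted(sub_table, key=sub_table.get, reverse=True)
    return ranked[: max(1, len(ranked) // 2)]


def progressive_select(
    table: Table,
    select: Callable[[Table], List[str]],
    max_subspace_size: int,
) -> Table:
    """Repeat split -> select -> merge until one feature set remains."""
    if max_subspace_size < 1:
        raise ValueError("max_subspace_size must be at least 1")
    current = dict(table)
    while True:
        names = list(current)
        # Split the feature space into tractable sub-spaces.
        chunks = [
            names[i : i + max_subspace_size]
            for i in range(0, len(names), max_subspace_size)
        ]
        # Run the selector on each sub-space and merge what it keeps.
        merged: List[str] = []
        for chunk in chunks:
            merged.extend(select({name: current[name] for name in chunk}))
        # Build the next table from the previous one, not from scratch.
        next_table = {name: current[name] for name in merged}
        if len(chunks) <= 1:
            return next_table  # single final set of features
        if len(next_table) >= len(current):
            raise RuntimeError("selector did not shrink the feature set")
        current = next_table


scores = {f"f{i}": float(i) for i in range(16)}
final = progressive_select(scores, keep_top_half, max_subspace_size=4)
```

With 16 toy features and sub-spaces of four, the sketch runs three rounds (four sub-spaces, then two, then one). Each round's table is a filtered copy of the previous round's entries.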
Abstract:
Embodiments of an interface system that enables a call center agent to access and intervene in an interaction between an automated call center system and a caller, whenever necessary for complex application tasks, are described. The system includes a user interface that presents the agent with one or more categories of information, including the conversation flow, the obtained semantic information, the recognized utterances, and access to the utterance waveforms. This information is cross-linked and annotated with a confidence level for better access and navigation within the dialog system and for the generation of appropriate responses to the caller.
Abstract:
Systems and methods are described that automatically generate interactive systems configured for collecting dialog data of human-machine interactions in dialog systems. The systems and methods comprise receiving a task flow that describes operations of a dialog system. A formal description of the task flow is generated, and an interactive system comprising a graphical user interface (GUI) is automatically generated from the formal description. The GUI consists of templates for control of the dialog system and for real-time collection and annotation of dialog data during a live dialog between only the dialog system and callers to the dialog system. The dialog data consists of data of the live dialog.
Abstract:
Systems and methods are described that automatically control modules of dialog systems. The systems and methods include a dialog module that receives and processes utterances from a speaker and outputs data used to generate synthetic speech outputs as responses to the utterances. A controller is coupled to the dialog module, and the controller detects an abnormal output of the dialog module when the dialog module is processing in an automatic mode. The controller comprises a mode control that lets an agent control the dialog module by correcting the abnormal output and transferring the corrected output to a downstream dialog module that follows the dialog module in the processing path. The corrected output is used in further processing of the utterances.
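The control path in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say how an abnormal output is detected, so the sketch assumes a confidence threshold, and `ModuleOutput`, `Controller`, and the `toy_*` functions are hypothetical stand-ins for the dialog module, the agent, and the downstream module.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class ModuleOutput:
    text: str
    confidence: float  # hypothetical score in [0, 1]


class Controller:
    """Routes abnormal module output to an agent before passing it on."""

    def __init__(
        self,
        module: Callable[[str], ModuleOutput],
        agent_correct: Callable[[str, ModuleOutput], ModuleOutput],
        downstream: Callable[[ModuleOutput], str],
        min_confidence: float = 0.5,
    ) -> None:
        self.module = module
        self.agent_correct = agent_correct
        self.downstream = downstream
        self.min_confidence = min_confidence

    def is_abnormal(self, output: ModuleOutput) -> bool:
        # Assumption: "abnormal" means low confidence.
        return output.confidence < self.min_confidence

    def handle(self, utterance: str) -> str:
        output = self.module(utterance)  # automatic mode
        if self.is_abnormal(output):
            # The agent corrects the output before it moves downstream.
            output = self.agent_correct(utterance, output)
        return self.downstream(output)


def toy_recognizer(utterance: str) -> ModuleOutput:
    # Pretend that utterances containing "mumble" are recognized poorly.
    if "mumble" in utterance:
        return ModuleOutput(text="???", confidence=0.2)
    return ModuleOutput(text=utterance, confidence=0.9)


def toy_agent(utterance: str, bad_output: ModuleOutput) -> ModuleOutput:
    # Stand-in for a human agent supplying the correct text.
    return ModuleOutput(text=utterance.replace("mumble ", ""), confidence=1.0)


def toy_downstream(output: ModuleOutput) -> str:
    return f"understood: {output.text}"


controller = Controller(toy_recognizer, toy_agent, toy_downstream)
```

Calling `controller.handle(...)` with a clean utterance stays in automatic mode. An utterance that the toy recognizer scores poorly passes through the agent first.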
Abstract:
An in-vehicle infotainment system, smart home information access and device control unit, or mobile system presents summarized information to a user based on a user preference model that is associated with the user. The system modifies the presentation of information to the user based on environmental context data about the vehicle and user context data about the activity of the user. During presentation of the information, the system modifies the content and presentation of the summarized information in response to multi-modal input requests from the user.
Abstract:
Embodiments of a dialog system that utilizes a multi-modal input interface for recognizing user input in human-machine interaction (HMI) systems are described. Embodiments include a component that receives user input from a plurality of different user input mechanisms (multi-modal input) and performs certain synchronization and disambiguation processes. The multi-modal input component synchronizes and integrates the information obtained from the different modalities, disambiguates the input, and recovers from errors that might be produced with respect to any of the user inputs. Such a system addresses ambiguity associated with the user input and corrects for errors in the human-machine interaction.
Abstract:
In a confidence computing method and system, a processor may interpret speech signals as a text string or directly receive a text string as input, generate a syntactical parse tree representing the interpreted string and including a plurality of sub-trees which each represents a corresponding section of the interpreted text string, determine for each sub-tree whether the sub-tree is accurate, obtain replacement speech signals for each sub-tree determined to be inaccurate, and provide output based on corresponding text string sections of at least one sub-tree determined to be accurate.
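The per-sub-tree flow in this abstract can be sketched as below. The abstract does not state how accuracy is decided, so the sketch assumes each sub-tree already carries a confidence score that is compared with a threshold. `SubTree`, `resolve_subtrees`, the `reask` callback, and the retry limit are hypothetical names and parameters introduced for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class SubTree:
    label: str         # e.g. a phrase category such as "NP"
    text: str          # the section of the interpreted string it covers
    confidence: float  # assumed to be computed elsewhere


def resolve_subtrees(
    subtrees: List[SubTree],
    reask: Callable[[SubTree], SubTree],
    threshold: float = 0.6,
    max_retries: int = 2,
) -> str:
    """Keep accurate sub-trees; request replacements for the others."""
    accepted: List[SubTree] = []
    for tree in subtrees:
        retries = 0
        # Obtain a replacement only for the section judged inaccurate.
        while tree.confidence < threshold and retries < max_retries:
            tree = reask(tree)
            retries += 1
        if tree.confidence >= threshold:
            accepted.append(tree)
    # Output is based only on sections judged accurate.
    return " ".join(tree.text for tree in accepted)
```

Because replacement is requested per sub-tree, a speaker would only need to repeat the doubtful section rather than the whole utterance.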
Abstract:
Embodiments of a dialog system that utilizes a grammar-based labeling scheme to generate labeled sentences for use in training statistical models are described. During training data development, a grammar is constructed manually based on the application domain or adapted from a general grammar rule. An annotation schema is created based on the application requirements, such as syntactic and semantic information, and this information is included in the grammar specification. After the labeled grammar is constructed, a generation algorithm is used to generate sentences for training various statistical models.
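Generating labeled sentences from a labeled grammar can be sketched in a few lines. The grammar format, the toy rules, the label names, and the exhaustive expansion strategy are all assumptions for illustration; the abstract does not specify the generation algorithm. The sketch also assumes a non-recursive grammar so that the expansion terminates.

```python
from itertools import product
from typing import Dict, List, Tuple, Union

Token = Tuple[str, str]      # (word, label) terminal carrying its annotation
Symbol = Union[str, Token]   # a non-terminal name or a labeled terminal
Grammar = Dict[str, List[List[Symbol]]]

# Hypothetical toy grammar; the labels are part of the grammar itself.
GRAMMAR: Grammar = {
    "S": [["VERB", "ITEM"]],
    "VERB": [[("play", "O")], [("find", "O")]],
    "ITEM": [[("jazz", "GENRE")], [("Yesterday", "SONG")]],
}


def generate(symbol: Symbol, grammar: Grammar) -> List[List[Token]]:
    """Expand a symbol into every labeled token sequence it can derive."""
    if isinstance(symbol, tuple):
        return [[symbol]]  # a labeled terminal derives only itself
    sentences: List[List[Token]] = []
    for alternative in grammar[symbol]:
        # Expand each symbol on the right-hand side, then combine them.
        parts = [generate(child, grammar) for child in alternative]
        for combo in product(*parts):
            sentences.append([token for part in combo for token in part])
    return sentences


labeled_sentences = generate("S", GRAMMAR)
```

Each generated sentence is a list of (word, label) pairs, which is a common input shape for training sequence labelers. A real grammar with recursion would need sampling or a depth limit instead of exhaustive expansion.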