Abstract:
Embodiments of an automated dialog system testing method and component are described. The automated testing method and system supplements testing by real human users with simulated user input, and incorporates a set of evaluation measures that focus on three basic aspects of task-oriented dialog systems: understanding ability, efficiency, and the appropriateness of system actions. These measures are first applied to a corpus of dialogs between a dialog system and a group of human users to validate the measures against the human users' satisfaction levels. Results generally show that the measures are significantly correlated with these satisfaction levels. A regression model is then built to predict user satisfaction scores from the evaluation measures. The regression model is applied to a simulated dialog corpus trained from the above real user corpus, and the user satisfaction scores estimated from the simulated dialogs do not significantly differ from the real users' satisfaction scores. The evaluation measures can then be used to assess system performance based on the estimated user satisfaction.
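The regression step described in this abstract can be illustrated with a minimal sketch. It assumes an ordinary least-squares linear model over three per-dialog measures (understanding, efficiency, appropriateness); the abstract does not state the model form, so the linear form and the function names `fit_satisfaction_model` and `predict_satisfaction` are hypothetical.

```python
from typing import List, Sequence


def solve_linear_system(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("measures are collinear; model is not identifiable")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def fit_satisfaction_model(
    measures: Sequence[Sequence[float]], scores: Sequence[float]
) -> List[float]:
    """Fit weights [intercept, w1, w2, ...] by solving the normal equations.

    Assumption: each row of `measures` holds the evaluation measures of one
    real-user dialog, and `scores` holds the matching satisfaction ratings.
    """
    rows = [[1.0, *m] for m in measures]  # prepend intercept term
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, scores)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def predict_satisfaction(weights: Sequence[float], measure: Sequence[float]) -> float:
    """Estimate the satisfaction score of one (possibly simulated) dialog."""
    return weights[0] + sum(w * v for w, v in zip(weights[1:], measure))
```

Under these assumptions, the weights would be fitted on the real-user corpus and `predict_satisfaction` would then be applied to the measures computed from each simulated dialog.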
Abstract:
A method and system are described for adapting instructions for a task to be performed by a user. The method includes receiving generalized instructions for the task, selecting content from the generalized instructions based on user-specific knowledge regarding the task, constructing utterances using the selected content, and conveying the utterances to the user.
Abstract:
This invention provides a new and improved voice communication system with a high-quality noise cancellation method and devices that overcome the limitations and difficulties encountered in conventional technologies. The invention discloses a noise cancellation apparatus that includes a vibration sensor and a microphone for receiving and transmitting voice signals as incoming speech. The vibration sensor receives vibration signals corresponding to the voice signals, and these vibration signals serve as reference signals for removing noise generated by the environment. The vibration signals are converted to an intermediate PDL representation together with the speaker characteristics, mapped into a full-band, high-quality clean acoustic representation, and used to synthesize clear personal speech whose characteristics are identical to the original microphone speech but without the noise.
Abstract:
Embodiments of a configurable content optimizer for use in dialog systems are described. In one embodiment, the content optimizer is a configurable component that acts as an intermediary between a dialog management module and a knowledge management module of a dialog system during the query process. The content optimizer makes extensive use of the system ontology, organizes the items returned by the knowledge base, and adjusts the query so that a reasonable number of responses is returned. Each query is broken down into a number of constraints, the constraints are characterized by type, and adjustments are made by strategies that include relaxing or tightening constraints in the query. Generic strategies for the potential adjustments are represented in a configurable manner so that the content optimizer can be easily applied to new domains.
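The relax-or-tighten behavior described in this abstract can be sketched as follows. This is an illustrative toy, not the described optimizer: it assumes constraints are simple attribute/value pairs ordered from most to least important, ignores the ontology, and uses hypothetical names (`run_query`, `optimize_query`, `min_results`, `max_results`).

```python
from typing import Dict, List, Tuple

Item = Dict[str, str]
Constraint = Tuple[str, str]  # (attribute, required value)


def run_query(kb: List[Item], constraints: List[Constraint]) -> List[Item]:
    """Return the knowledge-base items that satisfy every constraint."""
    return [item for item in kb if all(item.get(a) == v for a, v in constraints)]


def optimize_query(
    kb: List[Item],
    constraints: List[Constraint],
    extra: List[Constraint],
    min_results: int = 1,
    max_results: int = 3,
) -> Tuple[List[Constraint], List[Item]]:
    """Adjust a query until it returns between min_results and max_results items.

    Assumptions: `constraints` is ordered from most to least important, and
    `extra` lists optional constraints that may be added to narrow the query.
    """
    active = list(constraints)
    spare = list(extra)
    results = run_query(kb, active)

    # Relax: drop the least important constraint while too few items match.
    while len(results) < min_results and active:
        active.pop()
        results = run_query(kb, active)

    # Tighten: add spare constraints while too many items match, but never
    # accept a constraint that would push the result count below min_results.
    while len(results) > max_results and spare:
        candidate = active + [spare.pop(0)]
        narrowed = run_query(kb, candidate)
        if len(narrowed) >= min_results:
            active, results = candidate, narrowed

    return active, results
```

Keeping the strategies as plain data (the ordering of `constraints`, the contents of `extra`, and the two thresholds) is one way to make the adjustments configurable per domain, in the spirit of the abstract.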
Abstract:
In a system and method for fulfilling a service query for a user, a processor may parse the query into a set of operations, identify a set of service providers that each provides functionality for performing at least one respective operation of the set of operations, and, for each of the set of operations, select a respective one of the set of service providers to perform the operation, and interface with the service provider selected for the operation to cause the service provider to perform the operation.
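The parse, identify, select, and invoke steps in this abstract can be sketched as below. The query syntax (`operation: argument` pairs separated by semicolons), the `ServiceProvider` class, and the lowest-cost selection rule are all assumptions made for illustration; the abstract does not specify any of them.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class ServiceProvider:
    """Hypothetical provider: a name, a cost, and the operations it supports."""
    name: str
    cost: float
    operations: Dict[str, Callable[[str], str]]


def parse_query(query: str) -> List[Tuple[str, str]]:
    """Toy parser: split 'op: arg; op: arg' into (operation, argument) pairs."""
    parsed = []
    for part in query.split(";"):
        if part.strip():
            op, _, arg = part.partition(":")
            parsed.append((op.strip(), arg.strip()))
    return parsed


def fulfill(query: str, providers: List[ServiceProvider]) -> List[Tuple[str, str, str]]:
    """For each operation, pick one capable provider and have it perform the work."""
    results = []
    for op, arg in parse_query(query):
        # Identify the providers that offer this operation.
        candidates = [p for p in providers if op in p.operations]
        if not candidates:
            raise LookupError(f"no provider for operation {op!r}")
        # Select one provider (assumed rule: lowest cost) and invoke it.
        chosen = min(candidates, key=lambda p: p.cost)
        results.append((op, chosen.name, chosen.operations[op](arg)))
    return results
```

A single query can therefore be served by several providers, one per operation, which is the point of selecting a provider separately for each operation.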
Abstract:
A computerized method for building and running natural language understanding systems is described. A natural language understanding system takes a sentence as input and returns a representation of the possible meanings of the sentence as output (the “interpretation”). The method uses a run-time interpreter that assigns interpretations to sentences, and a compiler that produces (in a computer memory) the internal specification needed by the run-time interpreter from a user specification of the semantics of the application. The compiler builds a natural language system, while the run-time interpreter runs the system.
Abstract:
A method of receiving input from a user includes sensing a first trajectory of a center of mass of a hand of the user during a gesture made by the hand. A second trajectory of a finger tip of the hand of the user during the gesture made by the hand is also sensed. An alphanumeric character represented by the gesture made by the hand is determined dependent upon both the first trajectory and the second trajectory.
Abstract:
Embodiments of a method and system for detecting repeated patterns in dialog systems are described. The system includes a dynamic time warping (DTW) based pattern comparison algorithm that is used to find the best matching parts between a correction utterance and an original utterance. Reference patterns are generated from the correction utterance by an unsupervised segmentation scheme. No significant information about the position of the repeated parts in the correction utterance is assumed, as each reference pattern is compared with the original utterance from the beginning of the utterance to the end. A pattern comparison process with DTW is executed without knowledge of fixed end-points. A recursive DTW computation is executed to find the best matching parts that are considered as the repeated parts as well as the end-points of the utterance.
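Matching a reference pattern against an utterance without fixed end-points can be illustrated with standard subsequence DTW. This is a swapped-in textbook technique, not the recursive DTW computation the abstract describes, and it assumes scalar per-frame features with absolute difference as the local distance; real systems would typically compare feature vectors. The function name `subsequence_dtw` is hypothetical.

```python
from typing import List, Sequence, Tuple


def subsequence_dtw(
    pattern: Sequence[float], sequence: Sequence[float]
) -> Tuple[float, int, int]:
    """Find the segment of `sequence` that best matches `pattern`.

    Returns (cost, start, end), where start and end are inclusive indices
    into `sequence`. Both end-points are free: the match may begin and end
    anywhere in `sequence`.
    """
    if not pattern or not sequence:
        raise ValueError("pattern and sequence must be non-empty")
    n, m = len(pattern), len(sequence)
    cost: List[List[float]] = [[0.0] * m for _ in range(n)]
    start: List[List[int]] = [[0] * m for _ in range(n)]

    # Free start: the first pattern frame may align with any sequence frame.
    for j in range(m):
        cost[0][j] = abs(pattern[0] - sequence[j])
        start[0][j] = j

    for i in range(1, n):
        cost[i][0] = cost[i - 1][0] + abs(pattern[i] - sequence[0])
        start[i][0] = 0
        for j in range(1, m):
            # Extend the cheapest of the three predecessor alignments and
            # carry along the sequence index where that alignment began.
            prev_cost, prev_start = min(
                (cost[i - 1][j - 1], start[i - 1][j - 1]),
                (cost[i - 1][j], start[i - 1][j]),
                (cost[i][j - 1], start[i][j - 1]),
            )
            cost[i][j] = prev_cost + abs(pattern[i] - sequence[j])
            start[i][j] = prev_start

    # Free end: choose the cheapest cell in the last pattern row.
    end = min(range(m), key=lambda j: cost[n - 1][j])
    return cost[n - 1][end], start[n - 1][end], end
```

In the setting of the abstract, `pattern` would correspond to one reference pattern segmented from the correction utterance and `sequence` to the original utterance; a low cost suggests the segment between `start` and `end` is a repeated part.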