摘要:
A voice user interface authoring tool is configured to use categorized example caller responses, from which callflow paths, automatic speech recognition, and natural language processing control files can be generated automatically within a single, integrated authoring user interface. A voice user interface (VUI) design component allows an author to create an application incorporating various types of action nodes, including Prompt/Response Processing (PRP) nodes. At runtime, the system uses the information from each PRP node to prompt a user to say something, and to process the user's response in order to extract its meaning. An Automatic Speech Recognition/Natural Language Processing (ASR/NLP) Control Design component allows the author to associate sample inputs with each possible meaning, and automatically generates the necessary ASR and NLP runtime control files. The VUI design component allows the author to associate the appropriate ASR and NLP control files with each PRP node, and to associate an action node with each possible meaning, as indicated by the NLP control file.
摘要:
A voice user interface authoring tool is configured to use categorized example caller responses, from which callflow paths, automatic speech recognition, and natural language processing control files can be generated automatically within a single, integrated authoring user interface. A voice user interface (VUI) design component allows an author to create an application incorporating various types of action nodes, including Prompt/Response Processing (PRP) nodes. At runtime, the system uses the information from each PRP node to prompt a user to say something, and to process the user's response in order to extract its meaning. An Automatic Speech Recognition/Natural Language Processing (ASR/NLP) Control Design component allows the author to associate sample inputs with each possible meaning, and automatically generates the necessary ASR and NLP runtime control files. The VUI design component allows the author to associate the appropriate ASR and NLP control files with each PRP node, and to associate an action node with each possible meaning, as indicated by the NLP control file.
摘要:
A method of executing operations in a voice-activated command system includes automatically initiating execution of a default operation. A user is then prompted, after the default operation has been initiated, to determine whether the user wishes to execute a second operation instead of the default operation. If the user wishes to execute the second operation instead of the default operation, execution of the default operation is terminated and execution of the second operation is initiated. In voice-activated and other command systems, such as voice dialing systems, this method allows the command system to execute the most probable operation without delay, while still making the system easily navigable by naïve users. Systems, computer readable medium and apparatus which implement the methods of the present invention are also disclosed.
摘要:
A follow-up call to a user is made after completion of a first call with a voice user interface module operable on a computer. The voice user interface module inquiries about information communicated in the first call.
摘要:
A method of generating an optimized grammar, for use in speech recognition, from a data set or big list of items, is disclosed. The method includes the steps of obtaining a tree representing items in the data set, and generating the grammar using the tree. The tree or tree data structure representing items in the data set is a simulated recognition search tree, representing items in the data set, which can be automatically generated from the data set.
摘要:
A method of providing information to a user in a telephone interactive system includes receiving a new call. A comparison is then made between an identifier associated with the new call with stored call information pertaining to previous calls. If the identifier associated with the new call matches an identifier associated with a previous call, a subsequent action taken in the new call is based on context information stored from the previous call.
摘要:
A method of querying a user to select from a list in a voice-activated command system is provided. The method includes generating command prompt phrases during which the user can select items on the list. The command prompt phrases include an item on the list and an index for another item on the list. In some embodiments, each command prompt phrase also includes a period of silence between item on the list and the index for another item on the list. If a user selecting barge-in is received during a particular command prompt phrase, the corresponding item on the list is selected.
摘要:
A process for collecting the identity of a telephone caller is disclosed. In one embodiment, a personalized Context Free Grammar (CFG) is created for each potential call recipient, and is configured to support identification of incoming callers utilizing voice recognition. Each CFG incorporates an indication of high probability callers and probability weights in each CFG are altered accordingly. When a recipient receives a call, the relevant CFG is applied in association with a voice recognition application to enable at least a preliminary identification of the caller. In accordance with another embodiment, the caller confirms identifications. In accordance with one embodiment, standard caller-ID functionality is utilized if possible at least to assist in the caller identification process. In accordance with still another embodiment, voice recognition enhanced caller identification is utilized to provide intelligent call routing functionality.
摘要:
A speech recognition application including a recognition module configured to receive input utterances and an application module configured to select a recognition from the speech recognition module using output from a first iteration to select a recognition result for a second iteration. In one embodiment, the application module eliminates a previous rejected recognition result or results from the N-Best list for recognition. In another embodiment, the application module rescores N-Best entries based upon N-Best lists or information from another iteration. In another illustrated embodiment, the application module uses a limited grammar from a current N-Best list for subsequent recognition, for example for rerecognition using a recorded input from a previous iteration.
摘要:
A method of providing voice dialing assistance includes providing a first input to a speech recognition engine, with the first input corresponding to a speech sample provided by a caller attempting to reach an intended call recipient. A speech recognition output is generated in response to the first input. A potential call recipient is identified based upon the speech recognition output. A confirmation that the potential call recipient is the intended call recipient is implemented using a personal recording made by the potential call recipient.