摘要:
A method of providing voice dialing assistance includes providing a first input to a speech recognition engine, with the first input corresponding to a speech sample provided by a caller attempting to reach an intended call recipient. A speech recognition output is generated in response to the first input. A potential call recipient is identified based upon the speech recognition output. A confirmation that the potential call recipient is the intended call recipient is implemented using a personal recording made by the potential call recipient.
摘要:
A method is disclosed from constructing a grammar. The grammar is configured to be processed by a speech recognition engine in the context of a voice-activated command system. The method includes receiving a database containing a plurality of terms. From the plurality of terms, first and second terms are identified. The first and second terms are spelled differently but have a first pronunciation in common. One of the first and second terms also has a second pronunciation that is not inherent to the other of the first and second terms. The first and second pronunciations are placed within the grammar.
摘要:
A method of providing information to a user in a telephone interactive system includes receiving a new call. A comparison is then made between an identifier associated with the new call with stored call information pertaining to previous calls. If the identifier associated with the new call matches an identifier associated with a previous call, a subsequent action taken in the new call is based on context information stored from the previous call.
摘要:
A method of querying a user to select from a list in a voice-activated command system is provided. The method includes generating command prompt phrases during which the user can select items on the list. The command prompt phrases include an item on the list and an index for another item on the list. In some embodiments, each command prompt phrase also includes a period of silence between item on the list and the index for another item on the list. If a user selecting barge-in is received during a particular command prompt phrase, the corresponding item on the list is selected.
摘要:
A method is disclosed from constructing a grammar. The grammar is configured to be processed by a speech recognition engine in the context of a voice-activated command system. The method includes receiving a database containing a plurality of terms. From the plurality of terms, first and second terms are identified. The first and second terms are spelled differently but have a first pronunciation in common. One of the first and second terms also has a second pronunciation that is not inherent to the other of the first and second terms. The first and second pronunciations are placed within the grammar.
摘要:
A computer-implemented method is disclosed for creating a grammar to be processed by a speech recognition engine in the context of a voice-activated command system. The method includes receiving a database containing a plurality of terms and identifying a set of terms that are pronounced the same but spelled differently. The method also includes placing a single term within the grammar to represent the set of terms.
摘要:
A computer-implemented method is disclosed for creating a grammar to be processed by a speech recognition engine in the context of a voice-activated command system. The method includes receiving a database containing a plurality of terms and identifying a set of terms that are pronounced the same but spelled differently. The method also includes placing a single term within the grammar to represent the set of terms.
摘要:
A method of allowing a user to provide constrained, mixed-initiative utterances in order to improve accuracy and avoid disambiguation dialogs when recognition of a user's audible input would otherwise render a number of possible selections from the database or list is provided. A grammar is adapted to include additional information associated with at least some of the entries. The additional information forms part of the information conveyed by the use in the constrained, mixed-initiative utterance.
摘要:
A speech recognition application including a recognition module configured to receive input utterances and an application module configured to select a recognition from the speech recognition module using output from a first iteration to select a recognition result for a second iteration. In one embodiment, the application module eliminates a previous rejected recognition result or results from the N-Best list for recognition. In another embodiment, the application module rescores N-Best entries based upon N-Best lists or information from another iteration. In another illustrated embodiment, the application module uses a limited grammar from a current N-Best list for subsequent recognition, for example for rerecognition using a recorded input from a previous iteration.
摘要:
A method of generating an optimized grammar, for use in speech recognition, from a data set or big list of items, is disclosed. The method includes the steps of obtaining a tree representing items in the data set, and generating the grammar using the tree. The tree or tree data structure representing items in the data set is a simulated recognition search tree, representing items in the data set, which can be automatically generated from the data set.