摘要:
A speech recognition system recognizes a customer identifier received as a speech signal from a customer. The system generates a first plurality of customer identifier choices from the speech signal and a probability associated with each of the first plurality of choices. The system includes a database having stored thereon a plurality of customer attribute parameters indexed to a second plurality of customer identifier choices. The system queries the database using the first plurality of choices, and retrieves one or more customer attribute parameters from the database. The probabilities are adjusted by the system based on the retrieved customer attribute parameters. The customer identifier is then selected from the first plurality of choices based on the adjusted probabilities.
摘要:
A speech recognition system recognizes spoken utterances received as a speech signal from a user. A prompt for requesting a spoken utterance from the user is assigned a response identifier which indicates at least one of a plurality of speech recognizers to best recognize a particular type of spoken utterance. The system includes a processor for receiving the speech signal from the user in response to the prompt. The processor also directs the speech signal to the at least one speech recognizer indicated by the response identifier. The speech recognizer generates a plurality of spoken utterance choices from the speech signal and a probability associated with each of the plurality of choices. At least one of the spoken utterance choices is selected based on the associated probabilities.
摘要:
The present invention relates to a method and apparatus for placing a message in digital data. The message is placed by manipulating certain data bits in a way that does not severely corrupt the data. The data can be mu-law encoded, wherein a value of 1 is assigned to one representation of zero, and a value of 0 is assigned to the other representation of zero. In this case, a message is placed in the data using these assigned values of 1 and 0.
摘要:
A method and apparatus for correcting misrecognized words appearing in electronic documents that have been generated by scanning an original document in accordance with an optical character recognition (“OCR”) technique. Each recognized word is generated by first producing, for each character position of the corresponding word in the original document, the N-best characters for occupying that character position. If an incorrect word is found in the electronic document, the present invention generates a plurality of reference words from which one is selected for replacing the incorrect word. This selected reference word is determined by the present invention to be the reference word that is the most likely correct replacement for the incorrect recognized word. This selection is accomplished by computing for each reference word a replacement word value. The reference word that is selected to replace the incorrect recognized word corresponds to the highest replacement word value.
摘要:
A method and apparatus for recognizing an input identifier on the basis of a set of comparison identifiers. After a user provides the input identifier according to a first form, the present invention provides a recognized identifier based on the input identifier. The present invention then generates a plurality of comparison identifiers on the basis of the recognized identifier. The user is then prompted to provide the input identifier again, but this time according to a second form that is different than the first form. A second recognized identifier is then generated on the basis of the input identifier provided according to the second form. If a match exists between the second recognized identifier and one of the comparison identifiers, the matched comparison identifier is selected as corresponding to the input identifier.
摘要:
A method and apparatus for reducing a set of reference identifiers to a candidate subset of reference identifiers. The reference identifiers are associated in memory with a plurality of index codes. A user provides an input identifier, causing a recognizing device of the present invention to produce a recognized identifier on the basis of the input identifier. The present invention determines an index code based on the recognized identifier and on the basis of a plurality of pre-stored confusion sets of characters that group together in individual confusion sets those characters having a relatively high likelihood of being confused with each other by the recognizing device. After matching the determined index code with one of the reference index codes, the present invention determines which reference index codes are within a predetermined distance of the matched reference index code. The reference identifiers that are associated with these reference index codes constitute the candidate subset of reference identifiers.
摘要:
A method and apparatus for correcting misrecognized words appearing in electronic documents that have been generated by scanning an original document in accordance with an optical character recognition (“OCR”) technique. If an incorrect word is found in the electronic document, the present invention generates at least one reference word and selects the reference word that is the most likely correct replacement for the incorrect word. This selection is accomplished by comparing each character member of every reference word to a plurality of confusion sets. On the basis of this comparison, the reference words are reduced to a smaller candidate set of reference words, from which a reference word for replacing the incorrect word is selected on the basis of predetermined criteria.
摘要:
A method and apparatus for matching at least a first input identifier with a reference identifier. A user provides an input identifier into a system, and the system produces a recognized identifier based on the input identifier. The system of the present invention perform a check-sum operation to determine whether the recognized identifier was recognized correctly. If the check-sum operation reveals that the recognized identifier is incorrect, the system of the present invention generates a plurality of substitute identifiers. The substitute identifiers are compared to a set of pre-stored reference identifiers. If a match is found between a reference identifier and a substitute identifier, the matched reference identifier is selected as corresponding to the input identifier provided by the user.
摘要:
A method and apparatus derive a dynamic grammar composed of a subset of a plurality of data elements that are each associated with one of a plurality of reference identifiers. The present invention generates a set of selection identifiers on the basis of a user-provided first input identifier and determines which of these selection identifiers are present in a set of pre-stored reference identifiers. The present invention creates a dynamic grammar that includes those data elements that are associated with those reference identifiers that are matched to any of the selection identifiers. Based on a user-provided second identifier and on the data elements of the dynamic grammar, the present invention selects one of the reference identifiers in the dynamic grammar.
摘要:
A speech recognition system recognizes a caller identifier received during a telephone call as a speech signal from a caller. The system generates a plurality of caller identifier choices from the speech signal and receives location information of the caller. The system includes a database on which is stored a plurality of caller identifiers indexed to a plurality of location information. The system queries a database based on the received location information and retrieves one or more caller identifiers from the database. The system then selects the recognized caller identifier from the plurality of caller identifier choices based on the retrieved one or more caller identifiers.