Abstract:
A voice dialing method includes the steps of receiving an utterance from a user, decoding the utterance to identify a recognition result for the utterance, and communicating to the user the recognition result. If an indication is received from the user that the communicated recognition result is incorrect, then it is added to a rejection reference. Then, when the user repeats the misunderstood utterance, the rejection reference can be used to eliminate the incorrect recognition result as a potential subsequent recognition result. The method can be used for single or multiple digits or digit strings.
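The rejection-reference step described in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes the decoder returns a list of (digit string, score) candidates, and the names `RejectionReference`, `pick_result`, and the sample numbers and scores are hypothetical.

```python
from typing import List, Optional, Set, Tuple

# Assumed shape of a decoder candidate: (recognized digit string, score).
Candidate = Tuple[str, float]


class RejectionReference:
    """Holds results the user has flagged as incorrect (hypothetical helper)."""

    def __init__(self) -> None:
        self._rejected: Set[str] = set()

    def add(self, result: str) -> None:
        self._rejected.add(result)

    def __contains__(self, result: str) -> bool:
        return result in self._rejected


def pick_result(candidates: List[Candidate],
                rejected: RejectionReference) -> Optional[str]:
    """Return the best-scoring candidate that is not in the rejection reference."""
    allowed = [c for c in candidates if c[0] not in rejected]
    if not allowed:
        return None
    return max(allowed, key=lambda c: c[1])[0]


# First attempt: the decoder's top candidate is communicated to the user.
rejected = RejectionReference()
first_pass = [("5551234", 0.61), ("5551254", 0.58)]
first_result = pick_result(first_pass, rejected)

# The user indicates the result is wrong, so it is added to the reference.
rejected.add(first_result)

# Repeated utterance: the rejected result can no longer be selected.
second_pass = [("5551234", 0.60), ("5551254", 0.59)]
second_result = pick_result(second_pass, rejected)
```

Filtering before ranking means a previously rejected result cannot win again, even when the decoder still scores it highest on the repeated utterance.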
Abstract:
A speech recognition method includes the steps of receiving input speech containing vocabulary, processing the input speech with a grammar to obtain N-best hypotheses and associated parameter values, and determining whether a first-best hypothesis of the N-best hypotheses is confusable with any vocabulary within the grammar. The first-best hypothesis is accepted as recognized speech corresponding to the received input speech if the first-best hypothesis is not determined to be confusable with any vocabulary within the grammar. Where the first-best hypothesis is determined to be confusable, at least one parameter value of the first-best hypothesis can be compared to at least one threshold value. The first-best hypothesis can be accepted as recognized speech corresponding to the received input speech, if the parameter value of the first-best hypothesis is greater than the threshold value.
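The acceptance rule for the first-best hypothesis can be sketched as below. The abstract does not say how confusability is determined or which parameter is compared, so this sketch assumes a precomputed lookup table (`CONFUSABLE`) and a confidence score as the parameter value; the function name and the default threshold of 0.7 are likewise hypothetical.

```python
from typing import Dict, List, Optional, Set, Tuple

# Assumed shape of a hypothesis: (text, confidence score).
Hypothesis = Tuple[str, float]

# Hypothetical confusability table: word -> words it is easily mistaken for.
CONFUSABLE: Dict[str, Set[str]] = {
    "five": {"nine"},
    "nine": {"five"},
}


def accept_first_best(n_best: List[Hypothesis],
                      grammar_vocab: Set[str],
                      threshold: float = 0.7) -> Optional[str]:
    """Accept the first-best hypothesis, or return None if it is rejected."""
    if not n_best:
        return None
    text, confidence = n_best[0]

    # Confusable only if a confusable partner is actually in the grammar.
    partners_in_grammar = CONFUSABLE.get(text, set()) & grammar_vocab
    if not partners_in_grammar:
        return text

    # Confusable: accept only if the parameter value exceeds the threshold.
    return text if confidence > threshold else None
```

For example, with the vocabulary `{"five", "nine", "two"}`, a first-best of `("two", 0.4)` is accepted outright, while `("five", 0.5)` is rejected because "nine" is in the grammar and the confidence does not exceed the threshold.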
Abstract:
A speech recognition method includes the steps of receiving input speech containing vocabulary, processing the input speech with a grammar to obtain N-best hypotheses and associated parameter values, and determining whether a first-best hypothesis of the N-best hypotheses is confusable with any vocabulary within the grammar. The first-best hypothesis is accepted as recognized speech corresponding to the received input speech if it is not determined to be confusable with any vocabulary within the grammar. Where the first-best hypothesis is determined to be confusable, at least one parameter value of the first-best hypothesis can be compared to at least one threshold value, and the first-best hypothesis can be accepted as recognized speech if that parameter value is greater than the threshold value. The second-best hypothesis can be accepted as the recognized speech if its confidence score is within certain lower and upper threshold values and it is not confusable with the first-best hypothesis.
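The second-best fallback in this variant can be sketched as follows. The abstract does not state the order in which the checks run, so this sketch assumes the first-best hypothesis is tried first and the second-best is considered only when the first is not accepted. The pair table, function names, and all threshold values are hypothetical.

```python
from typing import FrozenSet, List, Optional, Set, Tuple

Hypothesis = Tuple[str, float]  # assumed shape: (text, confidence score)

# Hypothetical set of mutually confusable word pairs.
CONFUSABLE_PAIRS: Set[FrozenSet[str]] = {frozenset({"five", "nine"})}


def are_confusable(a: str, b: str) -> bool:
    return frozenset({a, b}) in CONFUSABLE_PAIRS


def confusable_in_grammar(word: str, grammar_vocab: Set[str]) -> bool:
    return any(are_confusable(word, other)
               for other in grammar_vocab if other != word)


def choose(n_best: List[Hypothesis],
           grammar_vocab: Set[str],
           first_threshold: float = 0.7,
           second_lower: float = 0.3,
           second_upper: float = 0.6) -> Optional[str]:
    """Pick the recognized speech from an N-best list, or None to reject."""
    if not n_best:
        return None
    first_text, first_conf = n_best[0]

    # Non-confusable first-best hypotheses are accepted directly.
    if not confusable_in_grammar(first_text, grammar_vocab):
        return first_text
    # Confusable first-best: accept only above the threshold.
    if first_conf > first_threshold:
        return first_text

    # Fallback: second-best inside the confidence window and not
    # confusable with the first-best.
    if len(n_best) > 1:
        second_text, second_conf = n_best[1]
        in_window = second_lower <= second_conf <= second_upper
        if in_window and not are_confusable(first_text, second_text):
            return second_text
    return None
```

Returning `None` leaves the next action, such as re-prompting the user, to the caller; the abstract does not describe what happens when neither hypothesis is accepted.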
Abstract:
A method of speech-to-DTMF generation involves ASR-enabled and DTMF-controlled communications systems. The ASR-enabled system recognizes speech received from the DTMF-controlled telecommunications system using sampling-rate-independent speech recognition. It then identifies a speech segment, contained in the speech received from the DTMF-controlled system, that corresponds to at least one keyword associated with user-defined data. The ASR-enabled system then transmits at least one DTMF signal to the DTMF-controlled system in response to the identified speech segment. This allows a user of an ASR-enabled system, such as a vehicle telematics unit, to at least partially automate access to the DTMF-controlled system, so that voice mailbox numbers, passwords, and the like, normally entered via a telephone keypad, can be sent automatically from the telematics unit without being manually input each time by the user.
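The keyword-to-DTMF step can be sketched as below, assuming the prompt from the DTMF-controlled system has already been recognized as text. The tone table uses the standard DTMF keypad frequencies; the `USER_DATA` mapping, its sample values, and the function name `respond_to_prompt` are hypothetical, and generating the actual audio tones is out of scope here.

```python
import re
from typing import Dict, List, Optional, Tuple

# Standard DTMF keypad: each key is a (row Hz, column Hz) tone pair.
ROWS = (697, 770, 852, 941)
COLS = (1209, 1336, 1477, 1633)
KEYPAD = ("123A", "456B", "789C", "*0#D")
DTMF: Dict[str, Tuple[int, int]] = {
    key: (ROWS[r], COLS[c])
    for r, row in enumerate(KEYPAD)
    for c, key in enumerate(row)
}

# Hypothetical user-defined data, keyed by the prompt keyword that requests it.
USER_DATA: Dict[str, str] = {"mailbox": "4321", "password": "9876#"}


def respond_to_prompt(recognized_prompt: str,
                      user_data: Dict[str, str]) -> Optional[List[Tuple[int, int]]]:
    """Return the tone pairs to send for the first keyword found in the prompt."""
    words = set(re.findall(r"[a-z]+", recognized_prompt.lower()))
    for keyword, digits in user_data.items():
        if keyword in words:
            return [DTMF[d] for d in digits]
    return None  # no keyword recognized, so nothing is transmitted


tones = respond_to_prompt("Please enter your password.", USER_DATA)
```

Matching on whole words rather than substrings avoids triggering on unrelated prompts that merely contain a keyword as part of a longer word.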
Abstract:
A method of operating a speech recognition system on a vehicle having a visual display and a manually-operated input device includes initiating the speech recognition system, controlling menu selections on the visual display using the manually-operated input device, receiving a notification from the manually-operated input device indicating that the user is manipulating the device in conjunction with the menu selections on the visual display, and adjusting operation of the speech recognition system based on input received by the manually-operated input device.
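One way the notification-and-adjust flow could look is sketched below. The abstract does not say what adjustment is made, so this sketch assumes one possibility: narrowing the active vocabulary to the menu the user has highlighted. The class `SpeechSession`, its fields, and the sample menus are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class SpeechSession:
    """Hypothetical session state for an in-vehicle speech recognizer."""
    menu_grammars: Dict[str, List[str]]  # menu name -> words valid in that menu
    listening: bool = False
    active_vocab: List[str] = field(default_factory=list)

    def start(self) -> None:
        """Initiate recognition with the vocabulary of every menu active."""
        self.listening = True
        self.active_vocab = sorted(
            {word for words in self.menu_grammars.values() for word in words}
        )

    def on_manual_input(self, highlighted_menu: str) -> None:
        """Handle a notification that the user is manipulating the input device.

        Assumed adjustment: restrict recognition to the highlighted menu.
        Unknown menu names leave the active vocabulary unchanged.
        """
        if not self.listening:
            return
        if highlighted_menu in self.menu_grammars:
            self.active_vocab = list(self.menu_grammars[highlighted_menu])


session = SpeechSession({"audio": ["volume", "tune"], "phone": ["dial", "call"]})
session.start()
session.on_manual_input("phone")
```

Other adjustments, such as pausing a listening timeout while the device is in use, would fit the same handler shape.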