Abstract:
A speech recognition support method for a system that retrieves a map in response to a user's input speech. The user's speech is recognized and a recognition result is obtained. If the recognition result represents a point on the map, the distance between that point and a base point on the map is calculated and compared against a threshold. If the distance is above the threshold, an inquiry asking the user to confirm whether the recognition result is correct is output.
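The threshold check described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the planar distance, the 50 km default, the `gazetteer` lookup table, and all function names are assumptions introduced here.

```python
import math

def distance_km(a, b):
    """Planar distance between two (x, y) map points, in km (assumed metric)."""
    return math.hypot(a[0] - b[0], a[1] - b[1])

def needs_confirmation(point, base_point, threshold_km):
    """True when the recognized point lies farther from the base point than the threshold."""
    return distance_km(point, base_point) > threshold_km

def handle_result(name, gazetteer, base_point, threshold_km=50.0):
    """Return a confirmation inquiry for a distant map point, else None.

    `gazetteer` is a hypothetical dict mapping place names to (x, y) points.
    """
    point = gazetteer.get(name)
    if point is None:
        # The recognition result is not a point on the map; no distance check applies.
        return None
    if needs_confirmation(point, base_point, threshold_km):
        return f"Did you say '{name}'?"
    return None
```

A distant result triggers the inquiry, on the reasoning that a far-away place is more likely to be a misrecognition than a nearby one; nearby results pass through without confirmation.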
Abstract:
A speech recognition apparatus includes: a storage unit which stores vocabularies, each vocabulary including plural word body data, each word body data being obtained by removing a specific word head from a word or sentence, and which stores at least one word head portion including labeled nodes that express at least one word head common to at least two of the vocabularies; an instruction receiving unit which receives an instruction of a target vocabulary and an instruction of an operation; a grammar network generating unit which generates, when adding is instructed, a grammar network containing the word head portion, the target vocabulary, and connection information indicating that each of the word body data contained in the target vocabulary is connected to a specific one of the labeled nodes contained in the word head portion; and a speech recognition unit which executes speech recognition using the generated grammar network.
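One way to picture the shared word head portion and the connection information is the sketch below. The data layout, the node labels, the example vocabularies, and every name in it are hypothetical; the abstract does not specify a representation, and the recognition step itself is omitted.

```python
# Hypothetical word head portion: node label -> common word head.
HEAD_NODES = {"n1": "San ", "n2": "New "}

# Hypothetical vocabularies: each entry is (head node label, word body),
# where the word body is the word with its head removed.
VOCABULARIES = {
    "cities_a": [("n1", "Jose"), ("n2", "York")],
    "cities_b": [("n1", "Diego"), ("n2", "Haven")],
}

def new_network():
    """Create an empty grammar network that holds only the word head portion."""
    return {"heads": HEAD_NODES, "vocabularies": set(), "connections": []}

def add_vocabulary(network, name):
    """Handle an 'add' instruction: attach each word body to its labeled head node."""
    network["vocabularies"].add(name)
    for label, body in VOCABULARIES[name]:
        network["connections"].append((label, name, body))
    return network

def accepted_phrases(network):
    """List the full phrases the network currently covers, in sorted order."""
    return sorted(
        network["heads"][label] + body
        for label, _, body in network["connections"]
    )
```

Because the heads are stored once and shared, adding a vocabulary only appends connection records; under this sketch the head portion never has to be rebuilt.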
Abstract:
A setting unit sets an input language and an output language; a first receiving unit receives input data in a language; a storage unit stores the input data; a detection unit detects a discrepancy between the set input language and the language of the input data; and a swapping unit swaps the settings of the input language and the output language if the discrepancy is detected. A recognition unit recognizes the input data read from the storage unit in the input language set by the setting unit if the discrepancy is not detected, and in the input language swapped by the swapping unit if the discrepancy is detected. A translation unit translates the recognition result into the output language set by the setting unit if the discrepancy is not detected, and into the output language swapped by the swapping unit if the discrepancy is detected.
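The swap-on-discrepancy flow can be sketched as below. The `Settings` class and the `detect`, `recognize`, and `translate` callables are placeholders assumed for illustration; the abstract does not say how language detection, recognition, or translation are carried out.

```python
from dataclasses import dataclass

@dataclass
class Settings:
    input_lang: str
    output_lang: str

def resolve_settings(settings, detected_lang):
    """Swap input and output languages when the data's language differs from the set input language."""
    if detected_lang != settings.input_lang:
        return Settings(settings.output_lang, settings.input_lang)
    return settings

def process(stored_input, settings, detect, recognize, translate):
    """Recognize and translate stored input under the (possibly swapped) settings.

    `detect`, `recognize`, and `translate` are hypothetical callables supplied by the caller.
    """
    effective = resolve_settings(settings, detect(stored_input))
    text = recognize(stored_input, effective.input_lang)
    return translate(text, effective.input_lang, effective.output_lang)
```

Because the input data is kept in storage, the same utterance can be recognized again under the swapped settings without asking the speaker to repeat it.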
Abstract:
A voice recognition apparatus determines, in time series, whether each segment of an input sound is a voice segment or a non-voice segment, generates a word model for each voice segment, and allocates a predetermined non-voice model to each non-voice segment. It connects the word models and non-voice models in the time-series order of their corresponding segments to generate a vocalization model, associates the vocalization model with a vocalization ID in one-to-one correspondence, and stores the result.
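The assembly of a vocalization model from classified segments can be sketched as follows. The segment representation, the `"<sil>"` placeholder for the predetermined non-voice model, the `make_word_model` callable, and the dictionary store are all assumptions made for this illustration.

```python
NON_VOICE_MODEL = "<sil>"  # stand-in for the predetermined non-voice model

def build_vocalization_model(segments, make_word_model):
    """Concatenate per-segment models in time order.

    `segments` is a time-ordered list of (is_voice, audio) pairs;
    `make_word_model` is a hypothetical callable that builds a word model from audio.
    """
    return [
        make_word_model(audio) if is_voice else NON_VOICE_MODEL
        for is_voice, audio in segments
    ]

def register(store, vocalization_id, model):
    """Store the model under its ID, refusing to reuse an ID so the mapping stays one-to-one."""
    if vocalization_id in store:
        raise KeyError(f"vocalization ID already registered: {vocalization_id}")
    store[vocalization_id] = model
    return store
```

Here the non-voice segments keep their position in the sequence, so pauses inside an utterance are modeled where they occurred rather than being discarded.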