Abstract:
A speech recognition system and method thereof are provided. The speech recognition system connects to an external general-purpose speech recognition system, and including a storage unit and a processing unit. The storage unit stores a specific application speech recognition module, a comparison module and an enhancement module. The specific application speech recognition module converts a speech signal into a first phonetic text. The general-purpose speech recognition system converts the speech signal into a written text. The comparison module receives the first phonetic text and the written text, converts the written text into a second phonetic text, and aligns the second phonetic text with the first phonetic text according to similarity of pronunciation to output a phonetic text alignment result. The enhancement module receives the phonetic text alignment result, and constructs with the written text and the first phonetic text after path weighting to form an outputting recognized text.
Abstract:
A speech recognition system and method thereof, a vocabulary establishing method and a computer program product are provided. The speech recognition method includes: storing a speech recognition model including speech-units and basic components of acoustic models, wherein each of the speech-units includes at least one state and each state corresponds to one of the basic components of acoustic models; receiving first and second speech signals; obtaining a speech-unit sequence of a native/non-native vocabulary from a speech-analysis and unit-expansion module; recognizing the first speech signal according to the speech recognition model and the speech-unit sequence of the native/non-native vocabulary and further outputting a recognition result; and selecting an optimal component from the basic components of acoustic models according to the speech recognition model, the second speech signal, and the word corresponding to the second speech signal, and further updating the speech-units according to the best basic component of acoustic model.
Abstract:
A speech recognition system and method thereof are provided. The speech recognition system connects to an external general-purpose speech recognition system, and including a storage unit and a processing unit. The storage unit stores a specific application speech recognition module, a comparison module and an enhancement module. The specific application speech recognition module converts a speech signal into a first phonetic text. The general-purpose speech recognition system converts the speech signal into a written text. The comparison module receives the first phonetic text and the written text, converts the written text into a second phonetic text, and aligns the second phonetic text with the first phonetic text according to similarity of pronunciation to output a phonetic text alignment result. The enhancement module receives the phonetic text alignment result, and constructs with the written text and the first phonetic text after path weighting to form an outputting recognized text.
Abstract:
According to an exemplary embodiment of a guided speaker adaptive speech synthesis system, a speaker adaptive training module generates adaptation information and a speaker-adapted model based on inputted recording text and recording speech. A text to speech engine receives the recording text and the speaker-adapted model and outputs synthesized speech information. A performance assessment module receives the adaptation information and the synthesized speech information to generate assessment information. An adaptation recommendation module selects at least one subsequent recording text from at least one text source as a recommendation of a next adaption process, according to the adaptation information and the assessment information.
Abstract:
The disclosure provides methods and computer systems for named entity verification, named entity verification model training, and phrase expansion. The method for named entity verification includes to receive an unknown type phrase, to generate a query phrase according to the unknown type phrase, to perform auto-completion on the query phrase to receive one or more returned phrases, to extract feature information from the returned phrases, and to determine a named entity type of the unknown type phrase based on the feature information verify whether or not the unknown type phrase belongs to the target named entity type based on the feature information and a target verification model to accordingly output a verification result.
Abstract:
A recognition network generation device, disposed in an electronic device, comprising: an operation record storage device storing a plurality of operation records of the electronic device, wherein each of the operation records includes operation content executed by the electronic device and device peripheral information detected by the electronic device when the electronic device executes the operation content; an activity model constructor classifying the operation records into a plurality of activity models according to all the device peripheral information of the operation records; an activity predictor selecting at least one selected activity model according to the degree of similarity between each of the activity models and a current device peripheral information detected by the electronic device; and a weight adjustor adjusting the weights of a plurality of recognition vocabularies by taking into account a number of times each recognition vocabulary appears in all operations contents of the activity models, wherein the recognition vocabularies correspond to all the operation content of the at least one selected activity model.
Abstract:
A speech recognition system and method thereof, a vocabulary establishing method and a computer program product are provided. The speech recognition method includes: storing a speech recognition model including speech-units and basic components of acoustic models, wherein each of the speech-units includes at least one state and each state corresponds to one of the basic components of acoustic models; receiving first and second speech signals; obtaining a speech-unit sequence of a native/non-native vocabulary from a speech-analysis and unit-expansion module; recognizing the first speech signal according to the speech recognition model and the speech-unit sequence of the native/non-native vocabulary and further outputting a recognition result; and selecting an optimal component from the basic components of acoustic models according to the speech recognition model, the second speech signal, and the word corresponding to the second speech signal, and further updating the speech-units according to the best basic component of acoustic model.
Abstract:
In a spoken word generation system for speech recognition, at least one input device receives a plurality of input signals at least including at least one sound signal; a mode detection module detects the plurality of input signals; when a specific sound event is detected in the at least one sound signal or at least one control signal is included in the plurality of input signals, a speech training mode is outputted; when no specific sound event is detected in the at least one sound signal and no control signal is included in the plurality of input signals, a speech recognition mode is outputted; a speech training module receives the speech training mode and performs a training process on the audio segment and outputs a training result; and a speech recognition module receives the speech recognition mode, and performs a speech recognition process and outputs a recognition result.
Abstract:
A voice control method is provided. At least one object name-action prompt correspondence document is received and processed into an object name-action prompt correspondence document set that defines at least one object name and at least one corresponding action prompt. The object name-action prompt correspondence document set is processed to establish an object name-action prompt correspondence list. A voice is recognized as one or multiple voice recognition results to generate one or multiple corresponding candidate object names. At least one corresponding candidate action prompt is outputted according to the candidate object name(s) and the object name-action prompt correspondence list. A selected action prompt is received, and a module providing the selected action prompt is requested to execute an operation.