-
公开(公告)号:US10224023B2
公开(公告)日:2019-03-05
申请号:US15458990
申请日:2017-03-15
Applicant: Industrial Technology Research Institute
Inventor: Shih-Chieh Chien , Chih-Chung Kuo
Abstract: A speech recognition system and method thereof, a vocabulary establishing method and a computer program product are provided. The speech recognition method includes: storing a speech recognition model including speech-units and basic components of acoustic models, wherein each of the speech-units includes at least one state and each state corresponds to one of the basic components of acoustic models; receiving first and second speech signals; obtaining a speech-unit sequence of a native/non-native vocabulary from a speech-analysis and unit-expansion module; recognizing the first speech signal according to the speech recognition model and the speech-unit sequence of the native/non-native vocabulary and further outputting a recognition result; and selecting an optimal component from the basic components of acoustic models according to the speech recognition model, the second speech signal, and the word corresponding to the second speech signal, and further updating the speech-units according to the best basic component of acoustic model.
-
2.
公开(公告)号:US20180166069A1
公开(公告)日:2018-06-14
申请号:US15458990
申请日:2017-03-15
Applicant: Industrial Technology Research Institute
Inventor: Shih-Chieh Chien , Chih-Chung Kuo
CPC classification number: G10L15/063 , G10L15/02 , G10L15/04 , G10L15/05 , G10L2015/0635
Abstract: A speech recognition system and method thereof, a vocabulary establishing method and a computer program product are provided. The speech recognition method includes: storing a speech recognition model including speech-units and basic components of acoustic models, wherein each of the speech-units includes at least one state and each state corresponds to one of the basic components of acoustic models; receiving first and second speech signals; obtaining a speech-unit sequence of a native/non-native vocabulary from a speech-analysis and unit-expansion module; recognizing the first speech signal according to the speech recognition model and the speech-unit sequence of the native/non-native vocabulary and further outputting a recognition result; and selecting an optimal component from the basic components of acoustic models according to the speech recognition model, the second speech signal, and the word corresponding to the second speech signal, and further updating the speech-units according to the best basic component of acoustic model.
-
3.
公开(公告)号:US09691389B2
公开(公告)日:2017-06-27
申请号:US14288833
申请日:2014-05-28
Applicant: INDUSTRIAL TECHNOLOGY RESEARCH INSTITUTE
Inventor: Shih-Chieh Chien , Chih-Chung Kuo
IPC: G10L15/00 , G10L15/26 , G10L17/00 , G10L21/00 , G10L25/00 , G10L15/28 , G10L15/22 , G10L15/06 , G10L15/08
CPC classification number: G10L15/28 , G10L15/063 , G10L15/22 , G10L2015/088 , G10L2015/223
Abstract: In a spoken word generation system for speech recognition, at least one input device receives a plurality of input signals at least including at least one sound signal; a mode detection module detects the plurality of input signals; when a specific sound event is detected in the at least one sound signal or at least one control signal is included in the plurality of input signals, a speech training mode is outputted; when no specific sound event is detected in the at least one sound signal and no control signal is included in the plurality of input signals, a speech recognition mode is outputted; a speech training module receives the speech training mode and performs a training process on the audio segment and outputs a training result; and a speech recognition module receives the speech recognition mode, and performs a speech recognition process and outputs a recognition result.
-
公开(公告)号:US08972264B2
公开(公告)日:2015-03-03
申请号:US13717645
申请日:2012-12-17
Applicant: Industrial Technology Research Institute
Inventor: Shih-Chieh Chien
IPC: G10L15/00
CPC classification number: G10L15/142 , G10L15/01 , G10L2015/085
Abstract: A method and apparatus for utterance verification are provided for verifying a recognized vocabulary output from speech recognition. The apparatus for utterance verification includes a reference score accumulator, a verification score generator and a decision device. A log-likelihood score obtained from speech recognition is processed by taking a logarithm of the value of the probability of one of feature vectors of an input speech conditioned on one of states of each model vocabulary. A verification score is generated based on the processed result. The verification score is compared with a predetermined threshold value so as to reject or accept the recognized vocabulary.
Abstract translation: 提供用于发声验证的方法和装置,用于验证从语音识别输出的识别词汇。 用于话语验证的装置包括参考分数累加器,验证分数发生器和判定装置。 通过以每个模型词汇表的状态条件为基础的输入语音的特征向量中的一个特征向量的概率的对数来处理从语音识别获得的对数似然分数。 基于处理结果生成验证分数。 将验证分数与预定阈值进行比较,以拒绝或接受所识别的词汇。
-
公开(公告)号:US20140129224A1
公开(公告)日:2014-05-08
申请号:US13717645
申请日:2012-12-17
Applicant: INDUSTRIAL TECHNOLOGY RESEARCH INSTITUTE
Inventor: Shih-Chieh Chien
IPC: G10L15/04
CPC classification number: G10L15/142 , G10L15/01 , G10L2015/085
Abstract: A method and apparatus for utterance verification are provided for verifying a recognized vocabulary output from speech recognition. The apparatus for utterance verification includes a reference score accumulator, a verification score generator and a decision device. A log-likelihood score obtained from speech recognition is processed by taking a logarithm of the value of the probability of one of feature vectors of an input speech conditioned on one of states of each model vocabulary. A verification score is generated based on the processed result. The verification score is compared with a predetermined threshold value so as to reject or accept the recognized vocabulary.
Abstract translation: 提供用于发声验证的方法和装置,用于验证从语音识别输出的识别词汇。 用于话语验证的装置包括参考分数累加器,验证分数发生器和判定装置。 通过以每个模型词汇表的状态条件为基础的输入语音的特征向量中的一个特征向量的概率的对数来处理从语音识别获得的对数似然分数。 基于处理结果生成验证分数。 将验证分数与预定阈值进行比较,以拒绝或接受所识别的词汇。
-
-
-
-