APPARATUS AND METHOD FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
    2.
    发明申请
    APPARATUS AND METHOD FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION 有权
    大容量连续语音识别的装置和方法

    公开(公告)号:US20160240190A1

    公开(公告)日:2016-08-18

    申请号:US15042309

    申请日:2016-02-12

    IPC分类号: G10L15/14 G10L15/16 G10L15/20

    摘要: Provided is an apparatus for large vocabulary continuous speech recognition (LVCSR) based on a context-dependent deep neural network hidden Markov model (CD-DNN-HMM) algorithm. The apparatus may include an extractor configured to extract acoustic model-state level information corresponding to an input speech signal from a training data model set using at least one of a first feature vector based on a gammatone filterbank signal analysis algorithm and a second feature vector based on a bottleneck algorithm, and a speech recognizer configured to provide a result of recognizing the input speech signal based on the extracted acoustic model-state level information.

    摘要翻译: 提供了一种基于上下文相关深度神经网络隐马尔可夫模型(CD-DNN-HMM)算法的大词汇连续语音识别(LVCSR)装置。 该装置可以包括提取器,其被配置为使用基于伽马一滤波器组信号分析算法和基于第二特征向量的第一特征向量中的至少一个从训练数据模型集中提取与输入语音信号相对应的声学模型状态级别信息 以及语音识别器,被配置为基于所提取的声学模型状态级别信息来提供识别输入语音信号的结果。

    APPARATUS AND METHOD FOR VERIFYING UTTERANCE IN SPEECH RECOGNITION SYSTEM

    公开(公告)号:US20170200458A1

    公开(公告)日:2017-07-13

    申请号:US15186286

    申请日:2016-06-17

    摘要: An apparatus and method for verifying an utterance based on multi-event detection information in a natural language speech recognition system. The apparatus includes a noise processor configured to process noise of an input speech signal, a feature extractor configured to extract features of speech data obtained through the noise processing, an event detector configured to detect events of the plurality of speech features occurring in the speech data using the noise-processed data and data of the extracted features, a decoder configured to perform speech recognition using a plurality of preset speech recognition models for the extracted feature data, and an utterance verifier configured to calculate confidence measurement values in units of words and sentences using information on the plurality of events detected by the event detector and a preset utterance verification model and perform utterance verification according to the calculated confidence measurement values.

    SIGNAL PROCESSING ALGORITHM-INTEGRATED DEEP NEURAL NETWORK-BASED SPEECH RECOGNITION APPARATUS AND LEARNING METHOD THEREOF
    8.
    发明申请
    SIGNAL PROCESSING ALGORITHM-INTEGRATED DEEP NEURAL NETWORK-BASED SPEECH RECOGNITION APPARATUS AND LEARNING METHOD THEREOF 审中-公开
    信号处理算法综合深度基于神经网络的语音识别装置及其学习方法

    公开(公告)号:US20160078863A1

    公开(公告)日:2016-03-17

    申请号:US14737907

    申请日:2015-06-12

    IPC分类号: G10L15/16

    CPC分类号: G10L15/16

    摘要: Provided are a signal processing algorithm-integrated deep neural network (DNN)-based speech recognition apparatus and a learning method thereof. A model parameter learning method in a deep neural network (DNN)-based speech recognition apparatus implementable by a computer includes converting a signal processing algorithm for extracting a feature parameter from a speech input signal of a time domain into signal processing deep neural network (DNN), fusing the signal processing DNN and a classification DNN, and learning a model parameter in a deep learning model in which the signal processing DNN and the classification DNN are fused.

    摘要翻译: 提供了一种基于信号处理算法的深度神经网络(DNN)语音识别装置及其学习方法。 由计算机实现的基于深神经网络(DNN)的语音识别装置中的模型参数学习方法包括:将来自时域的语音输入信号的特征参数的信号处理算法转换为信号处理深层神经网络(DNN ),融合信号处理DNN和分类DNN,并在信号处理DNN和分类DNN融合的深度学习模型中学习模型参数。

    APPARATUS AND METHOD FOR RECOGNIZING CONTINUOUS SPEECH
    9.
    发明申请
    APPARATUS AND METHOD FOR RECOGNIZING CONTINUOUS SPEECH 审中-公开
    用于识别连续语音的装置和方法

    公开(公告)号:US20150006175A1

    公开(公告)日:2015-01-01

    申请号:US14304104

    申请日:2014-06-13

    IPC分类号: G10L15/18

    CPC分类号: G10L15/18 G10L15/04 G10L15/32

    摘要: The present invention relates to an apparatus and a method for recognizing continuous speech having large vocabulary. In the present invention, large vocabulary in large vocabulary continuous speech having a lot of same kinds of vocabulary is divided to a reasonable number of clusters, then representative vocabulary for pertinent clusters is selected and first recognition is performed with the representative vocabulary, then if the representative vocabulary is recognized by use of the result of first recognition, re-recognition is performed against all words in the cluster where the recognized representative vocabulary belongs.

    摘要翻译: 本发明涉及用于识别具有较大词汇量的连续语音的装置和方法。 在本发明中,具有大量相同种类词汇的大词汇连续语音中的大词汇被划分为合理数量的群集,然后选择相关群集的代表性词汇,并用代表性词汇表进行首次识别, 通过使用第一识别结果来识别代表性词汇,对所识别的代表词汇所属的群集中的所有单词进行重新识别。