Voice processing device and method, and program
    1.
    Type: Granted patent
    Status: In force

    Publication No.: US08612223B2

    Publication date: 2013-12-17

    Application No.: US12817526

    Filing date: 2010-06-17

    IPC classification: G10L15/06

    CPC classification: G10L15/183

    Abstract: There is provided a voice processing device. The device includes: a score calculation unit configured to calculate a score indicating the compatibility of a voice signal, input on the basis of a user's utterance, with each of plural pieces of intention information, each indicating one of a plurality of intentions; an intention selection unit configured to select, from the plural pieces of intention information, the intention information indicating the intention of the user's utterance on the basis of the score calculated by the score calculation unit; and an intention reliability calculation unit configured to calculate the reliability of the intention information selected by the intention selection unit on the basis of the score calculated by the score calculation unit.
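
The three units described in this abstract (score calculation, intention selection, reliability calculation) can be sketched as follows. This is only an illustrative sketch: the abstract does not say how the reliability is computed, so the softmax-style normalisation, the log-domain scores, and the names `select_intention` and `example_scores` are assumptions made for the example.

```python
import math

def select_intention(scores):
    """Pick the best-scoring intention and estimate its reliability.

    scores: dict mapping an intention label to a log-domain score (assumed).
    The reliability is the winning score's share after softmax normalisation,
    which is an illustrative assumption and not the patent's formula.
    """
    if not scores:
        raise ValueError("scores must not be empty")
    best = max(scores, key=scores.get)
    top = scores[best]
    # Subtract the maximum before exponentiating for numerical stability.
    total = sum(math.exp(s - top) for s in scores.values())
    return best, 1.0 / total

# Hypothetical scores for three intentions.
example_scores = {"play_music": -2.0, "show_weather": -5.0, "set_alarm": -6.0}
intention, reliability = select_intention(example_scores)
```

A low reliability value could then be used to reject the selected intention or to ask the user to repeat the utterance.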

    VOICE RECOGNITION DEVICE AND VOICE RECOGNITION METHOD, LANGUAGE MODEL GENERATING DEVICE AND LANGUAGE MODEL GENERATING METHOD, AND COMPUTER PROGRAM
    2.
    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20100241418A1

    Publication date: 2010-09-23

    Application No.: US12661164

    Filing date: 2010-03-11

    IPC classification: G10L15/18 G06F17/27 G10L15/00

    CPC classification: G10L15/1815 G10L15/183

    Abstract: A speech recognition device includes one or more intention-extracting language models in which an intention of a focused specific task is inherent, an absorbing language model in which no intention of the task is inherent, a language score calculating section that calculates a language score indicating the linguistic similarity between the content of an utterance and each of the intention-extracting language models and the absorbing language model, and a decoder that estimates the intention in the content of the utterance based on the language score of each language model calculated by the language score calculating section.
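
As a rough illustration of how intention-extracting models and an absorbing model could compete on language score, the sketch below uses smoothed unigram models. The unigram form, the add-one smoothing, the toy corpora, and all function names are assumptions for this example; the abstract does not specify the model type.

```python
import math
from collections import Counter

def train_unigram(sentences, vocab, alpha=1.0):
    """Unigram model with add-alpha smoothing over a fixed vocabulary."""
    counts = Counter(w for s in sentences for w in s.split())
    total = sum(counts.values()) + alpha * len(vocab)
    return {w: (counts[w] + alpha) / total for w in vocab}

def language_score(model, utterance, floor=1e-6):
    """Log-probability of the utterance; unseen words get a small floor."""
    return sum(math.log(model.get(w, floor)) for w in utterance.split())

def estimate_intention(models, utterance):
    """Return the name of the model with the highest language score."""
    scores = {name: language_score(m, utterance) for name, m in models.items()}
    return max(scores, key=scores.get), scores

# Hypothetical toy corpora: two task intentions plus an absorbing model
# trained on utterances that carry no intention of the task.
corpora = {
    "weather": ["what is the weather today", "weather forecast for tomorrow"],
    "music": ["play some music", "play the next song"],
    "absorbing": ["hello how are you", "thank you very much",
                  "what time is it", "see you tomorrow"],
}
vocab = {w for sents in corpora.values() for s in sents for w in s.split()}
models = {name: train_unigram(sents, vocab) for name, sents in corpora.items()}
```

When the absorbing model scores highest, the utterance can be treated as irrelevant to the task rather than being forced into one of the task intentions.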

    Mapping determination methods and data discrimination methods using the same
    3.
    Type: Granted patent
    Status: Lapsed

    Publication No.: US5796921A

    Publication date: 1998-08-18

    Application No.: US540948

    Filing date: 1995-10-11

    IPC classification: G06K9/62 G06F15/18

    CPC classification: G06K9/6232

    Abstract: A mapping determination method for obtaining a mapping $F$ from an N-dimensional metric vector space $\Omega_N$ to an M-dimensional metric vector space $\Omega_M$ has the following steps to obtain the optimal mapping quickly and reliably. In the first step, $L_m$ complete, periodic basic functions $g_m(X)$ are set according to the distribution of samples classified into Q categories on the N-dimensional metric vector space $\Omega_N$. In the second step, a function $f_m(X)$ indicating the m-th component of the mapping $F$ is expressed as a linear sum of the functions $g_m(X)$ weighted by $L_m$ coefficients $c_m$. The third step provides Q teacher vectors $T_q = (t_{q.1}, t_{q.2}, t_{q.3}, \ldots, t_{q.M})$ (where $q = 1, 2, \ldots, Q$) for the categories on the M-dimensional metric vector space $\Omega_M$, calculates the specified estimation function $J$, and obtains the coefficients $c_m$ which minimize $J$. In the fourth step, the coefficients $c_m$ obtained in the third step are stored in memory.
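
The four steps can be sketched with NumPy under some loudly stated assumptions: the basic functions are taken to be cosines $\cos(\pi\, k \cdot X)$ on inputs scaled to $[0, 1]^N$, the same basis is shared by every output component, the teacher vectors are one-hot, and the estimation function $J$ is taken to be the summed squared error. None of these choices is given in the abstract, and the function names are hypothetical.

```python
import itertools
import numpy as np

def basis(X, max_freq=1):
    """Step 1 (assumed form): cosine basis g(X) = cos(pi * k . X) for every
    integer frequency vector k with entries in 0..max_freq."""
    X = np.atleast_2d(np.asarray(X, dtype=float))
    freqs = np.array(list(itertools.product(range(max_freq + 1),
                                            repeat=X.shape[1])))
    return np.cos(np.pi * (X @ freqs.T))      # shape (samples, L)

def fit_mapping(X, labels, num_categories, max_freq=1):
    """Steps 2-3: express F as a linear sum of the basis functions and find
    the coefficients that minimise the squared error to the teacher vectors."""
    G = basis(X, max_freq)
    T = np.eye(num_categories)[np.asarray(labels)]   # one-hot teacher vectors
    C, *_ = np.linalg.lstsq(G, T, rcond=None)
    return C                                   # Step 4: keep C for later use

def discriminate(C, X, max_freq=1):
    """Assign each input to the category whose output component is largest."""
    return np.argmax(basis(X, max_freq) @ C, axis=1)

# Hypothetical two-category training set in the unit square.
offsets = np.array([[0.0, 0.0], [-0.05, -0.05], [-0.05, 0.05],
                    [0.05, -0.05], [0.05, 0.05]])
samples = np.vstack([np.array([0.25, 0.25]) + offsets,
                     np.array([0.75, 0.75]) + offsets])
labels = np.array([0] * 5 + [1] * 5)
coeffs = fit_mapping(samples, labels, num_categories=2)
```

Because the mapping is linear in the coefficients, minimising a squared-error criterion reduces to one least-squares solve, which is one plausible reading of obtaining the mapping "quickly".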

    Speech recognition apparatus, speech recognition method, and storage medium
    5.
    Type: Granted patent
    Status: Lapsed

    Publication No.: US07013277B2

    Publication date: 2006-03-14

    Application No.: US09794887

    Filing date: 2001-02-26

    IPC classification: G10L15/00

    CPC classification: G10L15/193 G10L2015/085

    Abstract: A preliminary word-selecting section selects one or more words following the words already obtained in a word string serving as a candidate for a result of speech recognition, and a matching section calculates acoustic or linguistic scores for the selected words and forms a word string serving as a candidate for a result of speech recognition according to the scores. A control section generates word-connection relationships between words in the candidate word string, sends them to a word-connection-information storage section, and stores them there. A re-evaluation section corrects the word-connection relationships stored in the word-connection-information storage section, and the control section determines a word string serving as the result of speech recognition according to the corrected word-connection relationships.

    Information processing apparatus, information processing method, and program
    6.
    Type: Granted patent
    Status: In force

    Publication No.: US08566094B2

    Publication date: 2013-10-22

    Application No.: US13206631

    Filing date: 2011-08-10

    IPC classification: G10L15/00

    Abstract: An apparatus, method and program for performing a speech recognition process utilizing contextual information that comprises an estimation of the intention of an utterance of a user. The recognition process includes calculating a pre-score based on observed contextual information according to intention models which correspond to a plurality of types of intention information, and combining the pre-scoring results with acoustic and linguistic scores to obtain an improved recognition or comprehension of the intent of a user utterance.
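
One simple way to combine a context-based pre-score with acoustic and linguistic scores is a weighted sum in the log domain, sketched below. The additive form, the `pre_weight` parameter, and the example values are assumptions for illustration; the abstract only states that the scores are combined.

```python
def combine_scores(pre_scores, acoustic_scores, language_scores, pre_weight=1.0):
    """Add the weighted pre-score to the acoustic and linguistic scores
    (all assumed to be log-domain) and return the best intention."""
    totals = {
        intention: pre_weight * pre_scores[intention]
        + acoustic_scores[intention]
        + language_scores[intention]
        for intention in pre_scores
    }
    return max(totals, key=totals.get), totals

# Hypothetical scores: the context (pre-score) favours "play".
pre = {"play": -0.2, "pause": -2.3}
acoustic = {"play": -10.0, "pause": -9.5}
linguistic = {"play": -3.0, "pause": -3.0}

with_context, totals = combine_scores(pre, acoustic, linguistic)
without_context, _ = combine_scores(pre, acoustic, linguistic, pre_weight=0.0)
```

In this toy case the acoustic evidence alone slightly prefers "pause", and the contextual pre-score tips the decision to "play".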

    Speech recognition with score calculation
    7.
    Type: Granted patent
    Status: In force

    Publication No.: US07249017B2

    Publication date: 2007-07-24

    Application No.: US10785246

    Filing date: 2004-02-24

    IPC classification: G10L15/08 G06F17/27

    CPC classification: G10L15/187 G10L2015/025

    Abstract: In order to prevent degradation of speech recognition accuracy due to an unknown word, a dictionary database stores a word dictionary that contains, in addition to the words that are the objects of speech recognition, suffixes, which are sound elements or sound element sequences forming the unknown word, for classifying the unknown word by its part of speech. Based on such a word dictionary, a matching section connects the acoustic models of a sound model database and calculates the score from the series of features output by a feature extraction section on the basis of the connected acoustic models. Then, the matching section selects a series of words, which represents the speech recognition result, on the basis of the score.
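
The idea of classifying an unknown word by a suffix entry can be illustrated with a longest-suffix lookup over phoneme sequences. The suffix table, the phoneme symbols, and the function name are hypothetical; the patent's dictionary contents and matching procedure are not given in the abstract.

```python
# Hypothetical suffix entries: phoneme-sequence suffix -> part of speech.
SUFFIX_POS = {
    ("i", "ng"): "verb",
    ("l", "i"): "adverb",
    ("n", "e", "s"): "noun",
}

def classify_unknown_word(phonemes, suffix_pos=SUFFIX_POS):
    """Return the part of speech of the longest matching suffix, or None."""
    longest = max(len(suffix) for suffix in suffix_pos)
    for length in range(min(len(phonemes), longest), 0, -1):
        tail = tuple(phonemes[-length:])
        if tail in suffix_pos:
            return suffix_pos[tail]
    return None

guess = classify_unknown_word(["z", "o", "r", "k", "i", "ng"])
```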

    Voice recognition apparatus and method, and recording medium
    8.
    Type: Granted patent
    Status: Lapsed

    Publication No.: US06961701B2

    Publication date: 2005-11-01

    Application No.: US09798521

    Filing date: 2001-03-03

    Abstract: An extended-word selecting section calculates a score for a phoneme string formed of one or more phonemes, corresponding to a user's speech, and searches a large-vocabulary dictionary for a word having one or more phonemes equal to or similar to those of a phoneme string having a score equal to or higher than a predetermined value. A matching section calculates scores for the word found by the extended-word selecting section in addition to the words selected by a preliminary word-selecting section. A control section determines a word string as the result of recognition of the speech uttered by the user.
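
A minimal sketch of the extended-word search follows, assuming that "similar" phoneme strings are those within a small edit distance. The Levenshtein measure, the score threshold handling, the toy dictionary, and the function names are assumptions for illustration only.

```python
def phoneme_distance(a, b):
    """Levenshtein distance between two phoneme sequences."""
    prev = list(range(len(b) + 1))
    for i, pa in enumerate(a, start=1):
        cur = [i]
        for j, pb in enumerate(b, start=1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (pa != pb)))  # substitution
        prev = cur
    return prev[-1]

def select_extended_words(phonemes, score, dictionary,
                          min_score=0.5, max_distance=1):
    """Search the dictionary only when the phoneme string scores well enough,
    returning words whose pronunciation is equal or close to it."""
    if score < min_score:
        return []
    return sorted(word for word, pron in dictionary.items()
                  if phoneme_distance(phonemes, pron) <= max_distance)

# Hypothetical large-vocabulary dictionary (tiny here for illustration).
lexicon = {
    "cat": ["k", "ae", "t"],
    "cap": ["k", "ae", "p"],
    "cast": ["k", "ae", "s", "t"],
    "dog": ["d", "ao", "g"],
}
candidates = select_extended_words(["k", "ae", "t"], 0.8, lexicon)
```

The returned candidates would then be scored by the matching section alongside the preliminarily selected words.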
