Speech recognition apparatus
    1.
    发明申请
    Speech recognition apparatus 失效
    语音识别装置

    公开(公告)号:US20050075877A1

    公开(公告)日:2005-04-07

    申请号:US10416092

    申请日:2001-11-07

    CPC分类号: G10L15/08 G10L15/083

    摘要: A speech recognizing device for efficient processing while keeping a high speech recognizing performance. A matching unit (14) computes the score of a word preliminarily selected by a word preliminary selection unit (13) and determines candidates of the speech recognition result on the basis of the score. A control unit (11) creates a word connection relation between the words of a word sequence, which is a candidate of the speech recognition result and stores them in a word connection information storage unit (16). A revaluation unit (15) corrects the word connection relation serially, and the control unit ( 11) defines the speech recognition result on the basis of the word connection relation corrected. A word connection relation managing unit (21) limits the time corresponding to the boundary of a word expressed by the word connection relation, and a word connection relation managing unit (22) limits the starting time of the word preliminarily selected by the word preliminary selection unit (13). The speech recognizing device can be applied to an interactive system which responds to the speech recognition result.

    摘要翻译: 一种用于在保持高语音识别性能的同时高效处理的语音识别装置。 匹配单元(14)计算由词初步选择单元(13)预先选择的单词的分数,并根据得分确定语音识别结果的候选。 控制单元(11)创建作为语音识别结果的候选者的字序列的字之间的字连接关系,并将它们存储在字连接信息存储单元(16)中。 重估单元(15)串行地校正字连接关系,并且控制单元(11)基于校正的字连接关系来定义语音识别结果。 字连接关系管理单元(21)限制与字连接关系所表示的字的边界对应的时间,并且字连接关系管理单元(22)限制由初始选择预先选择的单词的开始时间 单位(13)。 语音识别装置可以应用于响应于语音识别结果的交互式系统。

    Speech recognition apparatus, speech recognition method, and storage medium
    2.
    发明授权
    Speech recognition apparatus, speech recognition method, and storage medium 失效
    语音识别装置,语音识别方法和存储介质

    公开(公告)号:US07013277B2

    公开(公告)日:2006-03-14

    申请号:US09794887

    申请日:2001-02-26

    IPC分类号: G10L15/00

    CPC分类号: G10L15/193 G10L2015/085

    摘要: A preliminary word-selecting section selects one or more words following words which have been obtained in a word string serving as a candidate for a result of speech recognition; and a matching section calculates acoustic or linguistic scores for the selected words, and forms a word string serving as a candidate for a result of speech recognition according to the scores. A control section generates word-connection relationships between words in the word string serving as a candidate for a result of speech recognition, sends them to a word-connection-information storage section, and stores them in it. A re-evaluation section corrects the word-connection relationships stored in the word-connection-information storage section 16, and the control section determines a word string serving as the result of speech recognition according to the corrected word-connection relationships.

    摘要翻译: 初步字选择部选择在用作语音识别结果的候选者的字串中获得的一个或多个单词, 并且匹配部分计算所选择的单词的声学或语言得分,并且根据分数形成用作语音识别结果的候选的词串。 控制部分生成用作语音识别结果候选的字串中的字之间的字连接关系,将它们发送到字连接信息存储部分,并将它们存储在其中。 重新评估部分校正存储在字连接信息存储部分16中的字连接关系,并且控制部分根据校正的字连接关系确定用作语音识别结果的字串。

    Speech recognition with score calculation
    3.
    发明授权
    Speech recognition with score calculation 有权
    语音识别与分数计算

    公开(公告)号:US07249017B2

    公开(公告)日:2007-07-24

    申请号:US10785246

    申请日:2004-02-24

    IPC分类号: G10L15/08 G06F17/27

    CPC分类号: G10L15/187 G10L2015/025

    摘要: In order to prevent degradation of speech recognition accuracy due to an unknown word, a dictionary database has stored therein a word dictionary in which are stored, in addition to words for the objects of speech recognition, suffixes, which are sound elements and a sound element sequence, which form the unknown word, for classifying the unknown word by the part of speech thereof. Based on such a word dictionary, a matching section connects the acoustic models of an sound model database, and calculates the score using the series of features output by a feature extraction section on the basis of the connected acoustic model. Then, the matching section selects a series of the words, which represents the speech recognition result, on the basis of the score.

    摘要翻译: 为了防止由于未知词引起的语音识别精度的降低,字典数据库中存储有词语词典,除了用于语音识别的对象的词之外,还存储有作为声音元素和声音元素的后缀的词典 序列,其形成未知单词,用于通过其部分语音对未知单词进行分类。 基于这样的词典,匹配部分连接声音模型数据库的声学模型,并且使用基于连接的声学模型的特征提取部分输出的一系列特征来计算分数。 然后,匹配部分基于分数来选择表示语音识别结果的一系列单词。

    Voice recognition apparatus and method, and recording medium
    4.
    发明授权
    Voice recognition apparatus and method, and recording medium 失效
    语音识别装置和方法以及记录介质

    公开(公告)号:US06961701B2

    公开(公告)日:2005-11-01

    申请号:US09798521

    申请日:2001-03-03

    摘要: An extended-word selecting section calculates a score for a phoneme string formed of one more phonemes, corresponding to a user's speech, and searches a large-vocabulary-dictionary for a word having one or more phonemes equal to or similar to those of a phoneme string having a score equal to or higher than a predetermined value. A matching section calculates scores for the word searched for by the extended-word selecting section in addition to a word preliminary word-selecting section. A control section determines a word string as the result of recognition of the speech uttered by the user.

    摘要翻译: 扩展字选择部分计算由与用户的语音相对应的一个以上音素形成的音素串的分数,并且搜索具有等于或类似于音素的一个或多个音素的单词的大词汇词典 具有等于​​或高于预定值的分数的字符串。 匹配部分除了字初步字选择部分之外,还计算由扩展字选择部分搜索的字的分数。 控制部分确定作为用户发出的语音的识别结果的字串。

    Speech recognition device and speech recognition method and recording medium utilizing preliminary word selection
    5.
    发明授权
    Speech recognition device and speech recognition method and recording medium utilizing preliminary word selection 失效
    语音识别装置和语音识别方法以及采用初步选词的记录媒体

    公开(公告)号:US07881935B2

    公开(公告)日:2011-02-01

    申请号:US10019125

    申请日:2001-02-16

    IPC分类号: G10L15/04 G10L15/14

    摘要: A speech recognition apparatus in which the accuracy in speech recognition is improved as the resource is prevented from increasing. Such a word which is probable as the result of the speech recognition is selected on the basis of an acoustic score and a linguistic score, while word selection is also performed on the basis of a measure different from the acoustic score, such as the number of phonemes being small, a part of speech being a pre-set one, inclusion in the past results of speech recognition or the linguistic score being not less than a pre-set value. The words so selected are subjected to matching processing.

    摘要翻译: 一种在防止资源增加时语音识别精度提高的语音识别装置。 基于声学得分和语言得分来选择可能作为语音识别结果的这样一个词,而基于与声分数不同的度量来执行词选择,例如, 音素很小,部分言语是预先设定的,包括过去的语音识别结果或语言分数不低于预设值。 所选择的单词将进行匹配处理。

    Speech recognition apparatus
    6.
    发明授权
    Speech recognition apparatus 失效
    语音识别装置

    公开(公告)号:US07240002B2

    公开(公告)日:2007-07-03

    申请号:US10416092

    申请日:2001-11-07

    IPC分类号: G10L15/04

    CPC分类号: G10L15/08 G10L15/083

    摘要: The present invention provides a speech recognition apparatus having high speech recognition performance and capable of performing speech recognition in a highly efficient manner. A matching unit 14 calculates the scores of words selected by a preliminary word selector 13 and determines a candidate for a speech recognition result on the basis of the calculated scores. A control unit 11 produces word connection relationships among words included in a word series employed as a candidate for the speech recognition result and stores them into a word connection information storage unit 16. A reevaluation unit 15 corrects the word connection relationships one by one. On the basis of the corrected word connection relationships, the control unit 11 determines the speech recognition result. A word connection managing unit 21 limits times allowed for a boundary between words represented by the word connection relationships to be located thereat. A word connection managing unit 22 limits start times of words preliminarily selected by the preliminary word selector 13. The present invention can be applied to an interactive system that recognizes an input speech and responds to the speech recognition result.

    摘要翻译: 本发明提供了具有高语音识别性能并且能够以高效的方式执行语音识别的语音识别装置。 匹配单元14计算由初步词选择器13选择的单词的分数,并且基于所计算的分数来确定语音识别结果的候选。 控制单元11产生用作语音识别结果候选的单词序列中包含的单词之间的字连接关系,并将它们存储到单词连接信息存储单元16中。 重新评估单元15逐个地修正单词连接关系。 基于校正后的字连接关系,控制单元11确定语音识别结果。 字连接管理单元21限制由字连接关系所表示的字之间的边界所允许的时间。 字连接管理单元22限制由初步词选择器13预先选择的单词的开始时间。 本发明可以应用于识别输入语音并响应于语音识别结果的交互式系统。

    Speech recognition system that restarts recognition operation when a new speech signal is entered using a talk switch
    10.
    发明授权
    Speech recognition system that restarts recognition operation when a new speech signal is entered using a talk switch 失效
    当使用通话开关输入新的语音信号时,语音识别系统重启识别操作

    公开(公告)号:US06253174B1

    公开(公告)日:2001-06-26

    申请号:US09108826

    申请日:1998-07-01

    IPC分类号: G10L1500

    CPC分类号: G10L15/22 G01C21/3608

    摘要: A speech recognition system for use in an automobile navigation system performs speech processing for recognizing speech or spoken words that correspond to a name of a place and a word designating a desired operation of the navigation system. When a new audio signal is input during speech recognition processing of a previously input audio signal, by holding a talk switch down for a fixed time period, the processing of the previously input audio signal is canceled, and the new audio signal immediately undergoes speech recognition processing without requiring any continuation of the processing of the previously input audio signal. The speech recognition system also determines whether an input audio signal has been reinputted within a predetermined amount of time from when the audio signal was previously inputted.

    摘要翻译: 用于汽车导航系统的语音识别系统执行语音处理,用于识别与指定导航系统的期望操作的位置名称和单词相对应的语音或口语单词。 当在先前输入的音频信号的语音识别处理期间输入新的音频信号时,通过将通话开关保持固定时间段,先前输入的音频信号的处理被消除,并且新的音频信号立即进行语音识别 处理,而不需要对先前输入的音频信号的处理的任何继续。 语音识别系统还确定输入音频信号是否已经在从先前输入音频信号的预定时间量内被重新输入。