Audio conversation device, method, and robot device
    1.
    Patent application
    Audio conversation device, method, and robot device (under examination, published)

    Publication No.: US20060177802A1

    Publication date: 2006-08-10

    Application No.: US10549795

    Filing date: 2004-03-16

    IPC classification: G09B19/04

    CPC classification: G10L13/00

    Abstract: Conventional voice dialogue systems sometimes have difficulty holding a natural dialogue with the user. The disclosed device therefore performs speech recognition on the user's utterance, controls the dialogue with the user according to a previously given scenario based on the speech recognition result, generates an answering sentence corresponding to the contents of the user's utterance as the occasion demands, and applies voice synthesis processing to a sentence in the reproduced scenario or to the generated answering sentence.

    Information processing apparatus, information processing method, and program
    2.
    Granted patent
    Information processing apparatus, information processing method, and program (in force)

    Publication No.: US08566094B2

    Publication date: 2013-10-22

    Application No.: US13206631

    Filing date: 2011-08-10

    IPC classification: G10L15/00

    Abstract: An apparatus, method, and program for performing a speech recognition process utilizing contextual information, including an estimation of the intention of a user's utterance. The recognition process calculates a pre-score from observed contextual information according to intention models, which correspond to a plurality of types of intention information, and combines the pre-scoring results with acoustic and linguistic scores to improve recognition or comprehension of the intent of the user's utterance.
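
    The score combination described in this abstract can be illustrated with a minimal sketch. It assumes log-domain scores that are simply summed, with a hypothetical weight on the pre-score. The abstract does not state the combination formula, so the function names, the weight, and the example values below are all illustrative assumptions.

```python
import math


def combined_score(acoustic, linguistic, pre_score, pre_weight=1.0):
    """Sum log-domain scores; pre_weight (an assumption) scales the pre-score."""
    return acoustic + linguistic + pre_weight * pre_score


def pick_intention(hypotheses, pre_scores, pre_weight=1.0):
    """Return the intention whose hypothesis has the highest combined score."""
    best = max(
        hypotheses,
        key=lambda h: combined_score(
            h["acoustic"], h["linguistic"], pre_scores[h["intention"]], pre_weight
        ),
    )
    return best["intention"]


# Hypothetical recognition hypotheses, one per intention model.
hypotheses = [
    {"intention": "play_music", "acoustic": -10.0, "linguistic": -4.0},
    {"intention": "check_weather", "acoustic": -9.5, "linguistic": -4.2},
]

# Hypothetical pre-scores derived from context (for example, a music app is open).
pre_scores = {"play_music": math.log(0.7), "check_weather": math.log(0.3)}

with_context = pick_intention(hypotheses, pre_scores)
without_context = pick_intention(hypotheses, pre_scores, pre_weight=0.0)
```

    In this made-up example the context-based pre-score changes the winning intention, which is the effect the abstract attributes to combining the pre-score with the acoustic and linguistic scores.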

    Speech recognition apparatus, speech recognition method, and storage medium
    4.
    Granted patent
    Speech recognition apparatus, speech recognition method, and storage medium (lapsed)

    Publication No.: US07013277B2

    Publication date: 2006-03-14

    Application No.: US09794887

    Filing date: 2001-02-26

    IPC classification: G10L15/00

    CPC classification: G10L15/193 G10L2015/085

    Abstract: A preliminary word-selecting section selects one or more words to follow the words already obtained in a word string serving as a candidate for the speech recognition result. A matching section calculates acoustic and linguistic scores for the selected words and, according to the scores, forms a word string serving as a candidate for the speech recognition result. A control section generates word-connection relationships between the words in the candidate word string and stores them in a word-connection-information storage section. A re-evaluation section corrects the word-connection relationships stored in the word-connection-information storage section, and the control section determines the word string serving as the speech recognition result according to the corrected word-connection relationships.

    Speech recognition with score calculation
    5.
    Granted patent
    Speech recognition with score calculation (in force)

    Publication No.: US07249017B2

    Publication date: 2007-07-24

    Application No.: US10785246

    Filing date: 2004-02-24

    IPC classification: G10L15/08 G06F17/27

    CPC classification: G10L15/187 G10L2015/025

    Abstract: To prevent degradation of speech recognition accuracy due to an unknown word, a dictionary database stores a word dictionary that contains, in addition to the words that are the objects of speech recognition, suffixes (sound elements and sound element sequences that form the unknown word) for classifying the unknown word by its part of speech. Based on this word dictionary, a matching section connects the acoustic models of a sound model database and calculates a score from the series of features output by a feature extraction section, using the connected acoustic models. The matching section then selects, on the basis of the score, the series of words that represents the speech recognition result.

    Voice recognition apparatus and method, and recording medium
    6.
    Granted patent
    Voice recognition apparatus and method, and recording medium (lapsed)

    Publication No.: US06961701B2

    Publication date: 2005-11-01

    Application No.: US09798521

    Filing date: 2001-03-03

    Abstract: An extended-word selecting section calculates a score for a phoneme string formed of one or more phonemes, corresponding to a user's speech, and searches a large-vocabulary dictionary for a word having one or more phonemes equal to or similar to those of a phoneme string whose score is equal to or higher than a predetermined value. A matching section calculates scores for the word found by the extended-word selecting section in addition to a word preliminarily selected by a preliminary word-selecting section. A control section determines a word string as the result of recognizing the speech uttered by the user.
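
    The extended-word search can be sketched as follows. The abstract does not say how phoneme similarity is measured, so this sketch substitutes plain Levenshtein distance over phoneme sequences as an assumed similarity measure. The dictionary contents, thresholds, and function names are hypothetical.

```python
def phoneme_distance(a, b):
    """Levenshtein distance between two phoneme sequences."""
    prev = list(range(len(b) + 1))
    for i, pa in enumerate(a, 1):
        cur = [i]
        for j, pb in enumerate(b, 1):
            cur.append(min(
                prev[j] + 1,               # delete a phoneme from a
                cur[j - 1] + 1,            # insert a phoneme from b
                prev[j - 1] + (pa != pb),  # substitute (free if equal)
            ))
        prev = cur
    return prev[-1]


def extended_word_selection(phoneme_hyps, dictionary, min_score, max_distance=1):
    """Collect dictionary words close to any sufficiently well-scored phoneme string."""
    selected = set()
    for phonemes, score in phoneme_hyps:
        if score < min_score:
            continue  # only phoneme strings at or above the threshold are used
        for word, pronunciation in dictionary.items():
            if phoneme_distance(phonemes, pronunciation) <= max_distance:
                selected.add(word)
    return selected


# Hypothetical large-vocabulary dictionary: word -> phoneme sequence.
dictionary = {
    "cat": ("k", "ae", "t"),
    "cap": ("k", "ae", "p"),
    "cattle": ("k", "ae", "t", "ah", "l"),
    "dog": ("d", "ao", "g"),
}

# Hypothetical scored phoneme strings for one utterance.
phoneme_hyps = [(("k", "ae", "t"), 0.9), (("d", "ao", "g"), 0.2)]

extended_words = extended_word_selection(phoneme_hyps, dictionary, min_score=0.5)
```

    The words returned here would then be scored by the matching section together with the preliminarily selected words.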

    Speech recognition apparatus
    7.
    Patent application
    Speech recognition apparatus (lapsed)

    Publication No.: US20050075877A1

    Publication date: 2005-04-07

    Application No.: US10416092

    Filing date: 2001-11-07

    CPC classification: G10L15/08 G10L15/083

    Abstract: A speech recognition device that processes efficiently while maintaining high speech recognition performance. A matching unit (14) computes the scores of words preliminarily selected by a preliminary word selection unit (13) and determines candidates for the speech recognition result on the basis of the scores. A control unit (11) creates word connection relations between the words of a word sequence that is a candidate for the speech recognition result and stores them in a word connection information storage unit (16). A reevaluation unit (15) corrects the word connection relations sequentially, and the control unit (11) determines the speech recognition result on the basis of the corrected word connection relations. A word connection relation managing unit (21) limits the times that may correspond to a word boundary expressed by the word connection relations, and a word connection relation managing unit (22) limits the start times of the words preliminarily selected by the preliminary word selection unit (13). The device can be applied to an interactive system that responds to the speech recognition result.

    Speech recognition device and speech recognition method and recording medium utilizing preliminary word selection
    8.
    Granted patent
    Speech recognition device and speech recognition method and recording medium utilizing preliminary word selection (lapsed)

    Publication No.: US07881935B2

    Publication date: 2011-02-01

    Application No.: US10019125

    Filing date: 2001-02-16

    IPC classification: G10L15/04 G10L15/14

    Abstract: A speech recognition apparatus that improves speech recognition accuracy while keeping resource requirements from increasing. Words that are probable as the speech recognition result are selected on the basis of an acoustic score and a linguistic score. Word selection is also performed on the basis of a measure other than the acoustic score, such as the word having a small number of phonemes, having a pre-set part of speech, being included in past speech recognition results, or having a linguistic score not less than a pre-set value. The words so selected are subjected to matching processing.
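
    The two-stage preliminary selection described in this abstract can be sketched as below. The sketch assumes that the score-based stage keeps the top N words by summed acoustic and linguistic score, and that the second stage takes the union of the alternative measures. The abstract specifies neither detail, so the thresholds, field names, and sample words are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    word: str
    acoustic: float    # log-domain acoustic score (higher is better)
    linguistic: float  # log-domain linguistic score (higher is better)
    phonemes: int      # number of phonemes in the word
    pos: str           # part of speech


def preselect(cands, top_n, max_short_phonemes=2, forced_pos=frozenset(),
              history=frozenset(), min_linguistic=None):
    """Return the set of words passed on to matching processing."""
    # Stage 1: most probable words by acoustic + linguistic score.
    ranked = sorted(cands, key=lambda c: c.acoustic + c.linguistic, reverse=True)
    selected = {c.word for c in ranked[:top_n]}
    # Stage 2: words admitted by measures other than the acoustic score.
    for c in cands:
        if (c.phonemes <= max_short_phonemes
                or c.pos in forced_pos
                or c.word in history
                or (min_linguistic is not None and c.linguistic >= min_linguistic)):
            selected.add(c.word)
    return selected


cands = [
    Candidate("weather", -5.0, -2.0, 5, "noun"),
    Candidate("whether", -6.0, -3.0, 5, "conjunction"),
    Candidate("a", -12.0, -1.0, 1, "article"),
    Candidate("wa", -11.0, -6.0, 2, "particle"),
    Candidate("feather", -9.0, -7.0, 5, "noun"),
    Candidate("leather", -9.5, -7.5, 5, "noun"),
]

chosen = preselect(cands, top_n=2, history=frozenset({"leather"}))
```

    Short words such as "a" tend to have unstable acoustic scores, which is one plausible reason for admitting them by a separate measure instead of enlarging the score-based candidate list.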

    System and method for an automatic set-up of speech recognition engines
    9.
    Granted patent
    System and method for an automatic set-up of speech recognition engines (lapsed)

    Publication No.: US07716047B2

    Publication date: 2010-05-11

    Application No.: US10403730

    Filing date: 2003-03-31

    IPC classification: G10L15/00 G06F15/18

    CPC classification: G10L15/28

    Abstract: A system and method for an automatic set-up of speech recognition engines may include a speech recognizer configured to perform speech recognition procedures to identify input speech data according to one or more operating parameters. A merit manager may be utilized to automatically calculate merit values corresponding to the foregoing recognition procedures. These merit values may incorporate recognition accuracy information, recognition speed information, and a user-specified weighting factor that shifts the relative effect of the recognition accuracy information and the recognition speed information on the merit values. The merit manager may then automatically perform a merit value optimization procedure to select operating parameters that correspond to an optimal one of the merit values.
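
    One way to picture the merit value is as a weighted blend of accuracy and speed, maximized over candidate parameter settings. The abstract does not give the formula, so the linear blend, the normalization of both quantities to the range 0 to 1, and the `beam_width` parameter below are assumptions made for illustration.

```python
def merit(accuracy, speed, weight):
    """Blend accuracy and speed (both assumed normalized to 0..1).

    weight is the user-specified factor: 1.0 favors accuracy only,
    0.0 favors speed only.
    """
    return weight * accuracy + (1.0 - weight) * speed


def best_parameters(trials, weight):
    """Pick the operating parameters of the trial with the highest merit value."""
    return max(trials, key=lambda t: merit(t["accuracy"], t["speed"], weight))["params"]


# Hypothetical measurements from running the recognizer with three settings.
trials = [
    {"params": {"beam_width": 50}, "accuracy": 0.80, "speed": 0.95},
    {"params": {"beam_width": 200}, "accuracy": 0.90, "speed": 0.70},
    {"params": {"beam_width": 800}, "accuracy": 0.93, "speed": 0.30},
]

accuracy_leaning = best_parameters(trials, weight=0.9)
speed_leaning = best_parameters(trials, weight=0.2)
```

    Here the exhaustive `max` over measured trials stands in for the optimization procedure. A real set-up would need some search strategy over the parameter space, which the abstract does not describe.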

    Speech recognition apparatus
    10.
    Granted patent
    Speech recognition apparatus (lapsed)

    Publication No.: US07240002B2

    Publication date: 2007-07-03

    Application No.: US10416092

    Filing date: 2001-11-07

    IPC classification: G10L15/04

    CPC classification: G10L15/08 G10L15/083

    Abstract: The present invention provides a speech recognition apparatus having high speech recognition performance and capable of performing speech recognition in a highly efficient manner. A matching unit 14 calculates the scores of words selected by a preliminary word selector 13 and determines a candidate for a speech recognition result on the basis of the calculated scores. A control unit 11 produces word connection relationships among the words included in a word series employed as a candidate for the speech recognition result and stores them in a word connection information storage unit 16. A reevaluation unit 15 corrects the word connection relationships one by one. On the basis of the corrected word connection relationships, the control unit 11 determines the speech recognition result. A word connection managing unit 21 limits the times at which a boundary between words, as represented by the word connection relationships, may be located. A word connection managing unit 22 limits the start times of words preliminarily selected by the preliminary word selector 13. The present invention can be applied to an interactive system that recognizes an input speech and responds to the speech recognition result.