专利检索 ap:("Atsuo Hiroe" OR "Hideki Shimomura" OR "Helmut Lucke" OR "Katsuki Minamino" OR "Haru Kato") AND inv:"Katsuki Minamino" 第 1 页

1.

发明申请
Audio conversation device, method, and robot device 审中-公开
标题翻译：音频会话设备，方法和机器人设备

公开(公告)号：US20060177802A1

公开(公告)日：2006-08-10

申请号：US10549795

申请日：2004-03-16

申请人： Atsuo Hiroe , Hideki Shimomura , Helmut Lucke , Katsuki Minamino , Haru Kato

发明人： Atsuo Hiroe , Hideki Shimomura , Helmut Lucke , Katsuki Minamino , Haru Kato

IPC分类号： G09B19/04

CPC分类号： G10L13/00

摘要： In a conventional voice dialogue system, there is a case where it is difficult to perform a natural dialogue with the user. Therefore, we designed to perform speech recognition on the user's utterance, to control a dialogue with the user according to a scenario previously given, based on the speech recognition result to generate an answering sentence corresponding to the contents of the user's utterance as the occasion demands, and to perform voice synthesis processing to one sentence in the reproduced scenario or the generated answering sentence.

摘要翻译： 在常规语音对话系统中，存在难以与用户进行自然对话的情况。因此，我们设计为在用户的话语上执行语音识别，以根据语音识别结果根据先前给出的场景来控制与用户的对话，以产生与用户话语的内容相对应的应答语句并且对再现的场景或所生成的应答语句中的一个句子执行语音合成处理。

2.

发明授权
Information processing apparatus, information processing method, and program 有权
标题翻译：信息处理装置，信息处理方法和程序

公开(公告)号：US08566094B2

公开(公告)日：2013-10-22

申请号：US13206631

申请日：2011-08-10

申请人： Katsuki Minamino , Atsuo Hiroe , Yoshinori Maeda , Satoshi Asakawa

发明人： Katsuki Minamino , Atsuo Hiroe , Yoshinori Maeda , Satoshi Asakawa

IPC分类号： G10L15/00

CPC分类号： G10L15/32 , G10L15/187 , G10L15/19

摘要： An apparatus, method and program for performing a speech recognition process utilizing contextual information that comprises an estimation of the intention of an utterance of a user. The recognition process includes calculating a pre-score based on observed contextual information according intention models which correspond to a plurality of types of intention information and combining the pre-scoring results with acoustic and linguistic scores to obtain an improved recognition or comprehension of the intent of a user utterance.

摘要翻译： 一种用于使用包括对用户的话语的意图的估计的上下文信息执行语音识别处理的装置，方法和程序。识别过程包括基于观察到的情境信息来计算预分数，该意图模型对应于多种类型的意图信息，并将预评分结果与声学和语言得分相结合，以获得对目标的意图的改进的识别或理解用户说话。

3.

发明授权
Robot apparatus, method and device for recognition of letters or characters, control program and recording medium 失效
标题翻译：用于识别字母或字符的机器人装置，方法和装置，控制程序和记录介质

公开(公告)号：US07088853B2

公开(公告)日：2006-08-08

申请号：US10336201

申请日：2002-12-31

申请人： Atsuo Hiroe , Katsuki Minamino , Kenta Kawamoto , Kohtaro Sabe , Takeshi Ohashi

发明人： Atsuo Hiroe , Katsuki Minamino , Kenta Kawamoto , Kohtaro Sabe , Takeshi Ohashi

IPC分类号： G06K9/00 , G10L15/00 , G10L13/00

CPC分类号： G10L13/047 , G10L13/00 , G10L15/06 , G10L15/24 , G10L2015/0631

摘要： A plural number of letters or characters, inferred from the results of letter/character recognition of an image photographed by a CCD camera (20), a plural number of kana readings inferred from the letters or characters and the way of pronunciation corresponding to the kana readings are generated in an pronunciation information generating unit (150) and the plural readings obtained are matched to the pronunciation from the user acquired by a microphone (23) to specify one kana reading and the way of pronunciation (reading) from among the plural generated candidates.

摘要翻译： 从由CCD照相机（20）拍摄的图像的字母/字符识别的结果推断的多个字母或字符，从字母或字符推断的多个假名读数和对应于假名的发音方式在发音信息生成单元（150）中产生读数，并且所获得的多个读数与由麦克风（23）获取的用户的发音相匹配，以从多个生成的多个生成单元中指定一个假名读数和发音（读取）的方式候选人。

4.

发明授权
Speech recognition apparatus, speech recognition method, and storage medium 失效
标题翻译：语音识别装置，语音识别方法和存储介质

公开(公告)号：US07013277B2

公开(公告)日：2006-03-14

申请号：US09794887

申请日：2001-02-26

申请人： Katsuki Minamino , Yasuharu Asano , Hiroaki Ogawa , Helmut Lucke

发明人： Katsuki Minamino , Yasuharu Asano , Hiroaki Ogawa , Helmut Lucke

IPC分类号： G10L15/00

CPC分类号： G10L15/193 , G10L2015/085

摘要： A preliminary word-selecting section selects one or more words following words which have been obtained in a word string serving as a candidate for a result of speech recognition; and a matching section calculates acoustic or linguistic scores for the selected words, and forms a word string serving as a candidate for a result of speech recognition according to the scores. A control section generates word-connection relationships between words in the word string serving as a candidate for a result of speech recognition, sends them to a word-connection-information storage section, and stores them in it. A re-evaluation section corrects the word-connection relationships stored in the word-connection-information storage section 16, and the control section determines a word string serving as the result of speech recognition according to the corrected word-connection relationships.

摘要翻译： 初步字选择部选择在用作语音识别结果的候选者的字串中获得的一个或多个单词，并且匹配部分计算所选择的单词的声学或语言得分，并且根据分数形成用作语音识别结果的候选的词串。控制部分生成用作语音识别结果候选的字串中的字之间的字连接关系，将它们发送到字连接信息存储部分，并将它们存储在其中。重新评估部分校正存储在字连接信息存储部分16中的字连接关系，并且控制部分根据校正的字连接关系确定用作语音识别结果的字串。

5.

发明授权
Speech recognition with score calculation 有权
标题翻译：语音识别与分数计算

公开(公告)号：US07249017B2

公开(公告)日：2007-07-24

申请号：US10785246

申请日：2004-02-24

申请人： Helmut Lucke , Katsuki Minamino , Yasuharu Asano , Hiroaki Ogawa

发明人： Helmut Lucke , Katsuki Minamino , Yasuharu Asano , Hiroaki Ogawa

IPC分类号： G10L15/08 , G06F17/27

CPC分类号： G10L15/187 , G10L2015/025

摘要： In order to prevent degradation of speech recognition accuracy due to an unknown word, a dictionary database has stored therein a word dictionary in which are stored, in addition to words for the objects of speech recognition, suffixes, which are sound elements and a sound element sequence, which form the unknown word, for classifying the unknown word by the part of speech thereof. Based on such a word dictionary, a matching section connects the acoustic models of an sound model database, and calculates the score using the series of features output by a feature extraction section on the basis of the connected acoustic model. Then, the matching section selects a series of the words, which represents the speech recognition result, on the basis of the score.

摘要翻译： 为了防止由于未知词引起的语音识别精度的降低，字典数据库中存储有词语词典，除了用于语音识别的对象的词之外，还存储有作为声音元素和声音元素的后缀的词典序列，其形成未知单词，用于通过其部分语音对未知单词进行分类。基于这样的词典，匹配部分连接声音模型数据库的声学模型，并且使用基于连接的声学模型的特征提取部分输出的一系列特征来计算分数。然后，匹配部分基于分数来选择表示语音识别结果的一系列单词。

6.

发明授权
Voice recognition apparatus and method, and recording medium 失效
标题翻译：语音识别装置和方法以及记录介质

公开(公告)号：US06961701B2

公开(公告)日：2005-11-01

申请号：US09798521

申请日：2001-03-03

申请人： Hiroaki Ogawa , Katsuki Minamino , Yasuharu Asano , Helmut Lucke

发明人： Hiroaki Ogawa , Katsuki Minamino , Yasuharu Asano , Helmut Lucke

IPC分类号： G10L15/18 , G10L15/02 , G10L17/00 , G10L15/08

CPC分类号： G10L17/02 , G10L17/16 , G10L2015/025

摘要： An extended-word selecting section calculates a score for a phoneme string formed of one more phonemes, corresponding to a user's speech, and searches a large-vocabulary-dictionary for a word having one or more phonemes equal to or similar to those of a phoneme string having a score equal to or higher than a predetermined value. A matching section calculates scores for the word searched for by the extended-word selecting section in addition to a word preliminary word-selecting section. A control section determines a word string as the result of recognition of the speech uttered by the user.

摘要翻译： 扩展字选择部分计算由与用户的语音相对应的一个以上音素形成的音素串的分数，并且搜索具有等于或类似于音素的一个或多个音素的单词的大词汇词典具有等于或高于预定值的分数的字符串。匹配部分除了字初步字选择部分之外，还计算由扩展字选择部分搜索的字的分数。控制部分确定作为用户发出的语音的识别结果的字串。

7.

发明申请
Speech recognition apparatus 失效
标题翻译：语音识别装置

公开(公告)号：US20050075877A1

公开(公告)日：2005-04-07

申请号：US10416092

申请日：2001-11-07

申请人： Katsuki Minamino , Yasuharu Asano , Hiroaki Ogawa , Helmut Lucke

发明人： Katsuki Minamino , Yasuharu Asano , Hiroaki Ogawa , Helmut Lucke

IPC分类号： G10L15/18 , G10L15/08 , G10L15/10 , G10L15/14 , G10L15/00

CPC分类号： G10L15/08 , G10L15/083

摘要： A speech recognizing device for efficient processing while keeping a high speech recognizing performance. A matching unit (14) computes the score of a word preliminarily selected by a word preliminary selection unit (13) and determines candidates of the speech recognition result on the basis of the score. A control unit (11) creates a word connection relation between the words of a word sequence, which is a candidate of the speech recognition result and stores them in a word connection information storage unit (16). A revaluation unit (15) corrects the word connection relation serially, and the control unit ( 11) defines the speech recognition result on the basis of the word connection relation corrected. A word connection relation managing unit (21) limits the time corresponding to the boundary of a word expressed by the word connection relation, and a word connection relation managing unit (22) limits the starting time of the word preliminarily selected by the word preliminary selection unit (13). The speech recognizing device can be applied to an interactive system which responds to the speech recognition result.

摘要翻译： 一种用于在保持高语音识别性能的同时高效处理的语音识别装置。匹配单元（14）计算由词初步选择单元（13）预先选择的单词的分数，并根据得分确定语音识别结果的候选。控制单元（11）创建作为语音识别结果的候选者的字序列的字之间的字连接关系，并将它们存储在字连接信息存储单元（16）中。重估单元（15）串行地校正字连接关系，并且控制单元（11）基于校正的字连接关系来定义语音识别结果。字连接关系管理单元（21）限制与字连接关系所表示的字的边界对应的时间，并且字连接关系管理单元（22）限制由初始选择预先选择的单词的开始时间单位（13）。语音识别装置可以应用于响应于语音识别结果的交互式系统。

8.

发明授权
Speech recognition device and speech recognition method and recording medium utilizing preliminary word selection 失效
标题翻译：语音识别装置和语音识别方法以及采用初步选词的记录媒体

公开(公告)号：US07881935B2

公开(公告)日：2011-02-01

申请号：US10019125

申请日：2001-02-16

申请人： Yasuharu Asano , Katsuki Minamino , Hiroaki Ogawa , Helmut Lucke

发明人： Yasuharu Asano , Katsuki Minamino , Hiroaki Ogawa , Helmut Lucke

IPC分类号： G10L15/04 , G10L15/14

CPC分类号： G10L15/08 , G10L15/18 , G10L2015/085

摘要： A speech recognition apparatus in which the accuracy in speech recognition is improved as the resource is prevented from increasing. Such a word which is probable as the result of the speech recognition is selected on the basis of an acoustic score and a linguistic score, while word selection is also performed on the basis of a measure different from the acoustic score, such as the number of phonemes being small, a part of speech being a pre-set one, inclusion in the past results of speech recognition or the linguistic score being not less than a pre-set value. The words so selected are subjected to matching processing.

摘要翻译： 一种在防止资源增加时语音识别精度提高的语音识别装置。基于声学得分和语言得分来选择可能作为语音识别结果的这样一个词，而基于与声分数不同的度量来执行词选择，例如，音素很小，部分言语是预先设定的，包括过去的语音识别结果或语言分数不低于预设值。所选择的单词将进行匹配处理。

9.

发明授权
System and method for an automatic set-up of speech recognition engines 失效
标题翻译：用于语音识别引擎自动设置的系统和方法

公开(公告)号：US07716047B2

公开(公告)日：2010-05-11

申请号：US10403730

申请日：2003-03-31

申请人： Gustavo Hernandez-Abrego , Xavier Menendez-Pidal , Thomas Kemp , Katsuki Minamino , Helmut Lucke

发明人： Gustavo Hernandez-Abrego , Xavier Menendez-Pidal , Thomas Kemp , Katsuki Minamino , Helmut Lucke

IPC分类号： G10L15/00 , G06F15/18

CPC分类号： G10L15/28

摘要： A system and method for an automatic set-up of speech recognition engines may include a speech recognizer configured to perform speech recognition procedures to identify input speech data according to one or more operating parameters. A merit manager may be utilized to automatically calculate merit values corresponding to the foregoing recognition procedures. These merit values may incorporate recognition accuracy information, recognition speed information, and a user-specified weighting factor that shifts the relative effect of the recognition accuracy information and the recognition speed information on the merit values. The merit manager may then automatically perform a merit value optimization procedure to select operating parameters that correspond to an optimal one of the merit values.

摘要翻译： 用于语音识别引擎的自动设置的系统和方法可以包括被配置为执行语音识别过程以根据一个或多个操作参数来识别输入语音数据的语音识别器。可以使用优点管理器来自动计算对应于前述识别过程的优点值。这些优点值可以包括识别精度信息，识别速度信息和用户指定的加权因子，其将识别精度信息和识别速度信息的相对影响偏移在优值上。然后，优点管理器可以自动执行优值优化过程，以选择对应于优值的最佳值的操作参数。

10.

发明授权
Speech recognition apparatus 失效
标题翻译：语音识别装置

公开(公告)号：US07240002B2

公开(公告)日：2007-07-03

申请号：US10416092

申请日：2001-11-07

申请人： Katsuki Minamino , Yasuharu Asano , Hiroaki Ogawa , Helmut Lucke

发明人： Katsuki Minamino , Yasuharu Asano , Hiroaki Ogawa , Helmut Lucke

IPC分类号： G10L15/04

CPC分类号： G10L15/08 , G10L15/083

摘要： The present invention provides a speech recognition apparatus having high speech recognition performance and capable of performing speech recognition in a highly efficient manner. A matching unit 14 calculates the scores of words selected by a preliminary word selector 13 and determines a candidate for a speech recognition result on the basis of the calculated scores. A control unit 11 produces word connection relationships among words included in a word series employed as a candidate for the speech recognition result and stores them into a word connection information storage unit 16. A reevaluation unit 15 corrects the word connection relationships one by one. On the basis of the corrected word connection relationships, the control unit 11 determines the speech recognition result. A word connection managing unit 21 limits times allowed for a boundary between words represented by the word connection relationships to be located thereat. A word connection managing unit 22 limits start times of words preliminarily selected by the preliminary word selector 13. The present invention can be applied to an interactive system that recognizes an input speech and responds to the speech recognition result.

摘要翻译： 本发明提供了具有高语音识别性能并且能够以高效的方式执行语音识别的语音识别装置。匹配单元14计算由初步词选择器13选择的单词的分数，并且基于所计算的分数来确定语音识别结果的候选。控制单元11产生用作语音识别结果候选的单词序列中包含的单词之间的字连接关系，并将它们存储到单词连接信息存储单元16中。重新评估单元15逐个地修正单词连接关系。基于校正后的字连接关系，控制单元11确定语音识别结果。字连接管理单元21限制由字连接关系所表示的字之间的边界所允许的时间。字连接管理单元22限制由初步词选择器13预先选择的单词的开始时间。本发明可以应用于识别输入语音并响应于语音识别结果的交互式系统。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类