专利检索 ap:("Katsuki Minamino" OR "Hitoshi Honda" OR "Yoshinori Maeda" OR "Hiroaki Ogawa") AND inv:"Katsuki Minamino" 第 1 页

1.

发明授权
Voice processing device and method, and program 有权
标题翻译：语音处理装置及方法及程序

公开(公告)号：US08612223B2

公开(公告)日：2013-12-17

申请号：US12817526

申请日：2010-06-17

申请人： Katsuki Minamino , Hitoshi Honda , Yoshinori Maeda , Hiroaki Ogawa

发明人： Katsuki Minamino , Hitoshi Honda , Yoshinori Maeda , Hiroaki Ogawa

IPC分类号： G10L15/06

CPC分类号： G10L15/183

摘要： There is provided a voice processing device. The device includes: score calculation unit configured to calculate a score indicating compatibility of a voice signal input on the basis of an utterance of a user with each of plural pieces of intention information indicating each of a plurality of intentions; intention selection unit configured to select the intention information indicating the intention of the utterance of the user among the plural pieces of intention information on the basis of the score calculated by the score calculation unit; and intention reliability calculation unit configured to calculate the reliability with respect to the intention information selected by the intention selection unit on the basis of the score calculated by the score calculation unit.

摘要翻译： 提供语音处理装置。该设备包括：分数计算单元，被配置为计算表示基于用户的话语输入的语音信号的兼容性的分数与指示多个意图中的每一个的多个意图信息中的每一个; 意图选择单元，被配置为基于由分数计算单元计算的得分，在多个意图信息中选择表示用户的发音意图的意图信息; 以及意图可靠性计算单元，被配置为基于由分数计算单元计算出的分数来计算与意图选择单元选择的意图信息相关的可靠性。

2.

发明申请
VOICE RECOGNITION DEVICE AND VOICE RECOGNITION METHOD, LANGUAGE MODEL GENERATING DEVICE AND LANGUAGE MODEL GENERATING METHOD, AND COMPUTER PROGRAM 审中-公开
标题翻译：语音识别装置和语音识别方法，语言模型生成装置和语言模型生成方法以及计算机程序

公开(公告)号：US20100241418A1

公开(公告)日：2010-09-23

申请号：US12661164

申请日：2010-03-11

申请人： Yoshinori Maeda , Hitoshi Honda , Katsuki Minamino

发明人： Yoshinori Maeda , Hitoshi Honda , Katsuki Minamino

IPC分类号： G10L15/18 , G06F17/27 , G10L15/00

CPC分类号： G10L15/1815 , G10L15/183

摘要： A speech recognition device includes one intention extracting language model and more in which an intention of a focused specific task is inherent, an absorbing language model in which any intention of the task is not inherent, a language score calculating section that calculates a language score indicating a linguistic similarity between each of the intention extracting language model and the absorbing language model, and the content of an utterance, and a decoder that estimates an intention in the content of an utterance based on a language score of each of the language models calculated by the language score calculating section.

摘要翻译： 一种语音识别装置，包括一种意图提取语言模型，其中特定任务的意图是固有的，其中任务的任何意图不是固有的吸收语言模型;语言得分计算部分，其计算表示每个意图提取语言模型和吸收语言模型之间的语言相似性，以及话语的内容，以及解码器，其基于由语言模型计算的每个语言模型的语言得分来估计语音内容中的意图语言成绩计算部分。

3.

发明授权
Mapping determination methods and data discrimination methods using the same 失效
标题翻译：映射确定方法和使用该方法的数据鉴别方法

公开(公告)号：US5796921A

公开(公告)日：1998-08-18

申请号：US540948

申请日：1995-10-11

申请人： Katsuki Minamino , Masao Watari , Miyuki Tanaka , Kazuo Ishii , Yasuhiko Kato , Hiroaki Ogawa , Masanori Omote , Kazuo Watanabe , Hitoshi Honda

发明人： Katsuki Minamino , Masao Watari , Miyuki Tanaka , Kazuo Ishii , Yasuhiko Kato , Hiroaki Ogawa , Masanori Omote , Kazuo Watanabe , Hitoshi Honda

IPC分类号： G06K9/62 , G06F15/18

CPC分类号： G06K9/6232

摘要： A mapping determination method for obtaining mapping F from an N-dimensional metric vector space .OMEGA..sub.N to an M-dimensional metric vector space .OMEGA..sub.M has the following steps to get the optimal mapping quickly and positively. In the first step, complete, periodic, L.sub.m basic functions g.sub.m (X) according to the distribution of samples classified into Q categories on the N-dimensional metric vector space .OMEGA..sub.N are set. In the second step, a function f.sub.m (X) indicating the m-th component of the mapping F is expressed with the linear sum of the functions g.sub.m (X) and L.sub.m coefficients c.sub.m. The third step provides Q teacher vectors T.sub.q =(t.sub.q.1, t.sub.q.2, t.sub.q.3, . . . , t.sub.q.M) (where q=1, 2, . . . , Q) for the categories on the M-dimensional metric vector space .OMEGA..sub.M, calculates the specified estimation function J, and obtains the coefficients c.sub.m which minimize the estimation function J. In the fourth step, the coefficients c.sub.m obtained in the third step are stored in memory.

摘要翻译： 用于从N维量度向量空间OMEGA N到M维度量向量空间OMEGA M获得映射F的映射确定方法具有以下步骤以快速和积极地获得最佳映射。在第一步中，根据在N维量度向量空间OMEGA N上分类为Q类别的样本分布，完成，定期，Lm基本函数gm（X）。在第二步骤中，表示映射F的第m个分量的函数fm（X）用函数gm（X）和Lm系数cm的线性和表示。第三步为M类别提供Q教师向量Tq =（tq.1，tq.2，tq.3，...，tq.M）（其中q = 1,2，...，Q）维度度量向量空间OMEGA M计算指定的估计函数J，并获得使估计函数J最小化的系数cm。在第四步骤中，将第三步骤中获得的系数cm存储在存储器中。

4.

发明授权
Navigation apparatus, navigation method and automotive vehicles 失效
标题翻译：导航仪，导航方式和汽车

公开(公告)号：US6064323A

公开(公告)日：2000-05-16

申请号：US728914

申请日：1996-10-11

申请人： Kazuo Ishii , Eiji Yamamoto , Miyuki Tanaka , Hiroshi Kakuda , Yasuharu Asano , Hiroaki Ogawa , Masanori Omote , Katsuki Minamino

发明人： Kazuo Ishii , Eiji Yamamoto , Miyuki Tanaka , Hiroshi Kakuda , Yasuharu Asano , Hiroaki Ogawa , Masanori Omote , Katsuki Minamino

IPC分类号： G09B29/10 , G01C21/00 , G01C21/36 , G08G1/0969 , G10L13/00 , G10L15/00 , G08G1/123

CPC分类号： G01C21/3608 , G08G1/0969

摘要： A navigation apparatus and navigation method for an automobile in which a map is visually displayed and a desired destination can be set by speaking the name of such destination. A voice recognition section recognizes the destination and marks it on the map that is being displayed and the best route to the displayed destination is then shown on the map to be followed by the driver of the automobile.

摘要翻译： 一种用于汽车的导航装置和导航方法，其中可以通过说出这样的目的地的名称来可视地显示地图并且可以设置期望的目的地。语音识别部分识别目的地并将其标记在正被显示的地图上，然后在地图上显示要显示的目的地的最佳路线以跟随汽车驾驶员。

5.

发明授权
Speech recognition apparatus, speech recognition method, and storage medium 失效
标题翻译：语音识别装置，语音识别方法和存储介质

公开(公告)号：US07013277B2

公开(公告)日：2006-03-14

申请号：US09794887

申请日：2001-02-26

申请人： Katsuki Minamino , Yasuharu Asano , Hiroaki Ogawa , Helmut Lucke

发明人： Katsuki Minamino , Yasuharu Asano , Hiroaki Ogawa , Helmut Lucke

IPC分类号： G10L15/00

CPC分类号： G10L15/193 , G10L2015/085

摘要： A preliminary word-selecting section selects one or more words following words which have been obtained in a word string serving as a candidate for a result of speech recognition; and a matching section calculates acoustic or linguistic scores for the selected words, and forms a word string serving as a candidate for a result of speech recognition according to the scores. A control section generates word-connection relationships between words in the word string serving as a candidate for a result of speech recognition, sends them to a word-connection-information storage section, and stores them in it. A re-evaluation section corrects the word-connection relationships stored in the word-connection-information storage section 16, and the control section determines a word string serving as the result of speech recognition according to the corrected word-connection relationships.

摘要翻译： 初步字选择部选择在用作语音识别结果的候选者的字串中获得的一个或多个单词，并且匹配部分计算所选择的单词的声学或语言得分，并且根据分数形成用作语音识别结果的候选的词串。控制部分生成用作语音识别结果候选的字串中的字之间的字连接关系，将它们发送到字连接信息存储部分，并将它们存储在其中。重新评估部分校正存储在字连接信息存储部分16中的字连接关系，并且控制部分根据校正的字连接关系确定用作语音识别结果的字串。

6.

发明授权
Information processing apparatus, information processing method, and program 有权
标题翻译：信息处理装置，信息处理方法和程序

公开(公告)号：US08566094B2

公开(公告)日：2013-10-22

申请号：US13206631

申请日：2011-08-10

申请人： Katsuki Minamino , Atsuo Hiroe , Yoshinori Maeda , Satoshi Asakawa

发明人： Katsuki Minamino , Atsuo Hiroe , Yoshinori Maeda , Satoshi Asakawa

IPC分类号： G10L15/00

CPC分类号： G10L15/32 , G10L15/187 , G10L15/19

摘要： An apparatus, method and program for performing a speech recognition process utilizing contextual information that comprises an estimation of the intention of an utterance of a user. The recognition process includes calculating a pre-score based on observed contextual information according intention models which correspond to a plurality of types of intention information and combining the pre-scoring results with acoustic and linguistic scores to obtain an improved recognition or comprehension of the intent of a user utterance.

摘要翻译： 一种用于使用包括对用户的话语的意图的估计的上下文信息执行语音识别处理的装置，方法和程序。识别过程包括基于观察到的情境信息来计算预分数，该意图模型对应于多种类型的意图信息，并将预评分结果与声学和语言得分相结合，以获得对目标的意图的改进的识别或理解用户说话。

7.

发明授权
Speech recognition with score calculation 有权
标题翻译：语音识别与分数计算

公开(公告)号：US07249017B2

公开(公告)日：2007-07-24

申请号：US10785246

申请日：2004-02-24

申请人： Helmut Lucke , Katsuki Minamino , Yasuharu Asano , Hiroaki Ogawa

发明人： Helmut Lucke , Katsuki Minamino , Yasuharu Asano , Hiroaki Ogawa

IPC分类号： G10L15/08 , G06F17/27

CPC分类号： G10L15/187 , G10L2015/025

摘要： In order to prevent degradation of speech recognition accuracy due to an unknown word, a dictionary database has stored therein a word dictionary in which are stored, in addition to words for the objects of speech recognition, suffixes, which are sound elements and a sound element sequence, which form the unknown word, for classifying the unknown word by the part of speech thereof. Based on such a word dictionary, a matching section connects the acoustic models of an sound model database, and calculates the score using the series of features output by a feature extraction section on the basis of the connected acoustic model. Then, the matching section selects a series of the words, which represents the speech recognition result, on the basis of the score.

摘要翻译： 为了防止由于未知词引起的语音识别精度的降低，字典数据库中存储有词语词典，除了用于语音识别的对象的词之外，还存储有作为声音元素和声音元素的后缀的词典序列，其形成未知单词，用于通过其部分语音对未知单词进行分类。基于这样的词典，匹配部分连接声音模型数据库的声学模型，并且使用基于连接的声学模型的特征提取部分输出的一系列特征来计算分数。然后，匹配部分基于分数来选择表示语音识别结果的一系列单词。

8.

发明授权
Voice recognition apparatus and method, and recording medium 失效
标题翻译：语音识别装置和方法以及记录介质

公开(公告)号：US06961701B2

公开(公告)日：2005-11-01

申请号：US09798521

申请日：2001-03-03

申请人： Hiroaki Ogawa , Katsuki Minamino , Yasuharu Asano , Helmut Lucke

发明人： Hiroaki Ogawa , Katsuki Minamino , Yasuharu Asano , Helmut Lucke

IPC分类号： G10L15/18 , G10L15/02 , G10L17/00 , G10L15/08

CPC分类号： G10L17/02 , G10L17/16 , G10L2015/025

摘要： An extended-word selecting section calculates a score for a phoneme string formed of one more phonemes, corresponding to a user's speech, and searches a large-vocabulary-dictionary for a word having one or more phonemes equal to or similar to those of a phoneme string having a score equal to or higher than a predetermined value. A matching section calculates scores for the word searched for by the extended-word selecting section in addition to a word preliminary word-selecting section. A control section determines a word string as the result of recognition of the speech uttered by the user.

摘要翻译： 扩展字选择部分计算由与用户的语音相对应的一个以上音素形成的音素串的分数，并且搜索具有等于或类似于音素的一个或多个音素的单词的大词汇词典具有等于或高于预定值的分数的字符串。匹配部分除了字初步字选择部分之外，还计算由扩展字选择部分搜索的字的分数。控制部分确定作为用户发出的语音的识别结果的字串。

9.

发明授权
Information access system and recording medium 失效
标题翻译：信息访问系统和记录介质

公开(公告)号：US6161093A

公开(公告)日：2000-12-12

申请号：US164316

申请日：1998-10-01

申请人： Masao Watari , Makoto Akabane , Tetsuya Kagami , Kazuo Ishii , Yusuke Iwahashi , Yasuhiko Kato , Hiroaki Ogawa , Masanori Omote , Kazuo Watanabe , Katsuki Minamino , Yasuharu Asano

发明人： Masao Watari , Makoto Akabane , Tetsuya Kagami , Kazuo Ishii , Yusuke Iwahashi , Yasuhiko Kato , Hiroaki Ogawa , Masanori Omote , Kazuo Watanabe , Katsuki Minamino , Yasuharu Asano

IPC分类号： G06F17/30 , G09B21/00 , G10L21/00

CPC分类号： G06F17/30017 , G09B21/001

摘要： A book database stores at least phonetic signal information including phoneme information and rhythm information as document data, a central system transmits phonetic signal information stored on the book database to a terminal and the terminal receives the phonetic signal information is then carried out at the terminal and the document is then recited via synthesized sounds.

摘要翻译： 图书数据库至少存储包括音素信息和节奏信息的语音信息作为文档数据，中央系统将存储在书籍数据库上的语音信号信息发送到终端，并且终端接收语音信号信息然后在终端执行，然后通过合成的声音叙述文件。

10.

发明授权
Voice recognition apparatus, voice recognition method, map displaying apparatus, map displaying method, navigation apparatus, navigation method and car 失效
标题翻译：语音识别装置，语音识别方法，地图显示装置，地图显示方法，导航装置，导航方法和汽车

公开(公告)号：US5956684A

公开(公告)日：1999-09-21

申请号：US728910

申请日：1996-10-11

申请人： Kazuo Ishii , Eiji Yamamoto , Miyuki Tanaka , Hiroshi Kakuda , Yasuharu Asano , Hiroaki Ogawa , Masanori Omote , Katsuki Minamino

发明人： Kazuo Ishii , Eiji Yamamoto , Miyuki Tanaka , Hiroshi Kakuda , Yasuharu Asano , Hiroaki Ogawa , Masanori Omote , Katsuki Minamino

IPC分类号： G09B29/10 , G01C21/00 , G01C21/36 , G08G1/0969 , G10L13/00 , G10L15/00 , G10L15/06 , G10L15/22 , G10L15/26 , G10L15/28 , G10L5/02

CPC分类号： G01C21/3608 , G08G1/0969 , G10L15/26

摘要： Voice processing for recognizing a predetermined voice such as a place name is performed by a voice processing section 14 from an audio signal inputted from a microphone 11 on the basis of an operation of a talk switch 18. When a map display is based on the recognized place name is performed, an incorrect reading and a place name commonly mistaken can be also recognized. Accordingly, a high grade operation of a navigation apparatus can be simply performed without obstructing an operator driving while a car.

摘要翻译： 语音处理部14根据通话开关18的操作从麦克风11输入的音频信号，进行用于识别诸如地名的预定语音的语音处理。当地图显示是基于识别的执行地名，错误阅读和通常误认的地名也可以被识别。因此，导航装置的高等级的操作可以简单地执行，而不会妨碍驾驶者驾驶汽车。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类