Patent search: ap:("Makoto Hirota" OR "Toshiaki Fukada" OR "Yasuhiro Komori") AND inv:"Yasuhiro Komori" (page 1)

1.

Granted patent
Information processing device and information processing method (status: in force)

Publication No.: US07451090B2

Publication date: 2008-11-11

Application No.: US11135083

Filing date: 2005-05-23

Applicants: Kenichiro Nakagawa, Makoto Hirota, Hiromi Ikeda, Tsuyoshi Yagisawa, Hiroki Yamamoto, Toshiaki Fukada, Yasuhiro Komori

Inventors: Kenichiro Nakagawa, Makoto Hirota, Hiromi Ikeda, Tsuyoshi Yagisawa, Hiroki Yamamoto, Toshiaki Fukada, Yasuhiro Komori

IPC classification: G10L21/00

CPC classification: G06F17/30265

Abstract: In a system implementing image retrieval by performing speech recognition on voice information added to an image, the speech recognition is triggered by an event, such as an image upload event, that is not an explicit speech-recognition order event. The system obtains voice information added to an image, detects an event, and performs speech recognition on the obtained voice information in response to a specific event, even if the detected event is not an explicit speech-recognition order event.

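The triggering idea in this abstract can be illustrated with a minimal sketch. The event names, the `AnnotatedImage` type, and the `fake_recognizer` stand-in are all hypothetical; the abstract only says that an event such as an image upload, rather than an explicit recognition command, starts recognition.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical set of events that implicitly trigger recognition.
# The abstract names only an "image upload event" as an example.
TRIGGER_EVENTS = {"image_upload"}


@dataclass
class AnnotatedImage:
    name: str
    voice_memo: Optional[bytes] = None   # voice information added to the image
    transcript: Optional[str] = None     # filled in once recognition has run


def handle_event(event: str, image: AnnotatedImage,
                 recognize: Callable[[bytes], str]) -> bool:
    """Run recognition if the event is a trigger; return True if it ran."""
    if event not in TRIGGER_EVENTS or image.voice_memo is None:
        return False
    image.transcript = recognize(image.voice_memo)
    return True


def fake_recognizer(audio: bytes) -> str:
    # Stand-in for a real speech recognizer (assumption for illustration).
    return audio.decode("utf-8")
```

Once a transcript is stored with the image, a text query can match it, which is how the retrieval step would plausibly use the result.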

2.

Patent application
INFORMATION PROCESSING METHOD AND INFORMATION PROCESSING DEVICE (status: no longer in force)

Publication No.: US20070046645A1

Publication date: 2007-03-01

Application No.: US11462670

Filing date: 2006-08-04

Applicants: Makoto Hirota, Toshiaki Fukada, Yasuhiro Komori

Inventors: Makoto Hirota, Toshiaki Fukada, Yasuhiro Komori

IPC classification: G09G5/00

CPC classification: G10L15/24, G06K9/6293, G10L2015/025

Abstract: In an information processing method for recognizing a handwritten figure or character, with use of a speech input in combination, in order to increase the recognition accuracy a given target is subjected to figure recognition and a first candidate figure list is obtained. Input speech information is phonetically recognized and a second candidate figure list is obtained. On the basis of the figure candidates obtained by the figure recognition and the figure candidates obtained by the speech recognition, a most likely figure is selected.

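As a rough illustration of combining the two candidate lists, the sketch below multiplies per-label scores from each recognizer and keeps the best label. The product rule, the `floor` value for labels missing from one list, and the example scores are assumptions; the abstract does not say how the candidates are combined.

```python
from typing import Dict


def fuse_candidates(figure_scores: Dict[str, float],
                    speech_scores: Dict[str, float],
                    floor: float = 1e-6) -> str:
    """Return the label with the highest product of the two scores.

    A label missing from one list gets the small `floor` score instead
    of zero, so it is penalized but not ruled out (an assumption).
    """
    labels = set(figure_scores) | set(speech_scores)

    def combined(label: str) -> float:
        return figure_scores.get(label, floor) * speech_scores.get(label, floor)

    # Sort first so that ties are broken deterministically.
    return max(sorted(labels), key=combined)


# Hypothetical scores: handwriting slightly prefers "ellipse",
# but the spoken word strongly suggests "circle".
figure_candidates = {"ellipse": 0.50, "circle": 0.45, "square": 0.05}
speech_candidates = {"circle": 0.70, "triangle": 0.20, "square": 0.10}
best = fuse_candidates(figure_candidates, speech_candidates)  # "circle"
```

Here the handwriting recognizer alone would have chosen "ellipse"; the speech evidence tips the decision to "circle".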

3.

Granted patent
Information processing method and information processing device (status: no longer in force)

Publication No.: US07706615B2

Publication date: 2010-04-27

Application No.: US11462670

Filing date: 2006-08-04

Applicants: Makoto Hirota, Toshiaki Fukada, Yasuhiro Komori

Inventors: Makoto Hirota, Toshiaki Fukada, Yasuhiro Komori

IPC classification: G06K9/00, G10L15/00

CPC classification: G10L15/24, G06K9/6293, G10L2015/025

Abstract: In an information processing method for recognizing a handwritten figure or character, with use of a speech input in combination, in order to increase the recognition accuracy a given target is subjected to figure recognition and a first candidate figure list is obtained. Input speech information is phonetically recognized and a second candidate figure list is obtained. On the basis of the figure candidates obtained by the figure recognition and the figure candidates obtained by the speech recognition, a most likely figure is selected.

4.

Patent application
Information processing device and information processing method (status: in force)

Publication No.: US20050267747A1

Publication date: 2005-12-01

Application No.: US11135083

Filing date: 2005-05-23

Applicants: Kenichiro Nakagawa, Makoto Hirota, Hiromi Ikeda, Tsuyoshi Yagisawa, Hiroki Yamamoto, Toshiaki Fukada, Yasuhiro Komori

Inventors: Kenichiro Nakagawa, Makoto Hirota, Hiromi Ikeda, Tsuyoshi Yagisawa, Hiroki Yamamoto, Toshiaki Fukada, Yasuhiro Komori

IPC classification: G06F17/30, G10L15/00, G10L15/28

CPC classification: G06F17/30265

Abstract: In a system implementing image retrieval by performing speech recognition on voice information added to an image, the speech recognition is triggered by an event, such as an image upload event, that is not an explicit speech-recognition order event. The system obtains voice information added to an image, detects an event, and performs speech recognition on the obtained voice information in response to a specific event, even if the detected event is not an explicit speech-recognition order event.

5.

Granted patent
Speech recognition method and apparatus (status: no longer in force)

Publication No.: US07565290B2

Publication date: 2009-07-21

Application No.: US11165167

Filing date: 2005-06-24

Applicants: Hideo Kuboyama, Toshiaki Fukada, Yasuhiro Komori

Inventors: Hideo Kuboyama, Toshiaki Fukada, Yasuhiro Komori

IPC classification: G10L15/00

CPC classification: G10L15/142

Abstract: A speech recognition apparatus includes a word dictionary having recognition target words, a first acoustic model which expresses a reference pattern of a speech unit by one or more states, a second acoustic model which is lower in precision than said first acoustic model, selection means for selecting one of said first acoustic model and said second acoustic model on the basis of a parameter associated with a state of interest, and likelihood calculation means for calculating a likelihood of an acoustic feature parameter with respect to said acoustic model selected by said selection means.

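The selection step can be sketched as follows, under loud assumptions: the detailed model is a one-dimensional Gaussian mixture, the coarse model is a single Gaussian, and the "parameter associated with a state of interest" is a hypothetical score `state_param` compared against a threshold. The abstract does not specify any of these.

```python
import math
from typing import List, Tuple

# (weight, mean, variance); one-dimensional for brevity (assumption).
Gaussian = Tuple[float, float, float]


def log_gauss(x: float, mean: float, var: float) -> float:
    """Log density of a 1-D Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)


def log_likelihood(x: float, mixture: List[Gaussian]) -> float:
    """Log likelihood of x under a Gaussian mixture."""
    return math.log(sum(w * math.exp(log_gauss(x, m, v)) for w, m, v in mixture))


def state_log_likelihood(x: float,
                         detailed: List[Gaussian],
                         coarse: List[Gaussian],
                         state_param: float,
                         threshold: float = 0.5) -> float:
    """Score x with the detailed model only when the state's parameter
    reaches the threshold; otherwise use the cheaper coarse model."""
    model = detailed if state_param >= threshold else coarse
    return log_likelihood(x, model)
```

The point of such a scheme is cost: the coarse model needs fewer Gaussian evaluations, so states judged less critical can be scored more cheaply.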

6.

Patent application
Segment set creating method and apparatus (status: no longer in force)

Publication No.: US20060069566A1

Publication date: 2006-03-30

Application No.: US11225178

Filing date: 2005-09-14

Applicants: Toshiaki Fukada, Masayuki Yamada, Yasuhiro Komori

Inventors: Toshiaki Fukada, Masayuki Yamada, Yasuhiro Komori

IPC classification: G10L13/08

CPC classification: G10L13/06

Abstract: A segment set before updating is read, and clustering considering a phoneme environment is performed to it. For each cluster obtained by the clustering, a representative segment of a segment set belonging to the cluster is generated. For each cluster, a segment belonging to the cluster is replaced with the representative segment so as to update the segment set.
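A simplified sketch of this update is shown below. It assumes that "phoneme environment" means the left and right neighboring phonemes, that segments with the same environment form one cluster, and that the representative is the element-wise mean of equal-length feature vectors. These choices are illustrative only; the abstract does not describe the clustering method or how the representative is generated.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# (phoneme, left context, right context, feature vector); hypothetical layout.
Segment = Tuple[str, str, str, List[float]]
Key = Tuple[str, str, str]


def update_segment_set(segments: List[Segment]) -> List[Segment]:
    """Cluster segments by phoneme environment and keep one
    representative (the element-wise mean) per cluster."""
    clusters: Dict[Key, List[List[float]]] = defaultdict(list)
    for phoneme, left, right, feats in segments:
        clusters[(phoneme, left, right)].append(feats)

    updated = []
    for (phoneme, left, right), members in clusters.items():
        # zip(*members) walks the feature vectors column by column.
        centroid = [sum(col) / len(col) for col in zip(*members)]
        updated.append((phoneme, left, right, centroid))
    return updated
```

With this rule the updated set is never larger than the original, since every cluster collapses to a single segment.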

7.

Patent application
Speech recognition method and apparatus (status: no longer in force)

Publication No.: US20050288929A1

Publication date: 2005-12-29

Application No.: US11165167

Filing date: 2005-06-24

Applicants: Hideo Kuboyama, Toshiaki Fukada, Yasuhiro Komori

Inventors: Hideo Kuboyama, Toshiaki Fukada, Yasuhiro Komori

IPC classification: G10L15/04, G10L15/14

CPC classification: G10L15/142

Abstract: A speech recognition apparatus includes a word dictionary having recognition target words, a first acoustic model which expresses a reference pattern of a speech unit by one or more states, a second acoustic model which is lower in precision than said first acoustic model, selection means for selecting one of said first acoustic model and said second acoustic model on the basis of a parameter associated with a state of interest, and likelihood calculation means for calculating a likelihood of an acoustic feature parameter with respect to said acoustic model selected by said selection means.

8.

Granted patent
Speech synthesis method and apparatus, and dictionary generation method and apparatus (status: no longer in force)

Publication No.: US07546241B2

Publication date: 2009-06-09

Application No.: US10449072

Filing date: 2003-06-02

Applicants: Masayuki Yamada, Yasuhiro Komori, Toshiaki Fukada

Inventors: Masayuki Yamada, Yasuhiro Komori, Toshiaki Fukada

IPC classification: G10L13/08

CPC classification: G10L13/06, G10L13/04

Abstract: In a speech synthesis process, micro-segments are cut from acquired waveform data and a window function. The obtained micro-segments are re-arranged to implement a desired prosody, and superposed data is generated by superposing the re-arranged micro-segments, so as to obtain synthetic speech waveform data. A spectrum correction filter is formed based on the acquired waveform data. At least one of the waveform data, micro-segments, and superposed data is corrected using the spectrum correction filter. In this way, “blur” of a speech spectrum due to the window function applied to obtain micro-segments is reduced, and speech synthesis with high sound quality is realized.

9.

Patent application
Apparatus and method for detecting signal (status: no longer in force)

Publication No.: US20050131689A1

Publication date: 2005-06-16

Application No.: US11007245

Filing date: 2004-12-09

Applicants: Philip Garner, Toshiaki Fukada, Yasuhiro Komori

Inventors: Philip Garner, Toshiaki Fukada, Yasuhiro Komori

IPC classification: G10L11/02, G10L15/04, G10L15/20, G10L21/02, G10L15/12

CPC classification: G10L25/78

Abstract: Robust signal detection against various types of background noise is implemented. According to a signal detection apparatus and method of this invention, the feature amount of an input signal sequence and the feature amount of a noise component contained in the signal sequence are extracted. After that, the first likelihood indicating probability that the signal sequence is detected and the second likelihood indicating probability that the noise component is detected are calculated on the basis of a predetermined signal-to-noise ratio and the extracted feature amount of the signal sequence. Additionally, a likelihood ratio indicating the ratio between the first likelihood and the second likelihood is calculated. Detection of the signal sequence is determined on the basis of the likelihood ratio.

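A likelihood-ratio test of this general kind can be sketched as below. The modeling choices are assumptions made for illustration: the feature is a single frame energy in dB, noise is Gaussian around an estimated noise level, and the signal model is the same Gaussian shifted up by the predetermined SNR. The abstract does not give the actual likelihood models.

```python
import math
from typing import Tuple


def log_gauss(x: float, mean: float, var: float) -> float:
    """Log density of a 1-D Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)


def detect(frame_db: float, noise_mean_db: float, noise_var: float,
           snr_db: float = 10.0, threshold: float = 0.0) -> Tuple[bool, float]:
    """Return (signal detected?, log likelihood ratio) for one frame."""
    # Assumed signal model: noise level raised by the predetermined SNR.
    signal_mean_db = noise_mean_db + snr_db
    # In the log domain the likelihood ratio becomes a difference.
    log_lr = (log_gauss(frame_db, signal_mean_db, noise_var)
              - log_gauss(frame_db, noise_mean_db, noise_var))
    return log_lr > threshold, log_lr
```

A frame at the noise level yields a negative log ratio and is rejected, while a frame `snr_db` above it yields a positive one and is accepted.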

10.

Granted patent
Signal processing apparatus and method (status: no longer in force)

Publication No.: US07756707B2

Publication date: 2010-07-13

Application No.: US11082931

Filing date: 2005-03-18

Applicants: Philip Garner, Toshiaki Fukada, Yasuhiro Komori

Inventors: Philip Garner, Toshiaki Fukada, Yasuhiro Komori

IPC classification: G10L15/20

CPC classification: G10L25/87

Abstract: A signal processing apparatus and method for performing a robust endpoint detection of a signal are provided. An input signal sequence is divided into frames each of which has a predetermined time length. The presence of the signal in the frame is detected. After that, the filter process of smoothing the detection result by using the detection result for a past frame is applied to the detection result for a current frame. The filter output is compared with a predetermined threshold value to determine the state of the signal sequence of the current frame on the basis of the comparison result.

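The smoothing-and-threshold step can be illustrated with a first-order recursive filter over per-frame detection results. The filter form, the `alpha` coefficient, and the threshold are assumptions; the abstract says only that past-frame results smooth the current one and that the filter output is compared with a threshold.

```python
from typing import Iterable, List


def smooth_and_threshold(frame_decisions: Iterable[int],
                         alpha: float = 0.5,
                         threshold: float = 0.5) -> List[bool]:
    """Smooth raw per-frame detections (0 or 1) and threshold the result.

    filtered[t] = alpha * filtered[t-1] + (1 - alpha) * decision[t]
    """
    states = []
    filtered = 0.0
    for decision in frame_decisions:
        filtered = alpha * filtered + (1 - alpha) * float(decision)
        states.append(filtered > threshold)
    return states


# The isolated detection at index 1 is suppressed; the sustained run
# starting at index 3 is reported as signal.
raw = [0, 1, 0, 1, 1, 1, 0, 0, 0]
states = smooth_and_threshold(raw)
```

A larger `alpha` gives more weight to past frames, which suppresses short spikes more strongly at the cost of reacting later to true endpoints.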
