-
Publication No.: US07451090B2
Publication date: 2008-11-11
Application No.: US11135083
Filing date: 2005-05-23
Applicant: Kenichiro Nakagawa, Makoto Hirota, Hiromi Ikeda, Tsuyoshi Yagisawa, Hiroki Yamamoto, Toshiaki Fukada, Yasuhiro Komori
Inventor: Kenichiro Nakagawa, Makoto Hirota, Hiromi Ikeda, Tsuyoshi Yagisawa, Hiroki Yamamoto, Toshiaki Fukada, Yasuhiro Komori
IPC classification: G10L21/00
CPC classification: G06F17/30265
Abstract: In a system implementing image retrieval by performing speech recognition on voice information added to an image, the speech recognition is triggered by an event, such as an image upload event, that is not an explicit speech-recognition order event. The system obtains voice information added to an image, detects an event, and performs speech recognition on the obtained voice information in response to a specific event, even if the detected event is not an explicit speech-recognition order event.
-
Publication No.: US20050267747A1
Publication date: 2005-12-01
Application No.: US11135083
Filing date: 2005-05-23
Applicant: Kenichiro Nakagawa, Makoto Hirota, Hiromi Ikeda, Tsuyoshi Yagisawa, Hiroki Yamamoto, Toshiaki Fukada, Yasuhiro Komori
Inventor: Kenichiro Nakagawa, Makoto Hirota, Hiromi Ikeda, Tsuyoshi Yagisawa, Hiroki Yamamoto, Toshiaki Fukada, Yasuhiro Komori
CPC classification: G06F17/30265
Abstract: In a system implementing image retrieval by performing speech recognition on voice information added to an image, the speech recognition is triggered by an event, such as an image upload event, that is not an explicit speech-recognition order event. The system obtains voice information added to an image, detects an event, and performs speech recognition on the obtained voice information in response to a specific event, even if the detected event is not an explicit speech-recognition order event.
-
Publication No.: US20080040108A1
Publication date: 2008-02-14
Application No.: US11834369
Filing date: 2007-08-06
IPC classification: G10L15/00
CPC classification: G10L15/20, G10L15/065
Abstract: In order to implement proper sensitivity setting with respect to a connected speech input device, this invention includes a connector for detachable connection of a speech input device, a detection unit which detects that the speech input device has been connected to the connector, and a setting unit which sets a set value for adjusting a parameter of a speech signal input from the speech input device through the connector in accordance with detection of connection of the speech input device by the detection unit.
-
Publication No.: US20090109297A1
Publication date: 2009-04-30
Application No.: US12257798
Filing date: 2008-10-24
CPC classification: H04N5/232, G10L15/26, G11B27/10, H04N5/23254, H04N5/765, H04N5/772, H04N5/775, H04N9/8047, H04N9/8063, H04N9/8205
Abstract: An image capturing apparatus of this invention includes an audio acquisition unit which acquires audio data, a speech processing unit which analyzes the acquired audio data and detects predetermined audio data, an image capturing unit which captures image data by activating a shutter when the speech processing unit detects the predetermined audio data, and a storage unit which stores the audio data acquired by the audio acquisition unit before the shutter is activated in association with image data captured upon activating the shutter.
-
Publication No.: US08126720B2
Publication date: 2012-02-28
Application No.: US12257798
Filing date: 2008-10-24
IPC classification: G10L21/00
CPC classification: H04N5/232, G10L15/26, G11B27/10, H04N5/23254, H04N5/765, H04N5/772, H04N5/775, H04N9/8047, H04N9/8063, H04N9/8205
Abstract: An image capturing apparatus of this invention includes an audio acquisition unit which acquires audio data, a speech processing unit which analyzes the acquired audio data and detects predetermined audio data, an image capturing unit which captures image data by activating a shutter when the speech processing unit detects the predetermined audio data, and a storage unit which stores the audio data acquired by the audio acquisition unit before the shutter is activated in association with image data captured upon activating the shutter.
-
Publication No.: US07376332B2
Publication date: 2008-05-20
Application No.: US10964227
Filing date: 2004-10-12
Applicant: Hiromi Ikeda, Tsuyoshi Yagisawa, Toshiaki Fukada
Inventor: Hiromi Ikeda, Tsuyoshi Yagisawa, Toshiaki Fukada
CPC classification: G06F17/30265
Abstract: In an information processing apparatus or method for presenting multimedia data, a storage unit holds an object in an image, such as an image, characters, or symbols, and sound data associated with the object. Metadata of the object is referred to, and an output parameter of the sound data associated with the object is determined based on the metadata. Then, a sound output unit outputs the sound data at a sound volume or the like based on the output parameter.
-
Publication No.: US20050097439A1
Publication date: 2005-05-05
Application No.: US10964227
Filing date: 2004-10-12
Applicant: Hiromi Ikeda, Tsuyoshi Yagisawa, Toshiaki Fukada
Inventor: Hiromi Ikeda, Tsuyoshi Yagisawa, Toshiaki Fukada
CPC classification: G06F17/30265
Abstract: In an information processing apparatus or method for presenting multimedia data, a storage unit holds an object in an image, such as an image, characters, or symbols, and sound data associated with the object. Metadata of the object is referred to, and an output parameter of the sound data associated with the object is determined based on the metadata. Then, a sound output unit outputs the sound data at a sound volume or the like based on the output parameter.
-
Publication No.: US20070046645A1
Publication date: 2007-03-01
Application No.: US11462670
Filing date: 2006-08-04
Applicant: Makoto Hirota, Toshiaki Fukada, Yasuhiro Komori
Inventor: Makoto Hirota, Toshiaki Fukada, Yasuhiro Komori
IPC classification: G09G5/00
CPC classification: G10L15/24, G06K9/6293, G10L2015/025
Abstract: In an information processing method for recognizing a handwritten figure or character with use of a speech input in combination, in order to increase the recognition accuracy, a given target is subjected to figure recognition and a first candidate figure list is obtained. Input speech information is phonetically recognized and a second candidate figure list is obtained. On the basis of the figure candidates obtained by the figure recognition and the figure candidates obtained by the speech recognition, a most likely figure is selected.
-
Publication No.: US07706615B2
Publication date: 2010-04-27
Application No.: US11462670
Filing date: 2006-08-04
Applicant: Makoto Hirota, Toshiaki Fukada, Yasuhiro Komori
Inventor: Makoto Hirota, Toshiaki Fukada, Yasuhiro Komori
CPC classification: G10L15/24, G06K9/6293, G10L2015/025
Abstract: In an information processing method for recognizing a handwritten figure or character with use of a speech input in combination, in order to increase the recognition accuracy, a given target is subjected to figure recognition and a first candidate figure list is obtained. Input speech information is phonetically recognized and a second candidate figure list is obtained. On the basis of the figure candidates obtained by the figure recognition and the figure candidates obtained by the speech recognition, a most likely figure is selected.
-
Publication No.: US07480073B2
Publication date: 2009-01-20
Application No.: US10705859
Filing date: 2003-11-13
Applicant: Hiromi Ikeda, Toshiaki Fukada, Makoto Hirota
Inventor: Hiromi Ikeda, Toshiaki Fukada, Makoto Hirota
IPC classification: H04N1/40
CPC classification: H04N1/32117, H04N1/00811, H04N1/00822, H04N2201/3216, H04N2201/3222, H04N2201/3226, H04N2201/3232, H04N2201/3276, H04N2201/3277
Abstract: Disclosed is a method that makes it easier to configure an image processing apparatus capable of reading an image and processing the image in accordance with settings information, the method including an identification step of identifying whether the read image is a document that carries settings information; and a setting step of setting the settings information, which is carried on the read image, if the read image has been identified as being a document carrying the settings information.