-
Publication No.: US20080270110A1
Publication date: 2008-10-30
Application No.: US11742150
Filing date: 2007-04-30
IPC classification: G06F17/28
CPC classification: G10L15/06, G06F16/433, G06F16/434, G06F16/4393, G06F16/48, G06F16/61, G06F16/685, G10L2015/228
Abstract: A method of recognizing speech includes extracting textual content from a visual content time segment associated with a rich media presentation. A textual content input comprising a word from the extracted textual content is created. The textual content input is provided to an automatic speech recognition algorithm such that there is an increased probability that the automatic speech recognition algorithm recognizes the word within an audio content time segment associated with the rich media presentation.
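The abstract above does not say how the textual content input raises the recognition probability. One simple way to picture it, offered only as a sketch and not as the patented method, is to boost the unigram prior of every word that also appears in the text extracted from the visual segment. The function names, the `boost` factor, and the toy vocabulary below are all hypothetical.

```python
def build_textual_content_input(extracted_text):
    """Split text extracted from a visual segment (e.g. a slide) into lowercase words."""
    words = {w.strip(".,;:!?()\"'").lower() for w in extracted_text.split()}
    return words - {""}


def boost_unigrams(base_probs, hint_words, boost=5.0):
    """Multiply the prior of each hinted word by `boost`, then renormalize.

    Assumption: the recognizer exposes a unigram prior that can be reweighted.
    Hint words missing from the recognizer vocabulary are ignored here.
    """
    weighted = {
        word: prob * (boost if word in hint_words else 1.0)
        for word, prob in base_probs.items()
    }
    total = sum(weighted.values())
    return {word: prob / total for word, prob in weighted.items()}


# Toy vocabulary prior (hypothetical numbers).
base = {"wreck": 0.4, "recognize": 0.1, "speech": 0.3, "beach": 0.2}
hints = build_textual_content_input("Speech Recognition: recognize speech")
boosted = boost_unigrams(base, hints)
```

With this reweighting, `recognize` and `speech` gain probability mass relative to the acoustically similar `wreck` and `beach` for the audio segment that accompanies the slide.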
-
Publication No.: US20080270344A1
Publication date: 2008-10-30
Application No.: US11742125
Filing date: 2007-04-30
CPC classification: G06F16/433, G06F16/434, G06F16/4393, G06F16/48, G06F16/61, G06F16/685, G06F16/78, G06F16/7837, G06F16/7844
Abstract: A method of generating a set of search results. An audio content search results set including an individual audio content search result corresponding to a rich media time segment is generated. A visual content search results set including an individual visual content search result corresponding to the rich media time segment is also generated. A relevance of the rich media time segment is determined based at least in part on an individual search result count. The individual search result count is a sum of a number of individual audio content search results corresponding to the rich media time segment and a number of individual visual content search results corresponding to the rich media time segment. The rich media time segment is included in an ordered set of search results, wherein an order of the rich media time segment is based at least in part on the determined relevance.
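The count described in the abstract above can be sketched as follows. This is a minimal illustration, assuming each individual search result is represented only by the identifier of the time segment it falls in. The abstract says relevance is based "at least in part" on the count; this sketch uses the count as the only signal, which is a simplification. The function name and segment identifiers are hypothetical.

```python
from collections import Counter


def rank_segments(audio_hits, visual_hits):
    """Order time segments by audio result count plus visual result count.

    Each list holds one segment id per individual search result.
    Ties are broken by segment id so the ordering is deterministic.
    """
    counts = Counter(audio_hits)
    counts.update(visual_hits)  # adds the visual counts to the audio counts
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


audio_hits = ["seg1", "seg2", "seg2"]   # individual audio content results
visual_hits = ["seg2", "seg3", "seg1"]  # individual visual content results
ranked = rank_segments(audio_hits, visual_hits)
```

Here `seg2` ranks first with a count of 3 (two audio results plus one visual result), followed by `seg1` with 2 and `seg3` with 1.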
-
Publication No.: US07983915B2
Publication date: 2011-07-19
Application No.: US11742137
Filing date: 2007-04-30
CPC classification: G06F17/30056, G06F17/30026, G06F17/30038, G06F17/30047, G06F17/30746, G06F17/30778, G10L15/26, G10L2015/025
Abstract: A method of generating an audio content index for use by a search engine includes determining a phoneme sequence based on recognized speech from an audio content time segment. The method also includes identifying k-phonemes which occur within the phoneme sequence. The identified k-phonemes are stored within a data structure such that the identified k-phonemes are capable of being compared with k-phonemes from a search query.
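The indexing idea in the abstract above can be sketched as follows. The abstract does not define "k-phoneme"; this sketch assumes it means a contiguous run of k phonemes, and it assumes an inverted index (k-phoneme to positions) as the data structure. The function names and the ARPAbet-style phoneme labels are illustrative, not taken from the patent.

```python
from collections import defaultdict


def k_phonemes(phonemes, k):
    """Return every contiguous run of k phonemes as a tuple (empty if too short)."""
    return [tuple(phonemes[i:i + k]) for i in range(len(phonemes) - k + 1)]


def build_index(phonemes, k=3):
    """Map each k-phoneme to the positions where it starts in the sequence."""
    index = defaultdict(list)
    for position, gram in enumerate(k_phonemes(phonemes, k)):
        index[gram].append(position)
    return dict(index)


def match_positions(index, query_phonemes, k=3):
    """Look up each k-phoneme of the query and keep the ones found in the index."""
    return {
        gram: index[gram]
        for gram in k_phonemes(query_phonemes, k)
        if gram in index
    }


# Phoneme sequence from a recognized audio segment (hypothetical).
audio = ["S", "P", "IY", "CH", "R", "EH", "K"]
index = build_index(audio, k=3)
hits = match_positions(index, ["S", "P", "IY", "CH"], k=3)
```

Because both the audio and the query are broken into the same k-phoneme units, the comparison reduces to dictionary lookups rather than a scan of the full phoneme sequence.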
-
Publication No.: US20080270138A1
Publication date: 2008-10-30
Application No.: US11742137
Filing date: 2007-04-30
IPC classification: G10L13/00
CPC classification: G06F17/30056, G06F17/30026, G06F17/30038, G06F17/30047, G06F17/30746, G06F17/30778, G10L15/26, G10L2015/025
Abstract: A method of generating an audio content index for use by a search engine includes determining a phoneme sequence based on recognized speech from an audio content time segment. The method also includes identifying k-phonemes which occur within the phoneme sequence. The identified k-phonemes are stored within a data structure such that the identified k-phonemes are capable of being compared with k-phonemes from a search query.
-