-
公开(公告)号:US20060116997A1
公开(公告)日:2006-06-01
申请号:US10998451
申请日:2004-11-29
申请人: Roger Peng Yu , Frank Torsten Seide
发明人: Roger Peng Yu , Frank Torsten Seide
IPC分类号: G06F17/30
CPC分类号: G10L15/04
摘要: A method of identifying a location of a query string in an audio signal is provided. Under the method, a segment of the audio signal is selected. A score for a query string in the segment of the audio signal is determined by determining the product of probabilities of overlapping sequences of tokens. The score is then used to decide if the segment of the audio signal is likely to contain the query string.
-
公开(公告)号:US07680853B2
公开(公告)日:2010-03-16
申请号:US11401048
申请日:2006-04-10
IPC分类号: G06F17/30
CPC分类号: G06F17/30038 , G06F17/30746 , G06F17/30796 , G10L15/26 , G10L25/48
摘要: Search results are provided in a format that allows users to efficiently determine whether audio or video documents identified from a search query actually contain the words in the query. This is achieved by returning snippets of text around query term matches and allowing the user to play a segment of the audio signal by selecting a word in the snippet. In other embodiments, markers are placed on a timeline that represents the duration of the audio signal. Each marker represents a query term match and when selected causes the audio signal to begin to play near the temporal location represented by the marker.
摘要翻译: 搜索结果以格式提供,允许用户有效地确定从搜索查询中识别的音频或视频文档是否实际包含查询中的单词。 这是通过在查询词匹配之外返回文本的片断来实现的,并且允许用户通过在片段中选择一个词来播放音频信号的片段。 在其他实施例中,标记被放置在表示音频信号的持续时间的时间线上。 每个标记表示查询词匹配,并且当被选择时,音频信号开始在由标记表示的时间位置附近播放。
-
公开(公告)号:US07584098B2
公开(公告)日:2009-09-01
申请号:US10998451
申请日:2004-11-29
申请人: Roger Peng Yu , Frank Torsten Seide
发明人: Roger Peng Yu , Frank Torsten Seide
IPC分类号: G10L15/00
CPC分类号: G10L15/04
摘要: A method of identifying a location of a query string in an audio signal is provided. Under the method, a segment of the audio signal is selected. A score for a query string in the segment of the audio signal is determined by determining the product of probabilities of overlapping sequences of tokens. The score is then used to decide if the segment of the audio signal is likely to contain the query string.
摘要翻译: 提供了一种识别音频信号中的查询字符串的位置的方法。 在该方法下,选择音频信号的一段。 通过确定令牌的重叠序列的概率的乘积来确定音频信号的段中的查询串的分数。 然后,该分数用于确定音频信号的片段是否可能包含查询字符串。
-
4.
公开(公告)号:US20120096007A1
公开(公告)日:2012-04-19
申请号:US13325261
申请日:2011-12-14
IPC分类号: G06F17/30
CPC分类号: G06F17/3002 , G06F17/2785 , G06F17/30781 , G06F17/30864 , G10L15/04 , G10L15/065 , G10L15/26 , G11B27/10 , G11B27/28 , H04N5/76 , H04N5/765 , H04N5/781 , H04N5/85 , H04N5/907 , H04N5/91 , H04N9/8205
摘要: Content-based analysis is performed on multimedia content prior to encoding the multimedia content in the rendering chain of processing. A content-based index stream is generated based on the content-based analysis and the content-based index stream is embedded in the multimedia file during rendering. The content-based index stream can be used to generate a content-based searchable index when necessary.
摘要翻译: 在对处理的呈现链中的多媒体内容进行编码之前,对多媒体内容执行基于内容的分析。 基于内容的分析生成基于内容的索引流,并且在呈现期间将基于内容的索引流嵌入到多媒体文件中。 当需要时,基于内容的索引流可用于生成基于内容的可搜索索引。
-
公开(公告)号:US20070143110A1
公开(公告)日:2007-06-21
申请号:US11300735
申请日:2005-12-15
申请人: Alejandro Acero , Asela Gunawardana , Ciprian Chelba , Erik Selberg , Frank Torsten Seide , Patrick Nguyen , Roger Yu
发明人: Alejandro Acero , Asela Gunawardana , Ciprian Chelba , Erik Selberg , Frank Torsten Seide , Patrick Nguyen , Roger Yu
IPC分类号: G10L15/04
摘要: A computer-implemented method of indexing a speech lattice for search of audio corresponding to the speech lattice is provided. The method includes identifying at least two speech recognition hypotheses for a word which have time ranges satisfying a criteria. The method further includes merging the at least two speech recognition hypotheses to generate a merged speech recognition hypothesis for the word.
摘要翻译: 提供了一种用于索引用于搜索与语音格子相对应的音频的语音格子的计算机实现的方法。 该方法包括识别具有满足标准的时间范围的单词的至少两个语音识别假设。 该方法还包括合并至少两个语音识别假设以产生该单词的合并语音识别假设。
-
-
-
-