-
公开(公告)号:US20090187564A1
公开(公告)日:2009-07-23
申请号:US12414570
申请日:2009-03-30
CPC分类号: G06F17/30666 , G06F17/30622 , Y10S707/99933
摘要: Phrases in a corpus of documents including stopwords are found using a data processor arranged to execute phrase queries. Memory stores an index structure which maps entries in the index structure to documents in the corpus. Entries in the index structure represent words and other entries represent stopwords found in the corpus coalesced with prefixes of respective adjacent words adjacent to the stopwords. The prefixes comprise one or more leading characters of the respective adjacent words. A query processor forms a modified query by substituting a stopword with a search token representing the stopword coalesced with a prefix of the next word in the query. The processor executes the modified query. Also, index structures including coalesced stopwords are created and maintained.
摘要翻译: 使用安排执行短语查询的数据处理器可以找到包含词性的文档语料库中的短语。 内存存储将索引结构中的条目映射到语料库中的文档的索引结构。 索引结构中的条目表示单词,并且其他条目表示在语料库中找到的与词语相邻的各个相邻单词的前缀合并的词条。 前缀包括各个相邻单词的一个或多个主要字符。 查询处理器通过用表示与查询中的下一个单词的前缀合并的停止词的搜索标记来代替停止词来形成修改的查询。 处理器执行修改后的查询。 此外,创建和维护包括合并的停用词的索引结构。
-
公开(公告)号:US20070027853A1
公开(公告)日:2007-02-01
申请号:US11391889
申请日:2006-03-29
IPC分类号: G06F17/30
CPC分类号: G06F17/30666 , G06F17/30622 , Y10S707/99933
摘要: Phrases in a corpus of documents including stopwords are found using a data processor arranged to execute phrase queries. Memory stores an index structure which maps entries in the index structure to documents in the corpus. Entries in the index structure represent words and other entries represent stopwords found in the corpus coalesced with prefixes of respective adjacent words adjacent to the stopwords. The prefixes comprise one or more leading characters of the respective adjacent words. A query processor forms a modified query by substituting a stopword with a search token representing the stopword coalesced with a prefix of the next word in the query. The processor executes the modified query. Also, index structures including coalesced stopwords are created and maintained.
摘要翻译: 使用安排执行短语查询的数据处理器可以找到包含词性的文档语料库中的短语。 内存存储将索引结构中的条目映射到语料库中的文档的索引结构。 索引结构中的条目表示单词,并且其他条目表示在语料库中找到的与词语相邻的各个相邻单词的前缀合并的词条。 前缀包括各个相邻单词的一个或多个主要字符。 查询处理器通过用表示与查询中的下一个单词的前缀合并的停止词的搜索标记来代替停止词来形成修改的查询。 处理器执行修改后的查询。 此外,创建和维护包括合并的停用词的索引结构。
-
公开(公告)号:US08560683B2
公开(公告)日:2013-10-15
申请号:US13158313
申请日:2011-06-10
IPC分类号: G06F15/16 , G06F15/173
CPC分类号: H04N21/254 , H04L67/12 , H04N21/2407 , H04N21/44222 , H04N21/6582 , H04N21/812
摘要: Analytics describing video data published to one or more destination sites are calculated. Metrics describing performance of the video data, such as performance in different geographical areas, in different demographics and in different devices are calculated. An interface simplifies calculation of the video metrics to simplify analysis by allowing a user to identify different videos or sets of videos for analysis. Additionally, interaction with one or more web pages including the video data is also captured and combined with video data performance metrics. Integrating web page interaction data and video performance metrics provide a user with a more accurate description of how visitors interact with content presented using the one or more web pages.
摘要翻译: 计算描述发布到一个或多个目标站点的视频数据的分析。 计算描述视频数据的性能的度量,例如在不同地理区域,不同人口统计学和不同设备中的性能。 界面简化了视频度量的计算,以简化分析,允许用户识别不同的视频或视频集用于分析。 此外,还捕获与包括视频数据的一个或多个网页的交互并与视频数据性能度量相结合。 集成网页交互数据和视频性能指标为用户提供了更准确的描述,了解访客如何与使用一个或多个网页呈现的内容进行互动。
-
公开(公告)号:US20090193005A1
公开(公告)日:2009-07-30
申请号:US12414581
申请日:2009-03-30
IPC分类号: G06F17/30
CPC分类号: G06F17/30666 , G06F17/30622 , Y10S707/99933
摘要: Words having selected characteristics in a corpus of documents are found using a data processor arranged to execute queries. Memory stores an index structure in which entries in the index structure map words and marks for words having the selected characteristics to locations within documents in the corpus. Entries in the index structure represent words and other entries represent marks with the location information of a marked word. The entries for the marks can be tokens coalesced with prefixes of respective marked words or adjacent. A query processor forms a modified query by adding a mark for a word to the query. The processor executes the modified query.
摘要翻译: 使用被布置为执行查询的数据处理器来找到在文档语料库中具有选择特征的词。 存储器存储索引结构,其中索引结构中的条目将具有所选特征的单词和标记映射到语料库中的文档内的位置。 索引结构中的条目表示单词,其他条目表示具有标记词的位置信息的标记。 标记的条目可以是标记与各个标记的词的前缀或相邻的令牌。 查询处理器通过向查询添加单词的标记来形成修改的查询。 处理器执行修改后的查询。
-
公开(公告)号:US20070027854A1
公开(公告)日:2007-02-01
申请号:US11391890
申请日:2006-03-29
申请人: Ramana Rao , Swapnil Hajela , Nareshkumar Rajkumar
发明人: Ramana Rao , Swapnil Hajela , Nareshkumar Rajkumar
IPC分类号: G06F17/30
CPC分类号: G06F17/3066 , G06F17/30622 , Y10S707/99933
摘要: Words having selected characteristics in a corpus of documents are found using a data processor arranged to execute queries. Memory stores an index structure in which entries in the index structure map words and marks for words having the selected characteristics to locations within documents in the corpus. Entries in the index structure represent words and other entries represent marks with the location information of a marked word. The entries for the marks can be tokens coalesced with prefixes of respective marked words or adjacent. A query processor forms a modified query by adding a mark for a word to the query. The processor executes the modified query.
摘要翻译: 使用被布置为执行查询的数据处理器来找到在文档语料库中具有选择特征的词。 存储器存储索引结构,其中索引结构中的条目将具有所选特征的单词和标记映射到语料库中的文档内的位置。 索引结构中的条目表示单词,其他条目表示具有标记词的位置信息的标记。 标记的条目可以是标记与各个标记的词的前缀或相邻的令牌。 查询处理器通过向查询添加单词的标记来形成修改的查询。 处理器执行修改后的查询。
-
-
-
-