-
Publication No.: US20120041960A9
Publication date: 2012-02-16
Application No.: US12359939
Filing date: 2009-01-26
IPC classification: G06F17/30
CPC classification: G06F17/30864, G06F17/3053, G06F17/30675, Y10S707/99932, Y10S707/99933, Y10S707/99937, Y10S707/99938
Abstract: Methods of providing a document relevance score to a document on a network are disclosed. Computer-readable media having stored thereon computer-executable instructions for performing a method of providing a document relevance score to a document on a network are also disclosed. Further, computing systems containing at least one application module, wherein the at least one application module comprises application code for performing methods of providing a document relevance score to a document on a network, are disclosed.
-
Publication No.: US07499919B2
Publication date: 2009-03-03
Application No.: US11231955
Filing date: 2005-09-21
IPC classification: G06F17/30
CPC classification: G06F17/30864, G06F17/3053, G06F17/30675, Y10S707/99932, Y10S707/99933, Y10S707/99937, Y10S707/99938
Abstract: Methods of providing a document relevance score to a document on a network are disclosed. Computer-readable media having stored thereon computer-executable instructions for performing a method of providing a document relevance score to a document on a network are also disclosed. Further, computing systems containing at least one application module, wherein the at least one application module comprises application code for performing methods of providing a document relevance score to a document on a network, are disclosed.
-
Publication No.: US20060200464A1
Publication date: 2006-09-07
Application No.: US11072734
Filing date: 2005-03-03
Applicant: Michal Gideoni, David Lee, Dmitriy Meyerzon, Mihai Petriuc, Kyle Peltonen
Inventor: Michal Gideoni, David Lee, Dmitriy Meyerzon, Mihai Petriuc, Kyle Peltonen
CPC classification: G06F16/338, G06F16/345
Abstract: A text document is segmented into word and sentence information when the document is first presented and indexed. A memory stream is generated for the document. The memory stream includes document title information, word offsets, sentence offsets, the alternate list, and the contents of the document. The memory stream is used to determine which sentences in the document include query terms. The sentences that include query terms are ranked according to a ranking algorithm. The ranking algorithm determines which sentences include the highest number of query terms and the number of occurrences of the query terms in each sentence. A predetermined number of sentences that together contain as many query terms as possible are selected such that the sentences that are most representative of the document with respect to the query are included in the summary. The summary is generated at query time by concatenating the selected sentences with the query terms highlighted.
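Illustrative sketch (not taken from the patent): the sentence selection described in this abstract could look roughly like the Python below. The function name `summarize`, the regex-based sentence splitter, the greedy coverage heuristic, and the `**term**` highlighting are all assumptions made for illustration; the patent's memory stream, word and sentence offsets, and alternate list are not modeled, and segmentation is done at call time here rather than at index time.

```python
import re

def summarize(text, query_terms, max_sentences=2):
    """Pick up to max_sentences sentences that together cover many query terms."""
    terms = {t.lower() for t in query_terms}
    # Naive segmentation on sentence-ending punctuation (an assumption).
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

    # Keep only sentences containing at least one query term, recording
    # which distinct terms they contain and the total number of occurrences.
    candidates = []
    for pos, sentence in enumerate(sentences):
        words = re.findall(r"\w+", sentence.lower())
        hits = [w for w in words if w in terms]
        if hits:
            candidates.append((pos, sentence, set(hits), len(hits)))

    # Greedy selection: prefer the sentence adding the most not-yet-covered
    # terms, then the most occurrences, then the earliest position.
    chosen, covered = [], set()
    while candidates and len(chosen) < max_sentences:
        best = max(candidates, key=lambda c: (len(c[2] - covered), c[3], -c[0]))
        candidates.remove(best)
        chosen.append(best)
        covered |= best[2]

    # Concatenate in document order with query terms highlighted.
    chosen.sort(key=lambda c: c[0])
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(t) for t in sorted(terms)) + r")\b",
        re.IGNORECASE,
    )
    return " ".join(pattern.sub(lambda m: f"**{m.group(0)}**", c[1]) for c in chosen)
```

Called with a short text and the query terms "index" and "ranking", the function returns the covering sentences in their original order with each matched term wrapped in `**`.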
-
Publication No.: US07065523B2
Publication date: 2006-06-20
Application No.: US10959330
Filing date: 2004-10-06
Applicant: Kyle Peltonen, Dmitriy Meyerzon
Inventor: Kyle Peltonen, Dmitriy Meyerzon
IPC classification: G06F17/30
CPC classification: G06F17/30867, Y10S707/99931, Y10S707/99933, Y10S707/99934, Y10S707/99935, Y10S707/99936, Y10S707/99942, Y10S707/99943, Y10S707/99944, Y10S707/99945
Abstract: Systems and methods for scoping a search. When a content index for electronic data is built, one or more scope restrictions are included in the content index. The scope restriction may be, for example, a root folder identifier, a mailbox identifier, or a URL. Because the scope restriction is included in the content index, random access of the property store to determine the scope is avoided. Rather, the scope restriction is implicitly added to a search that uses the content index. By including a scope restriction in the search query, the search results identified from the content index are limited to results that match the scope restriction. Advantageously, the effect of including the scope restriction in the search is ignored if the search results are relatively small or when including the scope restriction provides little benefit.
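Illustrative sketch (not taken from the patent): one way to read this abstract is that a scope restriction is stored in the inverted index as just another key, so a scoped query becomes an intersection of posting sets and no per-document property lookup is needed. The class `ScopedIndex`, the `scope:` key prefix, and whitespace tokenization are assumptions made for illustration; the optimization of ignoring the scope for small result sets is not modeled.

```python
from collections import defaultdict

class ScopedIndex:
    """Toy inverted index that stores scope restrictions as ordinary keys."""

    def __init__(self):
        self.postings = defaultdict(set)  # key -> set of document ids

    def add(self, doc_id, text, scope):
        for word in text.lower().split():
            self.postings[word].add(doc_id)
        # The scope (folder id, mailbox id, URL, ...) is indexed like a term.
        # The "scope:" prefix is a hypothetical convention to avoid clashing
        # with content words.
        self.postings["scope:" + scope].add(doc_id)

    def search(self, words, scope=None):
        keys = [w.lower() for w in words]
        if scope is not None:
            # The scope restriction is implicitly added to the query.
            keys.append("scope:" + scope)
        if not keys:
            return set()
        result = None
        for key in keys:
            docs = self.postings.get(key, set())
            result = set(docs) if result is None else result & docs
        return result
```

With two documents that both contain "budget" but live in different mailboxes, an unscoped search returns both ids, while passing `scope="mailbox:alice"` returns only the document indexed under that mailbox.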
-
Publication No.: US20060074911A1
Publication date: 2006-04-06
Application No.: US10956891
Filing date: 2004-09-30
IPC classification: G06F17/30
CPC classification: G06F17/30861
Abstract: A process takes advantage of a structure of a server hosting a network site that includes a change log stored in a database to batch index documents for search queries. The content of the site is batched and shipped in bulk from the server to an indexer. The change log keeps track of the changes to the content of the site. The indexer incrementally requests updates to the index using the change log and batches the changes so that the bandwidth usage and processor overhead costs are reduced.
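Illustrative sketch (not taken from the patent): the change-log-driven incremental indexing described in this abstract might be modeled as below. The classes `SiteServer` and `Indexer`, the integer token, and the in-memory list standing in for the database-backed change log are all assumptions made for illustration.

```python
class SiteServer:
    """Toy content server that records every change in an append-only log."""

    def __init__(self):
        self.docs = {}
        self.change_log = []  # entries are (sequence_number, doc_id)

    def put(self, doc_id, content):
        self.docs[doc_id] = content
        self.change_log.append((len(self.change_log) + 1, doc_id))

    def delete(self, doc_id):
        self.docs.pop(doc_id, None)
        self.change_log.append((len(self.change_log) + 1, doc_id))

    def changes_since(self, token, batch_size):
        """Return (new_token, batch) for log entries after `token`."""
        entries = self.change_log[token:token + batch_size]
        # Repeated changes to one document collapse into a single item that
        # carries its current content (None means it has been deleted).
        batch = {doc_id: self.docs.get(doc_id) for _, doc_id in entries}
        return token + len(entries), batch


class Indexer:
    """Pulls batched changes from the server instead of recrawling the site."""

    def __init__(self, server, batch_size=100):
        self.server = server
        self.batch_size = batch_size
        self.token = 0   # position in the change log already processed
        self.index = {}  # doc_id -> content (stand-in for a real index)

    def sync(self):
        """Apply all pending changes; return the number of non-empty batches."""
        batches = 0
        while True:
            token, batch = self.server.changes_since(self.token, self.batch_size)
            if token == self.token:
                return batches
            batches += 1
            for doc_id, content in batch.items():
                if content is None:
                    self.index.pop(doc_id, None)
                else:
                    self.index[doc_id] = content
            self.token = token
```

After several edits on the server, a single `sync()` call transfers one collapsed batch, and a second call with nothing new transfers none, which is the bandwidth saving the abstract points at.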
-
Publication No.: US20060074865A1
Publication date: 2006-04-06
Application No.: US10951123
Filing date: 2004-09-27
Applicant: Chadd Merrigan, Kyle Peltonen, Dmitriy Meyerzon, David Lee
Inventor: Chadd Merrigan, Kyle Peltonen, Dmitriy Meyerzon, David Lee
IPC classification: G06F17/30
CPC classification: G06F17/30864, G06F17/30967, Y10S707/99931, Y10S707/99932, Y10S707/99933, Y10S707/99936, Y10S707/99943
Abstract: An index search system includes a set of index keys that are associated with the scope of the search rather than with the content of the documents that are the target of the search. These scope-related index keys, or scope keys, allow the scope of the search to be selected, reducing the number of documents that a search must sift through to obtain results. Furthermore, compound scopes are recognized and stored such that an index of complex search scopes is provided to eliminate rehashing of the searches based on these complex search scopes.
-
Publication No.: US20050044074A1
Publication date: 2005-02-24
Application No.: US10959330
Filing date: 2004-10-06
Applicant: Kyle Peltonen, Dmitriy Meyerzon
Inventor: Kyle Peltonen, Dmitriy Meyerzon
IPC classification: G06F17/30
CPC classification: G06F17/30867, Y10S707/99931, Y10S707/99933, Y10S707/99934, Y10S707/99935, Y10S707/99936, Y10S707/99942, Y10S707/99943, Y10S707/99944, Y10S707/99945
Abstract: Systems and methods for scoping a search. When a content index for electronic data is built, one or more scope restrictions are included in the content index. The scope restriction may be, for example, a root folder identifier, a mailbox identifier, or a URL. Because the scope restriction is included in the content index, random access of the property store to determine the scope is avoided. Rather, the scope restriction is implicitly added to a search that uses the content index. By including a scope restriction in the search query, the search results identified from the content index are limited to results that match the scope restriction. Advantageously, the effect of including the scope restriction in the search is ignored if the search results are relatively small or when including the scope restriction provides little benefit.
-
Publication No.: US20100191744A1
Publication date: 2010-07-29
Application No.: US12359939
Filing date: 2009-01-26
IPC classification: G06F17/30
CPC classification: G06F16/951, G06F16/24578, G06F16/334, Y10S707/99932, Y10S707/99933, Y10S707/99937, Y10S707/99938
Abstract: Methods of providing a document relevance score to a document on a network are disclosed. Computer-readable media having stored thereon computer-executable instructions for performing a method of providing a document relevance score to a document on a network are also disclosed. Further, computing systems containing at least one application module, wherein the at least one application module comprises application code for performing methods of providing a document relevance score to a document on a network, are disclosed.
-
Publication No.: US20070067284A1
Publication date: 2007-03-22
Application No.: US11231955
Filing date: 2005-09-21
IPC classification: G06F17/30
CPC classification: G06F17/30864, G06F17/3053, G06F17/30675, Y10S707/99932, Y10S707/99933, Y10S707/99937, Y10S707/99938
Abstract: Methods of providing a document relevance score to a document on a network are disclosed. Computer-readable media having stored thereon computer-executable instructions for performing a method of providing a document relevance score to a document on a network are also disclosed. Further, computing systems containing at least one application module, wherein the at least one application module comprises application code for performing methods of providing a document relevance score to a document on a network, are disclosed.
-
Publication No.: US07644107B2
Publication date: 2010-01-05
Application No.: US10956891
Filing date: 2004-09-30
CPC classification: G06F17/30861
Abstract: A process takes advantage of a structure of a server hosting a network site that includes a change log stored in a database to batch index documents for search queries. The content of the site is batched and shipped in bulk from the server to an indexer. The change log keeps track of the changes to the content of the site. The indexer incrementally requests updates to the index using the change log and batches the changes so that the bandwidth usage and processor overhead costs are reduced.