Index replication using crawl modification information
    1.
    发明授权
    Index replication using crawl modification information 有权
    索引复制使用爬网修改信息

    公开(公告)号:US07945533B2

    公开(公告)日:2011-05-17

    申请号:US11710100

    申请日:2007-02-23

    IPC分类号: G06F7/00 G06F15/16

    CPC分类号: G06F17/30864

    摘要: Systems, methodologies, media, and other embodiments associated with index replication using crawl modification information are described. One exemplary system embodiment includes an enterprise search system comprising a target search system comprising an index logic that uses modified crawl information related to items associated with sources to maintain an index that supports searching of the items; and, a crawl search system comprising a pipeline processor configured to receive modified crawl information related to the items and to propagate the modified crawl information to the target system.

    摘要翻译: 描述了使用爬行修改信息与索引复制相关联的系统,方法,媒体和其他实施例。 一个示例性系统实施例包括企业搜索系统,其包括目标搜索系统,该目标搜索系统包括索引逻辑,该索引逻辑使用与源相关联的项目相关的修改的爬网信息来维护支持搜索项的索引; 以及爬行搜索系统,包括流水线处理器,其被配置为接收与所述项目相关的经修改的爬网信息,并将修改的抓取信息传播到所述目标系统。

    Extensible mechanism for grouping search results
    2.
    发明申请
    Extensible mechanism for grouping search results 有权
    用于分组搜索结果的可扩展机制

    公开(公告)号:US20090100039A1

    公开(公告)日:2009-04-16

    申请号:US11974085

    申请日:2007-10-11

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864

    摘要: Systems, methods, and other embodiments associated with grouping automated search results are described. One embodiment includes a computer-readable medium storing computer-executable instructions operable to perform a method that includes identifying items to group. The method also includes selectively grouping a first item and a second item upon determining that a comparison of a metadata attributes indicates that the first item and the second item are to be treated as members of a group.

    摘要翻译: 描述了与分组自动搜索结果相关联的系统,方法和其他实施例。 一个实施例包括存储可操作以执行包括识别要分组的项目的方法的计算机可执行指令的计算机可读介质。 该方法还包括在确定元数据属性的比较指示将第一项目和第二个项目视为组的成员时,选择性地对第一项目和第二项目进行分组。

    Index replication using crawl modification information
    3.
    发明申请
    Index replication using crawl modification information 有权
    索引复制使用爬网修改信息

    公开(公告)号:US20070208716A1

    公开(公告)日:2007-09-06

    申请号:US11710100

    申请日:2007-02-23

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864

    摘要: Systems, methodologies, media, and other embodiments associated with index replication using crawl modification information are described. One exemplary system embodiment includes an enterprise search system comprising a target search system comprising an index logic that uses modified crawl information related to items associated with sources to maintain an index that supports searching of the items; and, a crawl search system comprising a pipeline processor configured to receive modified crawl information related to the items and to propagate the modified crawl information to the target system.

    摘要翻译: 描述了使用爬行修改信息与索引复制相关联的系统,方法,媒体和其他实施例。 一个示例性系统实施例包括企业搜索系统,其包括目标搜索系统,该目标搜索系统包括索引逻辑,该索引逻辑使用与源相关联的项目相关的修改的爬网信息来维护支持搜索项的索引; 以及爬行搜索系统,包括流水线处理器,其被配置为接收与所述项目相关的经修改的爬网信息,并将修改的抓取信息传播到所述目标系统。

    Techniques for managing XML data associated with multiple execution units
    4.
    发明授权
    Techniques for managing XML data associated with multiple execution units 有权
    用于管理与多个执行单元相关联的XML数据的技术

    公开(公告)号:US08949220B2

    公开(公告)日:2015-02-03

    申请号:US10810152

    申请日:2004-03-26

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/3092

    摘要: Techniques for managing XML data associated with multiple execution units ensure that execution units are able to use XML data coming from other execution units. Such techniques are applicable when, but for the technique, an XML type value is produced in a particular form by one execution unit and is supposed to be consumed by another execution unit that is unable to process data in the particular form, and involves detecting that the foregoing situation exists and annotating information sent to an XML producer execution unit to cause the XML type value to be transformed into a canonical form that can be shared by all relevant execution units.

    摘要翻译: 用于管理与多个执行单元相关联的XML数据的技术确保执行单元能够使用来自其他执行单元的XML数据。 这种技术适用于但是对于技术而言,XML类型值由一个执行单元以特定形式产生并且被假定由不能处理特定形式的数据的另一执行单元消耗,并且涉及检测该 存在上述情况并且向XML生成器执行单元注释信息,以使XML类型值被转换成可由所有相关执行单元共享的规范形式。

    Extensible mechanism for grouping search results
    5.
    发明授权
    Extensible mechanism for grouping search results 有权
    用于分组搜索结果的可扩展机制

    公开(公告)号:US08271493B2

    公开(公告)日:2012-09-18

    申请号:US11974085

    申请日:2007-10-11

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864

    摘要: Systems, methods, and other embodiments associated with grouping automated search results are described. One embodiment includes a computer-readable medium storing computer-executable instructions operable to perform a method that includes identifying items to group. The method also includes selectively grouping a first item and a second item upon determining that a comparison of a metadata attributes indicates that the first item and the second item are to be treated as members of a group.

    摘要翻译: 描述了与分组自动搜索结果相关联的系统,方法和其他实施例。 一个实施例包括存储可操作以执行包括识别要分组的项目的方法的计算机可执行指令的计算机可读介质。 该方法还包括在确定元数据属性的比较指示将第一项目和第二个项目视为组的成员时,选择性地对第一项目和第二项目进行分组。

    Document summarization
    6.
    发明授权
    Document summarization 有权
    文件总结

    公开(公告)号:US08027979B2

    公开(公告)日:2011-09-27

    申请号:US12829766

    申请日:2010-07-02

    IPC分类号: G06F7/00 G06F17/00

    CPC分类号: G06F17/30719

    摘要: Systems, methods, and other embodiments associated with automatically summarizing a document are described. One method embodiment includes computing term scores for members of a set of terms in a document to be summarized and computing sentence scores for sentences in a set of sentences in the document. The method embodiment also includes computing a set of entries for a term-sentence matrix that relates terms to sentences. The method embodiment also includes computing a dominant topic for the document and simultaneously ranking the set of terms and the set of sentences based on the dominant topic. The method embodiment provides a summarization item(s) selected from the set of terms and/or the set of sentences.

    摘要翻译: 描述与自动总结文档相关联的系统,方法和其他实施例。 一个方法实施例包括计算要汇总的文档中的一组术语的成员的术语分数,以及计算文档中一组句子中的句子的句子分数。 方法实施例还包括计算用于将术语与句子相关联的术语矩阵的条目集合。 该方法实施例还包括计算文档的主导主题,并且基于主题来同时对该组语句和一组句子进行排序。 该方法实施例提供从该组项和/或一组句子中选择的摘要项目。

    Document summarization
    7.
    发明授权
    Document summarization 有权
    文件总结

    公开(公告)号:US07783640B2

    公开(公告)日:2010-08-24

    申请号:US11647871

    申请日:2006-12-29

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30719

    摘要: Systems, methods, and other embodiments associated with automatically summarizing a document are described. One method embodiment includes computing term scores for members of a set of terms in a document to be summarized and computing sentence scores for sentences in a set of sentences in the document. The method embodiment also includes computing a set of entries for a term-sentence matrix that relates terms to sentences. The method embodiment also includes computing a dominant topic for the document and simultaneously ranking the set of terms and the set of sentences based on the dominant topic. The method embodiment provides a summarization item(s) selected from the set of terms and/or the set of sentences.

    摘要翻译: 描述与自动总结文档相关联的系统,方法和其他实施例。 一个方法实施例包括计算要汇总的文档中的一组术语的成员的术语分数,以及计算文档中一组句子中的句子的句子分数。 方法实施例还包括计算用于将术语与句子相关联的术语矩阵的条目集合。 该方法实施例还包括计算文档的主导主题,并且基于主题来同时对该组语句和一组句子进行排序。 该方法实施例提供从该组项和/或一组句子中选择的摘要项目。

    Extensible mechanism for detecting duplicate search items
    8.
    发明授权
    Extensible mechanism for detecting duplicate search items 有权
    用于检测重复搜索项的可扩展机制

    公开(公告)号:US07756798B2

    公开(公告)日:2010-07-13

    申请号:US11714418

    申请日:2007-03-06

    IPC分类号: G06N5/00

    CPC分类号: H04L51/12

    摘要: Systems, methods, and other embodiments associated with identifying and selectively deleting duplicate search results are described. One example system embodiment includes logic to receive an identity indicator from a search logic. The identity indicator is associated with a search item that the search logic determines to be relevant to a search request. The example system may also include logic to determine whether the search result associated with the identity indicator is a duplicate result based on comparing the identity indicator to another identity indicator associated with another search result.

    摘要翻译: 描述与识别和选择性地删除重复搜索结果相关联的系统,方法和其他实施例。 一个示例系统实施例包括从搜索逻辑接收身份指示符的逻辑。 身份指示符与搜索项目相关联,搜索逻辑确定与搜索请求相关。 该示例系统还可以包括用于基于将身份指示符与与另一搜索结果相关联的另一身份指示符进行比较来确定与身份指示符相关联的搜索结果是否是重复结果的逻辑。

    DOCUMENT SUMMARIZATION
    9.
    发明申请
    DOCUMENT SUMMARIZATION 有权
    文件总结

    公开(公告)号:US20100268711A1

    公开(公告)日:2010-10-21

    申请号:US12829766

    申请日:2010-07-02

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30719

    摘要: Systems, methods, and other embodiments associated with automatically summarizing a document are described. One method embodiment includes computing term scores for members of a set of terms in a document to be summarized and computing sentence scores for sentences in a set of sentences in the document. The method embodiment also includes computing a set of entries for a term-sentence matrix that relates terms to sentences. The method embodiment also includes computing a dominant topic for the document and simultaneously ranking the set of terms and the set of sentences based on the dominant topic. The method embodiment provides a summarization item(s) selected from the set of terms and/or the set of sentences.

    摘要翻译: 描述与自动总结文档相关联的系统,方法和其他实施例。 一个方法实施例包括计算要汇总的文档中的一组术语的成员的术语分数,以及计算文档中一组句子中的句子的句子分数。 方法实施例还包括计算用于将术语与句子相关联的术语矩阵的条目集合。 该方法实施例还包括计算文档的主导主题,并且基于主题来同时对该组语句和一组句子进行排序。 该方法实施例提供从该组项和/或一组句子中选择的摘要项目。

    Extensible mechanism for detecting duplicate search items
    10.
    发明申请
    Extensible mechanism for detecting duplicate search items 有权
    用于检测重复搜索项的可扩展机制

    公开(公告)号:US20080222063A1

    公开(公告)日:2008-09-11

    申请号:US11714418

    申请日:2007-03-06

    IPC分类号: G06F15/18

    CPC分类号: H04L51/12

    摘要: Systems, methods, and other embodiments associated with identifying and selectively deleting duplicate search results are described. One example system embodiment includes logic to receive an identity indicator from a search logic. The identity indicator is associated with a search item that the search logic determines to be relevant to a search request. The example system may also include logic to determine whether the search result associated with the identity indicator is a duplicate result based on comparing the identity indicator to another identity indicator associated with another search result.

    摘要翻译: 描述与识别和选择性地删除重复搜索结果相关联的系统,方法和其他实施例。 一个示例系统实施例包括从搜索逻辑接收身份指示符的逻辑。 身份指示符与搜索项目相关联,搜索逻辑确定与搜索请求相关。 该示例系统还可以包括用于基于将身份指示符与与另一搜索结果相关联的另一身份指示符进行比较来确定与身份指示符相关联的搜索结果是否是重复结果的逻辑。