Extensible mechanism for detecting duplicate search items
    1.
    发明申请
    Extensible mechanism for detecting duplicate search items 有权
    用于检测重复搜索项的可扩展机制

    公开(公告)号:US20080222063A1

    公开(公告)日:2008-09-11

    申请号:US11714418

    申请日:2007-03-06

    IPC分类号: G06F15/18

    CPC分类号: H04L51/12

    摘要: Systems, methods, and other embodiments associated with identifying and selectively deleting duplicate search results are described. One example system embodiment includes logic to receive an identity indicator from a search logic. The identity indicator is associated with a search item that the search logic determines to be relevant to a search request. The example system may also include logic to determine whether the search result associated with the identity indicator is a duplicate result based on comparing the identity indicator to another identity indicator associated with another search result.

    摘要翻译: 描述与识别和选择性地删除重复搜索结果相关联的系统,方法和其他实施例。 一个示例系统实施例包括从搜索逻辑接收身份指示符的逻辑。 身份指示符与搜索项目相关联,搜索逻辑确定与搜索请求相关。 该示例系统还可以包括用于基于将身份指示符与与另一搜索结果相关联的另一身份指示符进行比较来确定与身份指示符相关联的搜索结果是否是重复结果的逻辑。

    Index replication using crawl modification information
    2.
    发明授权
    Index replication using crawl modification information 有权
    索引复制使用爬网修改信息

    公开(公告)号:US07945533B2

    公开(公告)日:2011-05-17

    申请号:US11710100

    申请日:2007-02-23

    IPC分类号: G06F7/00 G06F15/16

    CPC分类号: G06F17/30864

    摘要: Systems, methodologies, media, and other embodiments associated with index replication using crawl modification information are described. One exemplary system embodiment includes an enterprise search system comprising a target search system comprising an index logic that uses modified crawl information related to items associated with sources to maintain an index that supports searching of the items; and, a crawl search system comprising a pipeline processor configured to receive modified crawl information related to the items and to propagate the modified crawl information to the target system.

    摘要翻译: 描述了使用爬行修改信息与索引复制相关联的系统,方法,媒体和其他实施例。 一个示例性系统实施例包括企业搜索系统,其包括目标搜索系统,该目标搜索系统包括索引逻辑,该索引逻辑使用与源相关联的项目相关的修改的爬网信息来维护支持搜索项的索引; 以及爬行搜索系统,包括流水线处理器,其被配置为接收与所述项目相关的经修改的爬网信息,并将修改的抓取信息传播到所述目标系统。

    Extensible mechanism for grouping search results
    3.
    发明申请
    Extensible mechanism for grouping search results 有权
    用于分组搜索结果的可扩展机制

    公开(公告)号:US20090100039A1

    公开(公告)日:2009-04-16

    申请号:US11974085

    申请日:2007-10-11

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864

    摘要: Systems, methods, and other embodiments associated with grouping automated search results are described. One embodiment includes a computer-readable medium storing computer-executable instructions operable to perform a method that includes identifying items to group. The method also includes selectively grouping a first item and a second item upon determining that a comparison of a metadata attributes indicates that the first item and the second item are to be treated as members of a group.

    摘要翻译: 描述了与分组自动搜索结果相关联的系统,方法和其他实施例。 一个实施例包括存储可操作以执行包括识别要分组的项目的方法的计算机可执行指令的计算机可读介质。 该方法还包括在确定元数据属性的比较指示将第一项目和第二个项目视为组的成员时,选择性地对第一项目和第二项目进行分组。

    Index replication using crawl modification information
    4.
    发明申请
    Index replication using crawl modification information 有权
    索引复制使用爬网修改信息

    公开(公告)号:US20070208716A1

    公开(公告)日:2007-09-06

    申请号:US11710100

    申请日:2007-02-23

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864

    摘要: Systems, methodologies, media, and other embodiments associated with index replication using crawl modification information are described. One exemplary system embodiment includes an enterprise search system comprising a target search system comprising an index logic that uses modified crawl information related to items associated with sources to maintain an index that supports searching of the items; and, a crawl search system comprising a pipeline processor configured to receive modified crawl information related to the items and to propagate the modified crawl information to the target system.

    摘要翻译: 描述了使用爬行修改信息与索引复制相关联的系统,方法,媒体和其他实施例。 一个示例性系统实施例包括企业搜索系统,其包括目标搜索系统,该目标搜索系统包括索引逻辑,该索引逻辑使用与源相关联的项目相关的修改的爬网信息来维护支持搜索项的索引; 以及爬行搜索系统,包括流水线处理器,其被配置为接收与所述项目相关的经修改的爬网信息,并将修改的抓取信息传播到所述目标系统。

    Extensible mechanism for grouping search results
    5.
    发明授权
    Extensible mechanism for grouping search results 有权
    用于分组搜索结果的可扩展机制

    公开(公告)号:US08271493B2

    公开(公告)日:2012-09-18

    申请号:US11974085

    申请日:2007-10-11

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864

    摘要: Systems, methods, and other embodiments associated with grouping automated search results are described. One embodiment includes a computer-readable medium storing computer-executable instructions operable to perform a method that includes identifying items to group. The method also includes selectively grouping a first item and a second item upon determining that a comparison of a metadata attributes indicates that the first item and the second item are to be treated as members of a group.

    摘要翻译: 描述了与分组自动搜索结果相关联的系统,方法和其他实施例。 一个实施例包括存储可操作以执行包括识别要分组的项目的方法的计算机可执行指令的计算机可读介质。 该方法还包括在确定元数据属性的比较指示将第一项目和第二个项目视为组的成员时,选择性地对第一项目和第二项目进行分组。

    Extensible mechanism for detecting duplicate search items
    6.
    发明授权
    Extensible mechanism for detecting duplicate search items 有权
    用于检测重复搜索项的可扩展机制

    公开(公告)号:US07756798B2

    公开(公告)日:2010-07-13

    申请号:US11714418

    申请日:2007-03-06

    IPC分类号: G06N5/00

    CPC分类号: H04L51/12

    摘要: Systems, methods, and other embodiments associated with identifying and selectively deleting duplicate search results are described. One example system embodiment includes logic to receive an identity indicator from a search logic. The identity indicator is associated with a search item that the search logic determines to be relevant to a search request. The example system may also include logic to determine whether the search result associated with the identity indicator is a duplicate result based on comparing the identity indicator to another identity indicator associated with another search result.

    摘要翻译: 描述与识别和选择性地删除重复搜索结果相关联的系统,方法和其他实施例。 一个示例系统实施例包括从搜索逻辑接收身份指示符的逻辑。 身份指示符与搜索项目相关联,搜索逻辑确定与搜索请求相关。 该示例系统还可以包括用于基于将身份指示符与与另一搜索结果相关联的另一身份指示符进行比较来确定与身份指示符相关联的搜索结果是否是重复结果的逻辑。

    Indexing secure enterprise documents using generic references
    7.
    发明授权
    Indexing secure enterprise documents using generic references 有权
    使用通用引用索引安全企业文档

    公开(公告)号:US08626794B2

    公开(公告)日:2014-01-07

    申请号:US13539622

    申请日:2012-07-02

    IPC分类号: G06F17/30

    摘要: A web crawler indexes documents including information about document contents and metadata including information such as a URL. However, some applications rely on URL's that change frequently or are constructed to include user information so that the contents retrieved is customized to the user. An approach is provided for storing generic URL's in an index at crawl time, which are customized for the user at search time. A callback mechanism may be used to dynamically transform the generic URL into a URL that is specific to the user issuing the query and/or includes current information that may change frequently. In this way, when the query or search results are returned to the user, the user receives links that are active and valid for that particular user, directing the user to the appropriate site, application, etc. without requiring continuous updating of a very large index.

    摘要翻译: 网页抓取工具索引文档,包括有关文档内容和元数据的信息,包括诸如URL之类的信息。 然而,一些应用程序依赖于频​​繁更改的URL或被构造为包括用户信息,以便检索到的内容是为用户定制的。 提供了一种方法,用于将通用URL存储在抓取时间的索引中,这是在搜索时为用户定制的。 可以使用回调机制来动态地将通用URL变换成特定于发布查询的用户的URL和/或包括可能频繁变化的当前信息。 以这种方式,当查询或搜索结果被返回给用户时,用户接收对该特定用户有效且有效的链接,将用户引导到适当的站点,应用等,而不需要持续更新非常大的 指数。

    PROPAGATING USER IDENTITIES IN A SECURE FEDERATED SEARCH SYSTEM
    8.
    发明申请
    PROPAGATING USER IDENTITIES IN A SECURE FEDERATED SEARCH SYSTEM 有权
    在安全的联合搜索系统中传播用户标识

    公开(公告)号:US20120278303A1

    公开(公告)日:2012-11-01

    申请号:US13483958

    申请日:2012-05-30

    IPC分类号: G06F17/30

    摘要: A flexible and extensible architecture allows for secure searching across an enterprise. Such an architecture can provide a simple Internet-like search experience to users searching secure content inside (and outside) the enterprise. The architecture allows for the crawling and searching of a variety or sources across an enterprise, regardless of whether any of these sources conform to a conventional user role model. The architecture further allows for security attributes to be submitted at query time, for example, in order to provide real-time secure access to enterprise resources. The user query also can be transformed to provide for dynamic querying that provides for a more current result list than can be obtained for static queries.

    摘要翻译: 灵活可扩展的架构允许跨企业进行安全搜索。 这样的架构可以为在企业内部(和外部)搜索安全内容的用户提供简单的类似Internet的搜索体验。 该架构允许在整个企业中爬行和搜索各种或多个源,无论这些源是否符合常规用户角色模型。 该体系结构进一步允许在查询时提交安全属性,例如为了提供对企业资源的实时安全访问。 用户查询也可以被转换以提供动态查询,其提供比静态查询可获得的更多当前结果列表。

    Re-ranking search results from an enterprise system
    9.
    发明授权
    Re-ranking search results from an enterprise system 有权
    从企业系统重新排列搜索结果

    公开(公告)号:US07970791B2

    公开(公告)日:2011-06-28

    申请号:US12751268

    申请日:2010-03-31

    IPC分类号: G06F17/30

    摘要: A flexible and extensible architecture allows for secure searching across an enterprise. Such an architecture can provide a simple Internet-like search experience to users searching secure content inside (and outside) the enterprise. The architecture allows for the crawling and searching of a variety of sources across an enterprise, regardless of whether any of these sources conform to a conventional user role model. The architecture further allows for security, recency, or other attributes to be submitted at query time, for example, in order to re-rank query results from enterprise resources. The user query also can be transformed to provide for dynamic querying that provides for a more current result list than can be obtained for static queries.

    摘要翻译: 灵活可扩展的架构允许跨企业进行安全搜索。 这样的架构可以为在企业内部(和外部)搜索安全内容的用户提供简单的类似Internet的搜索体验。 该架构允许在整个企业中爬行和搜索各种源,而不管这些源是否符合常规用户角色模型。 该体系结构还允许在查询时提交安全性,新近度或其他属性,例如,以便从企业资源重新排列查询结果。 用户查询也可以被转换以提供动态查询,其提供比静态查询可获得的更多当前结果列表。

    DOCUMENT DATE AS A RANKING FACTOR FOR CRAWLING
    10.
    发明申请
    DOCUMENT DATE AS A RANKING FACTOR FOR CRAWLING 有权
    文件日期作为破坏的一个排名因素

    公开(公告)号:US20070250486A1

    公开(公告)日:2007-10-25

    申请号:US11737091

    申请日:2007-04-18

    IPC分类号: G06F17/30

    摘要: A flexible and extensible architecture allows for secure searching across an enterprise. Such an architecture can provide a simple Internet-like search experience to users searching secure content inside (and outside) the enterprise. The architecture allows for the crawling and searching of a variety or sources across an enterprise, regardless of whether any of these sources conform to a conventional user role model. The architecture further allows for security attributes to be submitted at query time, for example, in order to provide real-time secure access to enterprise resources. The user query also can be transformed to provide for dynamic querying that provides for a more current result list than can be obtained for static queries.

    摘要翻译: 灵活可扩展的架构允许跨企业进行安全搜索。 这样的架构可以为在企业内部(和外部)搜索安全内容的用户提供简单的类似Internet的搜索体验。 该架构允许在整个企业中爬行和搜索各种或多个源,无论这些源是否符合常规用户角色模型。 该体系结构进一步允许在查询时提交安全属性,例如为了提供对企业资源的实时安全访问。 用户查询也可以被转换以提供动态查询,其提供比静态查询可获得的更多当前结果列表。