81. Method ranking search results using biased click distance
    Type: Patent application
    Status: In force

    Publication No.: US20070038622A1

    Publication date: 2007-02-15

    Application No.: US11206286

    Filing date: 2005-08-15

    IPC classification: G06F17/30

    Abstract: Methods of providing a document relevance score to a document on a network are disclosed. Computer readable medium having stored thereon computer-executable instructions for performing a method of providing a document relevance score to a document on a network are also disclosed. Further, computing systems containing at least one application module, wherein the at least one application module comprises application code for performing methods of providing a document relevance score to a document on a network are disclosed.

    82. System and method for intelligent deletion of crawled documents from an index
    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20060161591A1

    Publication date: 2006-07-20

    Application No.: US11036412

    Filing date: 2005-01-14

    IPC classification: G06F17/30

    CPC classification: G06F16/951

    Abstract: Documents are intelligently deleted from an index of crawled documents based on link and parent node information recorded from the crawl. A document visited during a first crawl may not be navigated to during a second crawl because of an error and the present invention verifies whether the document has been deleted. The present invention also prevents the document from being deleted when it is referenced by another document, indicating that the document is still a valid document.
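
The deletion decision outlined in this abstract can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not the patented method: the function `should_delete` and its inputs (`reached_in_latest_crawl`, `confirmed_deleted`, `inbound_links`) are hypothetical names, and the abstract does not say how a deletion is verified.

```python
# Hypothetical sketch; all names are assumptions made for illustration.
def should_delete(doc_url: str,
                  reached_in_latest_crawl: set[str],
                  confirmed_deleted: set[str],
                  inbound_links: dict[str, set[str]]) -> bool:
    """Decide whether an indexed document may be dropped after a re-crawl."""
    if doc_url in reached_in_latest_crawl:
        return False  # the crawler navigated to it again, so keep it
    if inbound_links.get(doc_url):
        return False  # another document still references it, so treat it as valid
    # Not reached and not referenced: delete only if the deletion was verified.
    return doc_url in confirmed_deleted


links = {"/a": {"/index"}, "/b": set()}
referenced = should_delete("/a", {"/index"}, {"/a"}, links)
orphaned = should_delete("/b", {"/index"}, {"/b"}, links)
unverified = should_delete("/c", {"/index"}, set(), links)
```

In this sketch a document that was merely missed because of a crawl error stays in the index unless its deletion has been confirmed separately.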

    83. Synchronizing crawler with notification source
    Type: Granted patent
    Status: Lapsed

    Publication No.: US06424966B1

    Publication date: 2002-07-23

    Application No.: US09107227

    Filing date: 1998-06-30

    IPC classification: G06F17/30

    Abstract: A method and system for the processing and maintenance of electronic information retrieved from electronic documents stored on a computer network. The gatherer program of the present invention employs a crawler to crawl a portion of the computer network to retrieve electronic documents found during the crawl and that meet a set of crawl restriction rules. Some or all of the data contained in the copies of electronic documents is then stored in a data store such as an index. The invention keeps the data in the data store current by accepting notifications of when a previously retrieved document has changed. The notifications are sent by a notification source that monitors a space containing the previously retrieved documents for changes occurring after the document was last retrieved by the gatherer program. Because the document is being monitored for changes by the notification source, the gatherer program only needs to retrieve the document again when the gatherer program has been notified that the document has changed. If the notification source experiences a discontinuity, such as a system shutdown, the notification source requests that the gatherer perform an initialization crawl to retrieve any documents that changed while the notification source was not operational.

    84. Document length as a static relevance feature for ranking search results
    Type: Granted patent
    Status: In force

    Publication No.: US09348912B2

    Publication date: 2016-05-24

    Application No.: US12207910

    Filing date: 2008-09-10

    IPC classification: G06F17/30 G06F7/00

    CPC classification: G06F17/30864

    Abstract: Embodiments are configured to provide information based on a user query. In an embodiment, a system includes a search component having a ranking component that can be used to rank search results as part of a query response. In one embodiment, the ranking component includes a ranking algorithm that can use the length of documents returned in response to a search query to rank search results.

    85. User pipeline configuration for rule-based query transformation, generation and result display
    Type: Granted patent
    Status: In force

    Publication No.: US09177022B2

    Publication date: 2015-11-03

    Application No.: US13287717

    Filing date: 2011-11-02

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30448

    Abstract: A query pipeline for an enterprise search system is configurable by a user of the system. A user may create rules for custom query transformation and parallel query generation, federation of queries, mixing of results and application of display layouts to the received search results. A user interface (UI) assists a user in configuring the search pipeline. For example, a user may enter condition action rules for queries that affect how a query is transformed, how parallel queries are generated, how queries are federated, how search results are ranked and displayed, how rules are ordered and the like.

    86. Ranking documents with social tags
    Type: Granted patent
    Status: In force

    Publication No.: US08914359B2

    Publication date: 2014-12-16

    Application No.: US12345664

    Filing date: 2008-12-30

    IPC classification: G06G7/00 G06F17/30

    Abstract: Technologies are described herein for ranking documents with social tags. A number ranking feature containing a number of times a document was tagged is received. A textual property ranking feature containing a union of each social tag associated with the document is also received. The number ranking feature is transformed into a static input value. Further, the textual property ranking feature is transformed into a dynamic input value. A document rank for the document is determined by inputting the static input value and/or the dynamic input value into a ranking function.

    87. RE-RANKING SEARCH RESULTS
    Type: Patent application
    Status: In force

    Publication No.: US20130198174A1

    Publication date: 2013-08-01

    Application No.: US13360536

    Filing date: 2012-01-27

    IPC classification: G06F17/30

    CPC classification: G06F17/30867

    Abstract: Search results obtained from a ranking model are re-ranked based on user-configured ranking rules. For example, a user may desire to: place certain search results at a top/bottom of a ranking of search results; remove some search results; and/or adjust a ranking of some of the search results. A Graphical User Interface (GUI) allows a user to configure the ranking rules (e.g. enter key/value restrictions and to set a boost value) and to preview an application of one or more of the ranking rules. Query language operators that follow a standard operator syntax are created based on the inputs (e.g. a ranking query operator is created that may include multiple user supplied parameters). The user may also specify a portion of the results from which statistics (e.g. standard deviation, average score) are calculated. For example, a user may specify to calculate statistics for the top N number results.

    88. USING POPULAR QUERIES TO DECIDE WHEN TO FEDERATE QUERIES
    Type: Patent application
    Status: In force

    Publication No.: US20130191371A1

    Publication date: 2013-07-25

    Application No.: US13355290

    Filing date: 2012-01-20

    IPC classification: G06F17/30

    CPC classification: G06F17/30864

    Abstract: A query received from a user is directed to a particular search application (e.g. an Enterprise search portal) that is associated with a result source from which to retrieve results. The received query may be federated to additional result sources when the received query is determined to be a popular query in a result source. Query logs associated with the additional result sources are analyzed to determine when a query is popular as compared to the original result source. The query may be altered before being executed that uses one or more of the additional result sources. When the query (altered/unaltered) is determined to be popular for any of the additional result sources as compared to the original result source, the query is executed using that additional result source.

    89. Generating search result summaries
    Type: Granted patent
    Status: In force

    Publication No.: US08285699B2

    Publication date: 2012-10-09

    Application No.: US13220929

    Filing date: 2011-08-30

    IPC classification: G06F17/30

    CPC classification: G06F17/30867 G06F17/30719

    Abstract: Embodiments are configured to provide a summary of information associated with one or more search results. In an embodiment, a system includes a summary generator that can be configured to provide a summary of information including one or more snippets associated with a search term or search terms. The system includes a ranking component that can be used to rank snippets and the ranked snippets can be used when generating a summary that includes one or more ranked snippets. In one embodiment, the system can be configured to include one or more filters that can be used to filter snippets and the filtered snippets can be used when generating a summary. Other embodiments are available.

    90. Significant change search alerts
    Type: Granted patent
    Status: In force

    Publication No.: US08108388B2

    Publication date: 2012-01-31

    Application No.: US11412725

    Filing date: 2006-04-26

    IPC classification: G06F17/30

    CPC classification: G06F17/30699

    Abstract: An alert search mechanism is used with search engines such as a crawler to search for desired documents and/or resources. Particular documents are found by using search queries. The search mechanism tracks the values of a set of relevant properties in queries. Whenever a document is searched for by the system, the values of this set of properties are matched against the old values. If there is no match, this is an indication that the document has changed.
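
A minimal sketch of the property comparison this abstract describes, assuming the tracked property values are stored per document as a dictionary. The names `TRACKED_PROPERTIES`, `snapshot`, and `has_changed`, as well as the example properties, are hypothetical and not taken from the patent.

```python
# Hypothetical illustration; property names and helpers are assumptions.
TRACKED_PROPERTIES = ("title", "author", "size")


def snapshot(document: dict) -> dict:
    """Keep only the tracked properties of a document."""
    return {name: document.get(name) for name in TRACKED_PROPERTIES}


def has_changed(old_snapshot: dict, document: dict) -> bool:
    """A mismatch between stored and current tracked values signals a change."""
    return snapshot(document) != old_snapshot


stored = snapshot({"title": "Q3 report", "author": "kim", "size": 1024, "views": 7})
# Only an untracked property ("views") differs, so no alert would be raised.
unchanged = has_changed(stored, {"title": "Q3 report", "author": "kim", "size": 1024, "views": 99})
# A tracked property ("size") differs, so the document counts as changed.
changed = has_changed(stored, {"title": "Q3 report", "author": "kim", "size": 2048})
```

Restricting the comparison to a chosen set of properties is what lets changes to untracked fields pass without being flagged.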
