Determining relevance of documents to a query based on identifier distance
    1.
    Granted patent
    Status: in force

    Publication No.: US07630964B2

    Publication date: 2009-12-08

    Application No.: US11273624

    Filing date: 2005-11-14

    IPC classification: G06F7/00 G06F17/30

    Abstract: A method and system for determining relevance of a document to a query based on identifier match distance is provided. The relevance system analyzes a training set of queries and documents to determine the relationship between identifier match distance and relevance of a document to a query. The identifier match distance indicates the distance from the end of an identifier of a document to an identifier term that matches a query term. The relevance system generates a prior relevance probability that a document with a certain identifier match distance is relevant to a query. The relevance system uses the prior relevance probabilities to determine relevance of documents to queries based on identifier match distance.
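The abstract above can be illustrated with a minimal sketch. It assumes the identifier is a URL-like string, that terms are obtained by splitting on non-alphanumeric characters, and that distance is counted in terms from the end of the identifier; none of these details are stated in the abstract, and the function names are hypothetical.

```python
import re
from collections import defaultdict

def identifier_terms(identifier):
    """Split an identifier (assumed URL-like) into lowercase alphanumeric terms."""
    return [t for t in re.split(r"[^0-9A-Za-z]+", identifier.lower()) if t]

def match_distance(identifier, query_term):
    """Count terms between the end of the identifier and the last matching term.

    Returns None when no identifier term matches the query term.
    """
    target = query_term.lower()
    for distance, term in enumerate(reversed(identifier_terms(identifier))):
        if term == target:
            return distance
    return None

def learn_priors(training):
    """Estimate P(relevant | distance) from (identifier, query_term, is_relevant) triples."""
    counts = defaultdict(lambda: [0, 0])  # distance -> [relevant count, total count]
    for identifier, term, is_relevant in training:
        d = match_distance(identifier, term)
        if d is None:
            continue  # no match, so this example says nothing about distance
        counts[d][0] += int(is_relevant)
        counts[d][1] += 1
    return {d: rel / total for d, (rel, total) in counts.items()}

# Hypothetical training data, for illustration only.
training = [
    ("a.com/football", "football", True),
    ("b.com/football", "football", True),
    ("c.com/football/x/y", "football", False),
    ("d.com/football/x/y", "football", True),
]
priors = learn_priors(training)
```

At query time, a ranker could look up `priors[match_distance(identifier, term)]` and combine it with other relevance signals; how the patent combines them is not described in the abstract.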

    Determining relevance of documents to a query based on identifier distance
    2.
    Patent application
    Status: in force

    Publication No.: US20070112734A1

    Publication date: 2007-05-17

    Application No.: US11273624

    Filing date: 2005-11-14

    IPC classification: G06F17/30

    Abstract: A method and system for determining relevance of a document to a query based on identifier match distance is provided. The relevance system analyzes a training set of queries and documents to determine the relationship between identifier match distance and relevance of a document to a query. The identifier match distance indicates the distance from the end of an identifier of a document to an identifier term that matches a query term. The relevance system generates a prior relevance probability that a document with a certain identifier match distance is relevant to a query. The relevance system uses the prior relevance probabilities to determine relevance of documents to queries based on identifier match distance.

    Determining relevance of a document to a query based on spans of query terms
    5.
    Granted patent
    Status: in force

    Publication No.: US07480652B2

    Publication date: 2009-01-20

    Application No.: US11259621

    Filing date: 2005-10-26

    IPC classification: G06F17/30 G06F15/16

    Abstract: A relevance system determines the relevance of a query term to a document based on spans within the document that contain the query term. The relevance system aggregates the relevance of the query terms into an overall relevance for the document. For each query term, the relevance system calculates a span relevance for each span that contains that query term. The relevance system then aggregates the span relevances for a query term into a query term relevance for that document. The relevance system may aggregate the query term relevances into a document relevance.
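The three aggregation levels in the abstract above (span, query term, document) can be sketched as follows. The abstract does not define how spans are found or scored, so everything concrete here is an assumption: spans are runs of query-term hits separated by at most `max_gap` tokens, a span's relevance is the fraction of query terms it covers, and both aggregations are plain sums.

```python
def find_spans(tokens, query_terms, max_gap=3):
    """Group positions of query-term hits into spans (assumed rule: gap <= max_gap)."""
    spans = []
    for pos, tok in enumerate(tokens):
        if tok not in query_terms:
            continue
        if spans and pos - spans[-1][-1] <= max_gap:
            spans[-1].append(pos)   # close enough: extend the current span
        else:
            spans.append([pos])     # too far: start a new span
    return spans

def span_relevance(tokens, span, query_terms):
    """Hypothetical score: fraction of the query terms that occur in the span."""
    return len({tokens[i] for i in span}) / len(query_terms)

def term_relevance(tokens, spans, query_terms, term):
    """Sum the span relevances of every span that contains the given term."""
    return sum(
        span_relevance(tokens, span, query_terms)
        for span in spans
        if any(tokens[i] == term for i in span)
    )

def document_relevance(tokens, query_terms, max_gap=3):
    """Sum the per-term relevances into one document score."""
    spans = find_spans(tokens, query_terms, max_gap)
    return sum(term_relevance(tokens, spans, query_terms, t) for t in query_terms)

# Illustrative document and query.
tokens = ["search", "the", "engine", "a", "b", "c", "d", "e", "search"]
query = {"search", "engine"}
spans = find_spans(tokens, query)
score = document_relevance(tokens, query)
```

Swapping the sums for another aggregate, such as a maximum or a saturating function, would change only `term_relevance` and `document_relevance`.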

    Method and system for calculating importance of a block within a display page
    6.
    Granted patent
    Status: lapsed

    Publication No.: US08095478B2

    Publication date: 2012-01-10

    Application No.: US12101109

    Filing date: 2008-04-10

    IPC classification: G06F17/00 G06F17/20

    Abstract: A method and system for identifying the importance of information areas of a display page. An importance system identifies information areas or blocks of a web page. A block of a web page represents an area of the web page that appears to relate to a similar topic. The importance system provides the characteristics or features of a block to an importance function that generates an indication of the importance of that block to its web page. The importance system “learns” the importance function by generating a model based on the features of blocks and the user-specified importance of those blocks. To learn the importance function, the importance system asks users to provide an indication of the importance of blocks of web pages in a collection of web pages.
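Learning an importance function from block features and user-assigned importance, as the abstract above describes, can be sketched with a small regression. The abstract does not name a model or a feature set, so this example assumes a linear model trained by stochastic gradient descent and two made-up features (relative block area and closeness to the page centre); the labels are hypothetical user ratings.

```python
def train_importance(samples, epochs=2000, lr=0.1):
    """Fit weights w and bias b so that w.x + b approximates the labelled importance."""
    w = [0.0] * len(samples[0][0])
    b = 0.0
    for _ in range(epochs):
        for x, y in samples:
            # Prediction error for this labelled block.
            err = sum(wi * xi for wi, xi in zip(w, x)) + b - y
            # Gradient step on the squared error.
            w = [wi - lr * err * xi for wi, xi in zip(w, x)]
            b -= lr * err
    return w, b

def importance(model, x):
    """Apply the learned importance function to a block's feature vector."""
    w, b = model
    return sum(wi * xi for wi, xi in zip(w, x)) + b

# Hypothetical labelled blocks: (relative area, closeness to centre) -> user rating.
samples = [
    ((0.60, 0.9), 4.0),   # large central block, rated most important
    ((0.25, 0.5), 2.0),
    ((0.10, 0.2), 1.0),
    ((0.05, 0.1), 1.0),   # small peripheral block, rated least important
]
model = train_importance(samples)
main_block = importance(model, (0.50, 0.8))
footer_block = importance(model, (0.08, 0.15))
```

With these labels the learned function should rate a large central block above a small peripheral one; any other regressor or classifier could be substituted for the linear model.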

    METHOD AND SYSTEM FOR CALCULATING IMPORTANCE OF A BLOCK WITHIN A DISPLAY PAGE
    7.
    Patent application
    Status: lapsed

    Publication No.: US20080256068A1

    Publication date: 2008-10-16

    Application No.: US12101109

    Filing date: 2008-04-10

    IPC classification: G06F7/00

    Abstract: A method and system for identifying the importance of information areas of a display page. An importance system identifies information areas or blocks of a web page. A block of a web page represents an area of the web page that appears to relate to a similar topic. The importance system provides the characteristics or features of a block to an importance function that generates an indication of the importance of that block to its web page. The importance system “learns” the importance function by generating a model based on the features of blocks and the user-specified importance of those blocks. To learn the importance function, the importance system asks users to provide an indication of the importance of blocks of web pages in a collection of web pages.

    Method and system for calculating importance of a block within a display page
    8.
    Granted patent
    Status: lapsed

    Publication No.: US07363279B2

    Publication date: 2008-04-22

    Application No.: US10834639

    Filing date: 2004-04-29

    Abstract: A method and system for identifying the importance of information areas of a display page. An importance system identifies information areas or blocks of a web page. A block of a web page represents an area of the web page that appears to relate to a similar topic. The importance system provides the characteristics or features of a block to an importance function that generates an indication of the importance of that block to its web page. The importance system “learns” the importance function by generating a model based on the features of blocks and the user-specified importance of those blocks. To learn the importance function, the importance system asks users to provide an indication of the importance of blocks of web pages in a collection of web pages.

    Determining relevance of a document to a query based on spans of query terms
    9.
    Patent application
    Status: in force

    Publication No.: US20070094234A1

    Publication date: 2007-04-26

    Application No.: US11259621

    Filing date: 2005-10-26

    IPC classification: G06F17/30

    Abstract: A relevance system determines the relevance of a query term to a document based on spans within the document that contain the query term. The relevance system aggregates the relevance of the query terms into an overall relevance for the document. For each query term, the relevance system calculates a span relevance for each span that contains that query term. The relevance system then aggregates the span relevances for a query term into a query term relevance for that document. The relevance system may aggregate the query term relevances into a document relevance.

    Method and system for calculating importance of a block within a display page
    10.
    Patent application
    Status: lapsed

    Publication No.: US20050246296A1

    Publication date: 2005-11-03

    Application No.: US10834639

    Filing date: 2004-04-29

    Abstract: A method and system for identifying the importance of information areas of a display page. An importance system identifies information areas or blocks of a web page. A block of a web page represents an area of the web page that appears to relate to a similar topic. The importance system provides the characteristics or features of a block to an importance function that generates an indication of the importance of that block to its web page. The importance system “learns” the importance function by generating a model based on the features of blocks and the user-specified importance of those blocks. To learn the importance function, the importance system asks users to provide an indication of the importance of blocks of web pages in a collection of web pages.
