Identifying central entities
    2.
    发明授权
    Identifying central entities 有权
    识别中央实体

    公开(公告)号:US09009192B1

    公开(公告)日:2015-04-14

    申请号:US13153352

    申请日:2011-06-03

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/30958

    摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for identifying central entities. In one aspect, a method includes obtaining candidate entities for a first resource; filtering a first entity graph whose nodes represent different entities found in a plurality of resources to remove nodes that do not correspond to a candidate entity, wherein pairs of nodes in the filtered first entity graph that are connected by an edge correspond to pairs of candidate entities that are associated with the same resource; generating a second entity graph for the first resource from the filtered first entity graph, wherein the second entity graph does not include nodes from the filtered first entity graph that are not connected to other nodes in the filtered first graph; and identifying candidate entities that are represented by nodes in the second entity graph as being central entities for the first resource.

    摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的用于识别中央实体的计算机程序。 一方面,一种方法包括:获取第一资源的候选实体; 过滤其节点表示在多个资源中找到的不同实体的第一实体图,以去除不对应于候选实体的节点,其中由边缘连接的经过滤的第一实体图中的节点对对应于候选实体对 与相同的资源相关联; 从经滤波的第一实体图生成第一资源的第二实体图,其中第二实体图不包括经滤波的第一实体图中未经滤波的第一图中其他节点的节点; 以及将由所述第二实体图中的节点表示的候选实体识别为所述第一资源的中心实体。

    Methods and Apparatus for Assessing Web Page Decay

    公开(公告)号:US20080097988A1

    公开(公告)日:2008-04-24

    申请号:US11955458

    申请日:2007-12-13

    IPC分类号: G06F17/30

    CPC分类号: G06F17/3089

    摘要: Systems and methods are herein disclosed for assessing the staleness of a web page. In particular, in one method of the present invention, the staleness of a web page is assessed by examining internal date references within the web page. In another method of the present invention, the staleness of a web page is assessed by examining the meta-data associated with the web page. In a further method of the present invention, the staleness of a hyperlinked web page is determined by examining the link status of the hyperlinks. If the web page has a relatively large number of dead links, it is assessed as being a stale web page. In a still further method of the present invention, the link status of web pages in the neighborhood of the web page being assessed is likewise examined.

    Identifying topical entities
    4.
    发明授权

    公开(公告)号:US10068022B2

    公开(公告)日:2018-09-04

    申请号:US13153365

    申请日:2011-06-03

    IPC分类号: G06F17/30

    摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for identifying topical entities. In one aspect, a method includes obtaining a plurality of entities that are associated with a first resource; for one or more of the identified entities, receiving search results for a search query derived from the entity; determining that search results for a search query including a particular entity include a specific type of search results; and determining that the particular entity is a topical entity of the first resource based at least in part on the particular entity appearing in a title or a resource locator of the first resource, wherein the topical entity of the first resource represents a predominant topic of the first resource.

    ENRICHING SEARCH RESULTS
    6.
    发明申请
    ENRICHING SEARCH RESULTS 有权
    增加搜索结果

    公开(公告)号:US20120109941A1

    公开(公告)日:2012-05-03

    申请号:US13118026

    申请日:2011-05-27

    IPC分类号: G06F17/30

    摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing search results. In one aspect, a method includes identifying a plurality of registered publishers for enriched search results and, for each registered publisher, obtaining enrichment information from the registered publisher and associating the enrichment information with a resource provided by the publisher. A query is received. A plurality of responsive resources that are responsive to the query are identified. A first responsive resource is determined to be associated with enrichment information. An enriched search result is provided, the enriched search result identifying the first responsive resource and including the first responsive resource's associated enrichment information.

    摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于增强搜索结果。 一方面,一种方法包括识别用于丰富搜索结果的多个注册发布者,并且对于每个注册的发布者,从注册的发行者获取富集信息并将所述浓缩信息与由发布者提供的资源相关联。 接收到查询。 识别响应于查询的多个响应资源。 第一响应资源被确定为与浓缩信息相关联。 提供丰富的搜索结果,丰富的搜索结果识别第一响应资源并且包括第一响应资源的相关联的富集信息。

    Running XPath queries over XML streams with incremental predicate evaluation
    7.
    发明申请
    Running XPath queries over XML streams with incremental predicate evaluation 审中-公开
    使用增量谓词评估运行XPath查询XML流

    公开(公告)号:US20070250471A1

    公开(公告)日:2007-10-25

    申请号:US11380136

    申请日:2006-04-25

    IPC分类号: G06F17/30

    CPC分类号: G06F16/83

    摘要: A method that eagerly evaluates predicates of XPath queries over XML document nodes for a set of commonly known functions and operators (including arithmetic, general comparison, value comparison, Boolean operators, etc.) without materializing sequences is discussed. Such eager evaluation of predicates reduces the amount of buffer space required since evaluation sequences have to be buffered only partially during the predicate evaluation process. Document nodes to be selected by a query are determined earlier so that they can be outputted without buffering.

    摘要翻译: 讨论了一种通过XML文档节点对一组常见已知功能和运算符(包括算术,一般比较,值比较,布尔运算符等)的XPath查询的谓词进行评估的方法,而不实现序列。 这种对谓词的这种热切的评估减少了所需的缓冲空间的量,因为评估序列必须在谓词评估过程中仅部分缓冲。 更早地确定由查询选择的文档节点,使得它们可以不缓冲地输出。

    GENERATING ADDITIONAL CONTENT
    8.
    发明申请
    GENERATING ADDITIONAL CONTENT 审中-公开
    产生附加内容

    公开(公告)号:US20160026727A1

    公开(公告)日:2016-01-28

    申请号:US13153379

    申请日:2011-06-03

    IPC分类号: G06F17/30

    CPC分类号: G06F16/9535 G06F16/335

    摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating additional content. In one aspect, a method includes identifying one or more central entities, wherein each central entity represents a topic of a first resource being presented in a user interface; generating one or more search queries, each of the one or more search queries being derived from one or more of the central entities; obtaining search results for the one or more search queries from a search engine; selecting resources relevant to the first resource from resources referenced by the obtained search results; generating additional content for presentation in a user interface element of the user interface based on the selected resources; and categorizing the generated additional content into a plurality of categories, wherein each category of additional content is displayed in a separate portion of the user interface element.

    摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于产生附加内容。 一方面,一种方法包括识别一个或多个中央实体,其中每个中心实体表示呈现在用户界面中的第一资源的主题; 生成一个或多个搜索查询,所述一个或多个搜索查询中的每一个从一个或多个中央实体导出; 从搜索引擎获取所述一个或多个搜索查询的搜索结果; 从获得的搜索结果引用的资源中选择与第一资源相关的资源; 基于所选择的资源生成附加内容以呈现在所述用户界面的用户界面元素中; 以及将生成的附加内容分类为多个类别,其中每个类别的附加内容显示在用户界面元素的单独部分中。

    Enriching search results
    9.
    发明授权
    Enriching search results 有权
    丰富搜索结果

    公开(公告)号:US09208230B2

    公开(公告)日:2015-12-08

    申请号:US13118026

    申请日:2011-05-27

    IPC分类号: G06G7/00 G06F17/30

    摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing search results. In one aspect, a method includes identifying a plurality of registered publishers for enriched search results and, for each registered publisher, obtaining enrichment information from the registered publisher and associating the enrichment information with a resource provided by the publisher. A query is received. A plurality of responsive resources that are responsive to the query are identified. A first responsive resource is determined to be associated with enrichment information. An enriched search result is provided, the enriched search result identifying the first responsive resource and including the first responsive resource's associated enrichment information.

    摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于增强搜索结果。 一方面,一种方法包括识别用于丰富搜索结果的多个注册发布者,并且对于每个注册的发布者,从注册的发行者获取富集信息并将所述浓缩信息与由发布者提供的资源相关联。 接收到查询。 识别响应于查询的多个响应资源。 第一响应资源被确定为与浓缩信息相关联。 提供丰富的搜索结果,丰富的搜索结果识别第一响应资源并且包括第一响应资源的相关联的富集信息。

    Methods and apparatus for assessing web page decay
    10.
    发明授权
    Methods and apparatus for assessing web page decay 有权
    评估网页衰变的方法和设备

    公开(公告)号:US07818312B2

    公开(公告)日:2010-10-19

    申请号:US11955458

    申请日:2007-12-13

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/3089

    摘要: A signal-bearing medium is disclosed that includes operations including establishing a link threshold, wherein a web page will be assessed as lacking currency if a percentage of hyperlinks contained in the web page that link to an active page is less than the link threshold, accessing a web page containing hyperlinks, and testing the hyperlinks. Testing includes: selecting a hyperlink; and monitoring a number of redirects encountered by following the selected hyperlink until a final web page is reached or a failure occurs, and assessing the selected hyperlink as linking to a dead web page if a redirect limit is exceeded, the redirect limit greater than one, wherein exceeding the redirect limit causes occurrence of a failure. The operations also include calculating a percentage of hyperlinks that return active web pages, and comparing the percentage of hyperlinks that return active web pages with the link threshold.

    摘要翻译: 公开了一种信号承载介质,其包括建立链路阈值的操作,其中如果链接到活动页面的网页中包含的超链接的百分比小于链路阈值,则访问网页将被评估为缺少货币 包含超链接的网页,并测试超链接。 测试包括:选择超链接; 以及监视通过遵循所选择的超链接而遇到的多个重定向,直到达到最终网页或发生故障,并且如果超过了重定向限制,则将所选超链接评估为链接到死网页,重定向限制大于1, 其中超过重定向限制导致发生故障。 这些操作还包括计算返回活动网页的超链接的百分比,并将返回活动网页的超链接的百分比与链接阈值进行比较。