Identification of semantic units from within a search query
    31.
    发明授权
    Identification of semantic units from within a search query 有权
    从搜索查询中识别语义单位

    公开(公告)号:US07249121B1

    公开(公告)日:2007-07-24

    申请号:US09729240

    申请日:2000-12-05

    IPC分类号: G06F17/30 G06F15/16

    摘要: A search engine for searching a corpus improves the relevancy of the results by classifying multiple terms in a search query as a single semantic unit. A semantic unit locator of the search engine generates a subset of documents that are generally relevant to the query based on the individual terms within the query. Combinations of search terms that define potential semantic units from the query are then evaluated against the subset of documents to determine which combinations of search terms should be classified as a semantic unit. The resultant semantic units are used to refine the results of the search.

    摘要翻译: 用于搜索语料库的搜索引擎通过将搜索查询中的多个项目分类为单个语义单元来提高结果的相关性。 搜索引擎的语义单元定位器基于查询中的各个术语生成通常与查询相关的文档的子集。 然后根据文档子集来评估从查询定义潜在语义单元的搜索项的组合,以确定搜索词的哪些组合应该被分类为语义单元。 所得到的语义单位用于细化搜索结果。

    Methods and apparatus for ranking documents
    32.
    发明授权
    Methods and apparatus for ranking documents 有权
    文件排序方法和装置

    公开(公告)号:US08843479B1

    公开(公告)日:2014-09-23

    申请号:US13299825

    申请日:2011-11-18

    IPC分类号: G06F7/00 G06F17/30

    摘要: Methods and apparatus are described for scoring documents in response, in part, to parameters related to the document, source, and/or cluster score. Methods and apparatus are also described for scoring a cluster in response, in part, to parameters related to documents within the cluster and/or sources corresponding to the documents within the cluster. In one embodiment, the invention may detect at least one document within the cluster; analyze a parameter corresponding to the document; and compute a cluster score based, in part, on the parameter, wherein the cluster score corresponds with at least one document within the cluster.

    摘要翻译: 描述了对文档进行评分的方法和装置,部分地响应于与文档,源和/或聚类分数相关的参数。 还描述了用于对集群进行评分的方法和装置,部分地涉及与集群内的文档相关的参数和/或对应于集群内的文档的源。 在一个实施例中,本发明可以检测群集内的至少一个文档; 分析与文档相对应的参数; 并且部分地基于所述参数来计算聚类分数,其中所述聚类分数对应于所述聚类内的至少一个文档。

    Image selection for news search
    33.
    发明授权
    Image selection for news search 有权
    新闻搜索的图像选择

    公开(公告)号:US08775436B1

    公开(公告)日:2014-07-08

    申请号:US12195167

    申请日:2008-08-20

    IPC分类号: G06F17/30

    摘要: A system identifies a first document that includes a number of first images, identifies a second document that includes a number of second images, and forms a cluster based on a relationship between the first document and the second document. The system identifies a first caption associated with one of the first images, identifies a second caption associated with one of the second images, selects the one of the first images or the one of the second images as a representative image for the cluster based on the first caption or the second caption, and associates the representative image with the cluster.

    摘要翻译: 系统识别包括多个第一图像的第一文档,识别包括多个第二图像的第二文档,并且基于第一文档和第二文档之间的关系形成集群。 系统识别与第一图像之一相关联的第一字幕,识别与第二图像之一相关联的第二字幕,基于所述第一图像选择第一图像中的一个或第二图像中的一个作为群集的代表图像 第一个标题或第二个标题,并将代表图像与群集相关联。

    Serving advertisements using user request information and user information
    34.
    发明授权
    Serving advertisements using user request information and user information 有权
    使用用户请求信息和用户信息提供广告

    公开(公告)号:US08352499B2

    公开(公告)日:2013-01-08

    申请号:US10452791

    申请日:2003-06-02

    IPC分类号: G06F17/00

    摘要: Ads are scored using, at least, user information and information associated with a user request, such as a search query or a document request. The scores may be used in determining whether to serve ads, how to serve ads, to order ads, to filter ads, etc. Items of user information, request-associated information, and/or ad information can be weighted based on previous uses of such information in the serving of ads and the performance of those served ads.

    摘要翻译: 至少使用与用户请求(例如搜索查询或文档请求)相关联的用户信息和信息来对广告进行评分。 分数可用于确定是否投放广告,如何投放广告,订购广告,过滤广告等。用户信息,请求相关信息和/或广告信息的项目可以基于以前使用的 广告投放信息以及投放广告的效果。

    Embedded communication of link information
    36.
    发明授权
    Embedded communication of link information 有权
    嵌入式通信链接信息

    公开(公告)号:US08260766B2

    公开(公告)日:2012-09-04

    申请号:US13181436

    申请日:2011-07-12

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30867

    摘要: A method of processing documents is described. The method includes the operation of receiving a document in a search engine crawler. The document includes an embedded first link tag. The first link tag includes one or more information pairs. A respective information pair includes a respective parameter and a corresponding value. The parameters in the one or more information pairs may correspond to content at one or more content locations or one or more document locations. The method also includes selecting a method of processing content associated with the first link tag in accordance with one or more of the information pairs.

    摘要翻译: 描述处理文档的方法。 该方法包括在搜索引擎爬行器中接收文档的操作。 该文档包括嵌入的第一链接标签。 第一个链接标签包括一个或多个信息对。 相应的信息对包括相应的参数和对应的值。 一个或多个信息对中的参数可对应于一个或多个内容位置或一个或多个文档位置处的内容。 该方法还包括根据信息对中的一个或多个来选择处理与第一链接标签相关联的内容的方法。

    ARTIFICIAL ANCHOR FOR A DOCUMENT
    38.
    发明申请
    ARTIFICIAL ANCHOR FOR A DOCUMENT 有权
    文件的人造锚

    公开(公告)号:US20120054169A1

    公开(公告)日:2012-03-01

    申请号:US13248293

    申请日:2011-09-29

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30882 G06F17/30899

    摘要: Methods, systems, and apparatus, including computer program products, for linking to an intra-document portion of a target document includes receiving an address for a target document identified by a search engine in response to a query, the target document including query-relevant text at an intra-document portion of the target document. An artificial anchor that corresponds to the intra-document portion is generated and appended the address.

    摘要翻译: 用于链接到目标文档的文档内部分的方法,系统和装置(包括计算机程序产品)包括响应于查询而接收由搜索引擎识别的目标文档的地址,所述目标文档包括查询相关 文本在目标文档的文档内部分。 生成对应于文件内部分的人造锚,并将其附加到地址。

    Embedded Communication of Link Information
    40.
    发明申请
    Embedded Communication of Link Information 有权
    链接信息的嵌入式通信

    公开(公告)号:US20110271095A1

    公开(公告)日:2011-11-03

    申请号:US13181436

    申请日:2011-07-12

    IPC分类号: G06F17/30 H04L9/00

    CPC分类号: G06F17/30867

    摘要: A method of processing documents is described. The method includes the operation of receiving a document in a search engine crawler. The document includes an embedded first link tag. The first link tag includes one or more information pairs. A respective information pair includes a respective parameter and a corresponding value. The parameters in the one or more information pairs may correspond to content at one or more content locations or one or more document locations. The method also includes selecting a method of processing content associated with the first link tag in accordance with one or more of the information pairs.

    摘要翻译: 描述处理文档的方法。 该方法包括在搜索引擎爬行器中接收文档的操作。 该文档包括嵌入的第一链接标签。 第一个链接标签包括一个或多个信息对。 相应的信息对包括相应的参数和对应的值。 一个或多个信息对中的参数可对应于一个或多个内容位置或一个或多个文档位置处的内容。 该方法还包括根据信息对中的一个或多个来选择处理与第一链接标签相关联的内容的方法。