-
Publication number: US20150169584A1
Publication date: 2015-06-18
Application number: US14401828
Filing date: 2013-05-17
Applicant: GOOGLE INC.
Inventor: Chung Tin Kwok, Lei Zhong, Zhihuan Qiu
IPC: G06F17/30
CPC classification number: G06F16/24578, G06F16/2228, G06F16/248, G06F16/951
Abstract: A system, computer-readable storage medium storing at least one program, and a computer-implemented method for re-ranking ranked search results is presented. Ranked search results satisfying a search query are obtained, where the ranked search results include a first search result corresponding to a first document associated with a first entity and a second search result corresponding to a second document associated with a second entity, and where the first search result is ranked higher than the second search result. The first document and the second document are determined to satisfy a similarity criterion. The second entity is determined to satisfy a predefined authorship differential with respect to the first entity. Responsive to determining that the second entity satisfies the predefined authorship differential with respect to the first entity, the second search result and the first search result in the ranked search results are swapped to produce re-ranked search results.
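The re-ranking step described in this abstract can be illustrated with a minimal sketch. The abstract does not define the similarity criterion or the authorship differential, so the choices here are assumptions made only for illustration: word-level Jaccard similarity with a threshold, a per-entity authorship score supplied by the caller, and a fixed minimum score gap. All names (`Result`, `jaccard`, `rerank`, `sim_threshold`, `differential`) are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class Result:
    doc_id: str   # identifier of the document behind the search result
    entity: str   # entity (e.g. author or site) associated with the document
    text: str     # document content used for the similarity check


def jaccard(a: str, b: str) -> float:
    """Word-level Jaccard similarity (an assumed stand-in for the similarity criterion)."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


def rerank(results, authorship, sim_threshold=0.8, differential=0.2):
    """Swap a lower-ranked result above a similar higher-ranked one when its
    entity's authorship score exceeds the other's by at least `differential`.

    `authorship` maps entity -> score; the scores and thresholds are assumptions.
    """
    ranked = list(results)  # leave the caller's list untouched
    for i in range(len(ranked)):
        for j in range(i + 1, len(ranked)):
            first, second = ranked[i], ranked[j]
            if jaccard(first.text, second.text) < sim_threshold:
                continue  # documents are not similar enough to compare
            gap = authorship.get(second.entity, 0.0) - authorship.get(first.entity, 0.0)
            if gap >= differential:
                ranked[i], ranked[j] = second, first  # swap the two results
    return ranked


results = [
    Result("a", "scraper", "the quick brown fox jumps"),
    Result("b", "author", "the quick brown fox jumps"),
    Result("c", "other", "unrelated content here"),
]
scores = {"scraper": 0.1, "author": 0.9, "other": 0.5}
reranked = rerank(results, scores)
```

In this example the first two documents are identical and the second entity's score is much higher, so results "a" and "b" trade places while "c" keeps its position.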
-
Publication number: US08909628B1
Publication date: 2014-12-09
Application number: US13668106
Filing date: 2012-11-02
Applicant: Google Inc.
Inventor: Chung Tin Kwok, Ryan H. Moulton, Zhihuan Qiu
IPC: G06F17/30
CPC classification number: G06F17/30864, G06Q30/0201
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a plurality of n-grams in a plurality of resources found in a particular site; determining, for each of the plurality of resources, a count of n-grams that originated in the resource; determining, based on counts of n-grams that originated in the resources, a first aggregate count of n-grams that originated in the particular site; determining a second aggregate count of the plurality of n-grams that were identified in the plurality of resources found in the particular site; and determining, based on the first and second aggregate counts, a site originality score for the particular site.
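The two aggregate counts in this abstract can be illustrated with a small sketch. The abstract does not say how an n-gram's originating resource is determined or how the two counts are combined, so this example assumes that an n-gram originates in the resource with the earliest first-seen timestamp and that the score is the ratio of the first aggregate count to the second. The corpus layout and the names `ngrams` and `site_originality` are hypothetical.

```python
def ngrams(text, n=3):
    """Return the set of word n-grams in `text`."""
    tokens = text.lower().split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def site_originality(site, corpus, n=3):
    """Score a site by the share of its n-grams that originated in its own resources.

    `corpus` is a list of (site, url, first_seen, text) tuples. Treating the
    earliest `first_seen` as the origin, and using a ratio as the score, are
    assumptions; the abstract leaves both unspecified.
    """
    # Map each n-gram to the resource where it was seen first.
    origin = {}
    for _site, url, first_seen, text in corpus:
        for gram in ngrams(text, n):
            if gram not in origin or first_seen < origin[gram][0]:
                origin[gram] = (first_seen, url)

    originated = 0  # first aggregate count: n-grams that originated in the site
    identified = 0  # second aggregate count: n-grams identified in the site
    for res_site, url, _first_seen, text in corpus:
        if res_site != site:
            continue
        grams = ngrams(text, n)
        identified += len(grams)
        # Per-resource count of n-grams that originated in this resource.
        originated += sum(1 for gram in grams if origin[gram][1] == url)

    return originated / identified if identified else 0.0


corpus = [
    ("orig.example", "orig.example/a", 1, "alpha beta gamma delta"),
    ("copy.example", "copy.example/a", 2, "alpha beta gamma delta epsilon"),
]
orig_score = site_originality("orig.example", corpus)
copy_score = site_originality("copy.example", corpus)
```

Here the earlier site originated both of its trigrams, while the later site originated only one of its three, so the later site receives the lower score.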
-