-
公开(公告)号:US09477758B1
公开(公告)日:2016-10-25
申请号:US13553731
申请日:2012-07-19
申请人: Simon Tong , Jeffrey Adgate Dean , Sanjay Ghemawat
发明人: Simon Tong , Jeffrey Adgate Dean , Sanjay Ghemawat
CPC分类号: G06F17/30864
摘要: In one aspect, the present disclosure can be embodied in a method that includes identifying a collection of entities from one or more data sources, calculating a score for subsets of entities from the collection based on one or more seed entities associated with the collection, identifying one or more entities from each of the subsets based on the calculated score, assigning the calculated score to the identified one or more entities from the respective subset, and ranking the one or more entities based on the assigned score, so as to identify entities in the collection that are related to the one or more seed entities.
摘要翻译: 一方面,本公开可以体现在一种方法中,该方法包括从一个或多个数据源识别实体的集合,基于与集合相关联的一个或多个种子实体从集合计算实体的子集的分数,识别 基于所计算的分数从所述子集中的每一个的一个或多个实体,将所计算的分数从所述相应子集分配给所识别的一个或多个实体,并且基于所分配的分数对所述一个或多个实体进行排名,以便识别 与一个或多个种子实体相关的集合。
-
公开(公告)号:US08417697B2
公开(公告)日:2013-04-09
申请号:US11208005
申请日:2005-08-22
申请人: Sanjay Ghemawat , John Piscitello , Simon Tong , Matt Cutts
发明人: Sanjay Ghemawat , John Piscitello , Simon Tong , Matt Cutts
CPC分类号: G06F17/3053 , G06F17/30867
摘要: A system may present information regarding a document and provide an option for removing the document. The system may also receive selection of the option and remove the document when the option is selected. The system may aggregate information regarding documents that have been removed by a group of users and assign scores to a set of documents based on the aggregated information.
-
公开(公告)号:US20070043721A1
公开(公告)日:2007-02-22
申请号:US11208005
申请日:2005-08-22
申请人: Sanjay Ghemawat , John Piscitello , Simon Tong , Matt Cutts
发明人: Sanjay Ghemawat , John Piscitello , Simon Tong , Matt Cutts
IPC分类号: G06F7/00
CPC分类号: G06F17/3053 , G06F17/30867
摘要: A system may present information regarding a document and provide an option for removing the document. The system may also receive selection of the option and remove the document when the option is selected. The system may aggregate information regarding documents that have been removed by a group of users and assign scores to a set of documents based on the aggregated information.
摘要翻译: 系统可以呈现关于文档的信息并提供用于移除文档的选项。 当选择该选项时,系统还可以接收该选项的选择并移除文档。 该系统可以聚合关于一组用户已被删除的文档的信息,并且基于聚合信息将分数分配给一组文档。
-
公开(公告)号:US09189548B2
公开(公告)日:2015-11-17
申请号:US12901274
申请日:2010-10-08
申请人: Simon Tong
发明人: Simon Tong
CPC分类号: G06F17/30864
摘要: A search engine includes a decision component that determines whether documents that are returned in response to a user search query are likely to be very relevant to the search query. Links that refer to documents that the search engine determines to likely be very relevant may be displayed with visual cues that assist the user in browsing the links. The decision component may base its decision on a number of parameters, including: (1) the position of the document in a ranked list of search results, (2) the click through rate of the document, (3) relevance scores for the document and other documents that are returned as hits in response to the search query, and (4) whether the document is classified as a pornographic document (the search engine may refrain from showing visual cues for potentially pornographic documents).
摘要翻译: 搜索引擎包括决定组件,其确定响应于用户搜索查询返回的文档是否可能与搜索查询非常相关。 指向搜索引擎确定可能非常相关的文档的链接可以用辅助用户浏览链接的视觉提示来显示。 决策组件可以根据多个参数进行决策,包括:(1)文档在搜索结果排名列表中的位置,(2)文档的点击率,(3)文档的相关性分数 以及作为响应于搜索查询的命中而返回的其他文档,以及(4)文档是否被分类为色情文档(搜索引擎可能不会显示可能的色情文档的视觉提示)。
-
5.
公开(公告)号:US08977612B1
公开(公告)日:2015-03-10
申请号:US13617019
申请日:2012-09-14
申请人: Simon Tong , Benjamin N. Lee , Eric E. Altendorf
发明人: Simon Tong , Benjamin N. Lee , Eric E. Altendorf
IPC分类号: G06F17/30
CPC分类号: G06F17/3053 , G06F17/3071
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying one or more second documents related to one or more first documents. Strength of relationship scores between candidate documents in a group of candidate documents and each first document are determined by aggregating user selection data for users, the user selection data indicating, for each user, whether the user viewed the candidate document during a window of time after the first document is presented to the user on a search results web page in response to a query. An aggregate strength of relationship score is calculated for each candidate document from the strength of relationship scores for the candidate document. Second documents are selected from the candidate documents according to the aggregate strength of relationship scores for the candidate documents.
摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于识别与一个或多个第一文档相关的一个或多个第二文档。 通过聚合用户的用户选择数据来确定候选文件组中的候选文档和每个第一文档之间的关系分数的强度,用户选择数据指示用户在每个用户之间在一段时间后的时间段内查看候选文档 响应于查询,第一个文档在搜索结果网页上呈现给用户。 根据候选文件的关系分数的强度,计算每个候选文件的关系分数的总和强度。 根据候选文件的关系分数的总体强度,从候选文件中选择第二份文件。
-
公开(公告)号:US08819004B1
公开(公告)日:2014-08-26
申请号:US13585894
申请日:2012-08-15
IPC分类号: G06F17/30
CPC分类号: G06F17/30265 , G06F17/30 , G06F17/30864
摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for ranking images using hover data. In one aspect, a method includes determining a click count and a hover count for an image and a search query pair. The click count specifies a number of times that an image search result that includes a representation of the image has been selected when provided in response to the search query. The hover count specifies a number of times that the representation of the image has been hovered over when the image search result has been provided in response to the search query. A quality measure for the image with respect to the search query is determined. The quality measure is based on the click count and the hover count. A ranking of the image is adjusted for the search query based on the quality measure for the image.
摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于使用悬停数据对图像进行排序。 一方面,一种方法包括确定图像和搜索查询对的点击次数和悬停计数。 点击计数指定响应于搜索查询而提供的包括图像的表示的图像搜索结果已经被选择的次数。 悬停计数指定当响应于搜索查询而提供图像搜索结果时图像的表示已经被悬停的次数。 确定相对于搜索查询的图像的质量度量。 质量度量基于点击次数和悬停次数。 基于图像的质量测量,针对搜索查询调整图像的排名。
-
公开(公告)号:US08650199B1
公开(公告)日:2014-02-11
申请号:US13531670
申请日:2012-06-25
申请人: Simon Tong
发明人: Simon Tong
CPC分类号: G06F17/30622
摘要: A similarity detector detects similar or near duplicate occurrences of a document. The similarity detector determines similarity of documents by characterizing the documents as clusters each made up of a set of term entries, such as pairs of terms. A pair of terms, for example, indicates that the first term of the pair occurs before the second term of the pair in the underlying document. Another document that has a threshold level of term entries in common with a cluster is considered similar to the document characterized by the cluster.
摘要翻译: 相似性检测器检测文档的类似或接近重复的出现。 相似度检测器通过将文档表征为各自由诸如术语对的一组术语条目组成的簇来确定文档的相似性。 例如,一对术语表示该对中的第一项出现在基础文档中该对的第二项之前。 具有与集群相同的术语条目的阈值级别的另一文档被认为与由集群表征的文档类似。
-
公开(公告)号:US08527524B2
公开(公告)日:2013-09-03
申请号:US13174304
申请日:2011-06-30
申请人: Anurag Acharya , Jeffrey Dean , Paul Haahr , Monika Henzinger , Steve Lawrence , Karl Pfleger , Simon Tong
发明人: Anurag Acharya , Jeffrey Dean , Paul Haahr , Monika Henzinger , Steve Lawrence , Karl Pfleger , Simon Tong
IPC分类号: G06F7/00
CPC分类号: G06Q30/0246 , G06F17/30864 , Y10S707/99933
摘要: A system may determine a measure of how a content of a document changes over time, generate a score for the document based, at least in part, on the measure of how the content of the document changes over time, and rank the document with regard to at least one other document based, at least in part, on the score.
摘要翻译: 系统可以确定文档的内容如何随时间而变化的度量,至少部分地基于文档的内容如何随时间而变化的度量,以及关于文档的排序, 至少部分地基于该分数至少一个其他文档。
-
公开(公告)号:US08521725B1
公开(公告)日:2013-08-27
申请号:US10726345
申请日:2003-12-03
申请人: Mark Pearson , Simon Tong
发明人: Mark Pearson , Simon Tong
CPC分类号: G06F17/3053 , G06F17/30675 , G06F17/30864 , H04L67/42
摘要: Methods and systems for improved searching are described. In one of the described methods, a user enters a search query, and in response, a search engine receives a substantially complete initial search result set having a plurality of ranked article identifiers. The search engine automatically selects at least one of the article identifiers and provides a final result set in which the selected article identifier is ranked higher than in the initial search result set.
-
公开(公告)号:US08452758B2
公开(公告)日:2013-05-28
申请号:US13438145
申请日:2012-04-03
申请人: Simon Tong , Mark Pearson , Sergey Brin
发明人: Simon Tong , Mark Pearson , Sergey Brin
IPC分类号: G06F7/00
CPC分类号: G06F17/30864 , Y10S707/99933 , Y10S707/99935 , Y10S707/99942 , Y10S707/99943 , Y10S707/99944 , Y10S707/99945
摘要: Systems and methods that improve search rankings for a search query by using data associated with queries related to the search query are described. In one aspect, a search query is received, a related query related to the search query is determined, an article (such as a web page) associated with the search query is determined, and a ranking score for the article based at least in part on data associated with the related query is determined. Several algorithms and types of data associated with related queries useful in carrying out such systems and methods are described.
-
-
-
-
-
-
-
-
-