-
公开(公告)号:US09047278B1
公开(公告)日:2015-06-02
申请号:US13673015
申请日:2012-11-09
Applicant: Google Inc.
Inventor: Benjamin J. Mann , Randolph G. Brown , John R. Provine , Vinicius J. Fortuna , Andrew W. Hogue
CPC classification number: G06F17/30 , G06F17/30867
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for query analysis. Queries are identified in query data, and an entity-descriptive portion and a suffix are determined in each query. Query counts are determined for a number of times that the respective queries occur in the query data. Based on the query counts, an entity-level count is estimated, which represents a number of query submissions that include the particular suffix and are considered to refer to a first entity. The entity is determined to be a particular type of entity. A type-level count is determined, which represents a number of query submissions that include the first suffix and are estimated to refer to entities of the particular type of entity. A score is assigned to the particular suffix based on the entity-level count and the type-level count.
Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于查询分析。 在查询数据中识别查询,并在每个查询中确定实体描述部分和后缀。 查询计数确定了相应查询在查询数据中发生的次数。 基于查询计数,估计实体级计数,其表示包括特定后缀并被认为指代第一实体的查询提交的数量。 该实体被确定为特定类型的实体。 确定类型级别计数,其表示包括第一后缀并被估计为引用特定类型的实体的实体的查询提交的数量。 基于实体级别计数和类型级别计数,将分数分配给特定后缀。