Category generalization for search queries

    公开(公告)号:US09501571B1

    公开(公告)日:2016-11-22

    申请号:US14658899

    申请日:2015-03-16

    Applicant: Google Inc.

    Abstract: A system and computer-implemented method are provided for associating categories with business names for generalizing search queries, the method including identifying one or more businesses within a first geographic region, determining a business name and one or more categories for each of the one or more businesses, generating one or more name components for each of the one or more businesses from the name of the business, generating one or more name component groups from the name components of the one or more businesses, each name component group including one or more identical name components, determining for each name component group, if the one or more name components within the name component group are associated with businesses that share one or more common categories and associating the one or more common categories with the name component of the name component group.

    Weighted-distance spatial indexing
    2.
    发明授权
    Weighted-distance spatial indexing 有权
    加权距离空间索引

    公开(公告)号:US08958817B1

    公开(公告)日:2015-02-17

    申请号:US13744700

    申请日:2013-01-18

    Applicant: Google Inc.

    CPC classification number: H04W4/02

    Abstract: Provided is a process of and apparatus for spatially indexing geographic items. The process may include obtaining geographic-item data identifying geographic items, the geographic location of each item, and an attribute of each item, wherein the geographic-item data identifies values of key-value pairs to be formed in a spatial index; obtaining a plurality of geographic-location keys each corresponding to a geographic area, the geographic-location keys identifying keys of the key-value pairs to be formed in the spatial index; and pairing each geographic-location key with an item among the geographic-item data. Pairing may be performed by: calculating distances between the geographic location of each of the items and the geographic-location key; weighting each of the distances based on the attribute of the item corresponding to that distance; and selecting the geographic item having the closest attribute-weighted distance as the item to be paired with the geographic-location key.

    Abstract translation: 提供了用于空间索引地理项目的过程和装置。 该过程可以包括获得识别地理项目的地理项目数据,每个项目的地理位置和每个项目的属性,其中地理项目数据标识要在空间索引中形成的键 - 值对的值; 获取每个对应于地理区域的多个地理位置密钥,所述地理位置密钥识别要在空间索引中形成的键值对的键; 以及将每个地理位置密钥与地理项目数据中的项目配对。 配对可以通过以下方式来执行:计算每个项目的地理位置与地理位置密钥之间的距离; 基于与该距离相对应的项目的属性对每个距离进行加权; 并且选择具有最接近的属性加权距离的地理项目作为要与地理位置密钥配对的项目。

    Training a probabilistic spelling checker from structured data
    3.
    发明授权
    Training a probabilistic spelling checker from structured data 有权
    从结构化数据训练概率拼写检查器

    公开(公告)号:US09558179B1

    公开(公告)日:2017-01-31

    申请号:US14098394

    申请日:2013-12-05

    Applicant: Google Inc.

    CPC classification number: G06F17/274 G06F15/18 G06F17/273 G06F17/30241

    Abstract: A spelling system derives a language model for a particular domain of structured data, the language model enabling determinations of alternative spellings of queries or other strings of text from that domain. More specifically, the spelling system calculates (a) probabilities that the various query entity types—such as STREET, CITY, or STATE for queries in the geographical domain—are arranged in each of the various possible orders, and (b) probabilities that an arbitrary query references given particular ones of the entities, such as the street “El Camino Real.” Based on the calculated probabilities, the spelling system generates a language model that has associated scores (e.g., probabilities) for each of a set of probable entity name orderings, where the total number of entity name orderings is substantially less than the number of all possible orderings. The language model can be applied to determine probabilities of arbitrary queries, and thus to suggest alternative queries more likely to represent what a user intended.

    Abstract translation: 拼写系统为结构化数据的特定域派生语言模型,该语言模型能够确定查询的替代拼写或来自该域的其他文本字符串。 更具体地,拼写系统计算(a)各种查询实体类型的概率,例如在地理域中的查询的STREET,CITY或STATE,排列在各种可能的顺序中的每一个中,以及(b)概率 给出特定实体的任意查询参考,例如街道“El Camino Real”。基于计算的概率,拼写系统生成具有与一组可能实体中的每一个相关联的分数(例如,概率)的语言模型 名称排序,其中实体名称排序的总数大大少于所有可能排序的数量。 语言模型可以用于确定任意查询的概率,从而建议替代查询更有可能表示用户想要的内容。

    Detecting new businesses with unrecognized query terms
    4.
    发明授权
    Detecting new businesses with unrecognized query terms 有权
    用无法识别的查询条件检测新业务

    公开(公告)号:US09218420B1

    公开(公告)日:2015-12-22

    申请号:US13777476

    申请日:2013-02-26

    Applicant: Google Inc.

    CPC classification number: G06F17/30864

    Abstract: Provided is a process for identifying a new business listing, the process including: identifying, from a log of local search queries, a term that does not correspond to a name of a business listing; determining a number of recent search queries containing the term and a number of historical search queries containing the term; determining that a rate based on the number of recent search queries exceeds a threshold rate based on the number of historical search queries; and identifying the term as a name of a new business listing.

    Abstract translation: 提供了用于识别新业务列表的过程,该过程包括:从本地搜索查询的日志中识别与商业列表的名称不对应的术语; 确定包含所述术语和包含所述术语的历史搜索查询的多个最近的搜索查询; 基于历史搜索查询的数量确定基于最近搜索查询的数量的速率超过阈值率; 并将该术语识别为新业务列表的名称。

Patent Agency Ranking