IDENTIFYING QUERY ASPECTS
    1.
    Patent application
    IDENTIFYING QUERY ASPECTS (under examination, published)

    Publication No.: US20160026696A1

    Publication date: 2016-01-28

    Application No.: US14875177

    Filing date: 2015-10-05

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer program products, for generating aspects associated with entities. In some implementations, a method includes receiving data identifying an entity; generating a group of candidate aspects for the entity; modifying the group of candidate aspects to generate a group of modified candidate aspects comprising combining similar candidate aspects and grouping candidate aspects using one or more aspect classes each associated with one or more candidate aspects; ranking one or more modified candidate aspects in the group of modified candidate aspects based on a diversity score and a popularity score; and storing an association between one or more highest ranked modified candidate aspects and the entity. The aspects can be used to organize and present search results in response to queries for the entity.
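    The steps named in this abstract (combine similar candidates, group them by aspect class, rank by popularity and diversity, store the top results) can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not define the similarity test or either score, so the `normalize` key, the Jaccard-based diversity score, the popularity share, and every function name below are hypothetical choices, not the patented method.

```python
from collections import defaultdict


def normalize(aspect: str) -> str:
    """Crude similarity key (assumption): lowercase and drop a trailing 's'."""
    words = [w[:-1] if w.endswith("s") and len(w) > 3 else w
             for w in aspect.lower().split()]
    return " ".join(words)


def modify_candidates(candidates, aspect_classes):
    """Combine similar aspects, then group aspects that share an aspect class.

    candidates: dict of aspect text -> popularity (for example a query count).
    aspect_classes: dict of class label -> set of normalized member aspects.
    Returns a dict of modified aspect -> summed popularity.
    """
    member_to_class = {m: label for label, members in aspect_classes.items()
                       for m in members}
    merged = defaultdict(float)
    for aspect, popularity in candidates.items():
        key = normalize(aspect)
        merged[member_to_class.get(key, key)] += popularity
    return dict(merged)


def jaccard(a: str, b: str) -> float:
    """Token overlap between two aspects, in [0, 1]."""
    ta, tb = set(a.split()), set(b.split())
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def rank_aspects(modified, top_k=3):
    """Greedy ranking: popularity share multiplied by a diversity score.

    Diversity (assumption) is 1 minus the highest token overlap with any
    aspect that has already been selected.
    """
    total = sum(modified.values())
    selected, remaining = [], dict(modified)
    while remaining and len(selected) < top_k:
        def score(aspect):
            popularity = remaining[aspect] / total
            diversity = 1.0 - max((jaccard(aspect, s) for s in selected),
                                  default=0.0)
            return popularity * diversity
        best = max(sorted(remaining), key=score)  # sorted() makes ties stable
        selected.append(best)
        del remaining[best]
    return selected


# Invented example data for a hypothetical entity "Paris".
candidates = {"Hotels": 50, "hotel": 30, "cheap hotels": 40,
              "museums": 20, "parks": 15, "weather": 25}
classes = {"attractions": {"museum", "park"}}
modified = modify_candidates(candidates, classes)
stored = {"Paris": rank_aspects(modified, top_k=3)}
print(stored)  # {'Paris': ['hotel', 'attractions', 'weather']}
```

    In this sketch "cheap hotel" is popular but loses to "attractions" and "weather" once "hotel" has been selected, which is the kind of effect a diversity score is meant to produce.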

    Automatic definition of entity collections
    2.
    Granted patent
    Automatic definition of entity collections (in force)

    Publication No.: US09454599B2

    Publication date: 2016-09-27

    Application No.: US14186320

    Filing date: 2014-02-21

    Applicant: GOOGLE INC.

    CPC classification number: G06F17/30651 G06F17/30958 G06N5/02

    Abstract: A system for automatically generating entity collections comprises a data graph including entities connected by edges and instructions that cause the computer system to determine a set of entities from the data graph and to determine a set of constraints that has a quantity of constraints. A constraint in the set represents a path in the data graph shared by at least two of the entities in the set of entities. The instructions also cause the computer system to generate candidate collection definitions from combinations of the constraints, where each candidate collection definition identifies at least one constraint and no more than the quantity of constraints. The instructions also cause the computer system to determine an information gain for at least some of the candidate collection definitions, and store at least one candidate collection definition that has an information gain that meets a threshold as a candidate collection.
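    A minimal sketch of the pipeline this abstract describes: find constraints shared by at least two entities, enumerate combinations of them as candidate definitions, and keep those whose information gain meets a threshold. Several points are assumptions, not details from the patent: constraints are simplified to single `(edge, target)` pairs instead of longer graph paths, information gain is taken to be the usual entropy reduction of a binary split over all entities in the graph, and all names and data are invented.

```python
from itertools import combinations
from math import log2


def entropy(pos: int, total: int) -> float:
    """Binary entropy in bits of a group with `pos` positives out of `total`."""
    if total == 0 or pos in (0, total):
        return 0.0
    p = pos / total
    return -(p * log2(p) + (1 - p) * log2(1 - p))


def shared_constraints(graph, seeds):
    """Constraints (edge, target) that hold for at least two seed entities."""
    counts = {}
    for entity in seeds:
        for constraint in graph[entity]:
            counts[constraint] = counts.get(constraint, 0) + 1
    return sorted(c for c, n in counts.items() if n >= 2)


def information_gain(graph, seeds, definition):
    """Entropy reduction in seed membership when splitting on `definition`."""
    required = set(definition)
    matching = {e for e, facts in graph.items() if required <= facts}
    total, pos = len(graph), len(seeds)
    in_n, in_pos = len(matching), len(matching & seeds)
    out_n, out_pos = total - in_n, pos - in_pos
    after = ((in_n / total) * entropy(in_pos, in_n)
             + (out_n / total) * entropy(out_pos, out_n))
    return entropy(pos, total) - after


def candidate_collections(graph, seeds, threshold):
    """Definitions of 1..n shared constraints whose gain meets the threshold."""
    constraints = shared_constraints(graph, seeds)
    kept = []
    for size in range(1, len(constraints) + 1):
        for definition in combinations(constraints, size):
            gain = information_gain(graph, seeds, definition)
            if gain >= threshold:
                kept.append((definition, gain))
    return kept


# Invented toy graph: entity -> set of (edge, target) facts.
graph = {
    "Lincoln": {("profession", "politician"), ("office", "US president")},
    "Obama": {("profession", "politician"), ("office", "US president")},
    "Merkel": {("profession", "politician"), ("office", "chancellor")},
    "Einstein": {("profession", "physicist")},
}
seeds = {"Lincoln", "Obama"}
for definition, gain in candidate_collections(graph, seeds, threshold=0.5):
    print(definition, round(gain, 3))
```

    Enumerating every combination grows exponentially with the number of constraints, which may be why the abstract speaks of computing information gain for "at least some" of the candidates.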

    AUTOMATIC DEFINITION OF ENTITY COLLECTIONS
    3.
    Patent application
    AUTOMATIC DEFINITION OF ENTITY COLLECTIONS (in force)

    Publication No.: US20150100568A1

    Publication date: 2015-04-09

    Application No.: US14186320

    Filing date: 2014-02-21

    Applicant: GOOGLE INC.

    CPC classification number: G06F17/30651 G06F17/30958 G06N5/02

    Abstract: A system for automatically generating entity collections comprises a data graph including entities connected by edges and instructions that cause the computer system to determine a set of entities from the data graph and to determine a set of constraints that has a quantity of constraints. A constraint in the set represents a path in the data graph shared by at least two of the entities in the set of entities. The instructions also cause the computer system to generate candidate collection definitions from combinations of the constraints, where each candidate collection definition identifies at least one constraint and no more than the quantity of constraints. The instructions also cause the computer system to determine an information gain for at least some of the candidate collection definitions, and store at least one candidate collection definition that has an information gain that meets a threshold as a candidate collection.

    Synthesizing union tables from the web

    Publication No.: US09720896B1

    Publication date: 2017-08-01

    Application No.: US14143032

    Filing date: 2013-12-30

    Applicant: Google Inc.

    CPC classification number: G06F17/245 G06F17/2247 G06F17/3089

    Abstract: Systems and techniques are provided for generating a union table from stitchable tables. Tables may be extracted from web pages to obtain extracted tables. Stitchable tables may be determined from the extracted tables. Hidden attributes for the stitchable tables may be extracted from the web pages from which the stitchable tables were extracted using segmentation of text for contextual data from the web pages into segment sequences, and alignment of the segment sequences. Iterative pairwise alignment may be used to align the segment sequences and obtain aligned segments. The stitchable tables may be joined into a union table. Hidden attributes from the aligned segments may be added to the union table. Headers for the hidden attributes in the union table may be labeled using a database of entities and class labels.
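    The stitching idea in this abstract can be illustrated with a small sketch. It rests on assumptions the abstract does not confirm: tables count as stitchable when their header rows are identical, context text is segmented on whitespace, pairwise alignment is approximated with Python's `difflib.SequenceMatcher`, every table is assumed to yield the same number of hidden values, and the class database is a plain dictionary. All names and data are invented.

```python
from collections import Counter
from difflib import SequenceMatcher
from functools import reduce


def common_segments(a, b):
    """Segments shared by two sequences, in order (one pairwise alignment)."""
    matcher = SequenceMatcher(a=a, b=b, autojunk=False)
    shared = []
    for block in matcher.get_matching_blocks():
        shared.extend(a[block.a:block.a + block.size])
    return shared


def stitch(tables, class_db):
    """Join tables that share the first table's headers into one union table.

    tables: list of dicts with 'headers', 'rows' and 'context' (page text).
    class_db: dict of entity name -> class label, used to name hidden columns.
    Returns (union_headers, union_rows).
    """
    headers = tables[0]["headers"]
    stitchable = [t for t in tables if t["headers"] == headers]
    sequences = [t["context"].split() for t in stitchable]
    # Fold the pairwise alignment over all sequences to get the shared template.
    template = set(reduce(common_segments, sequences))
    # Segments outside the template are treated as hidden attribute values.
    hidden_per_table = [[s for s in seq if s not in template]
                        for seq in sequences]
    union_rows = [row + hidden
                  for table, hidden in zip(stitchable, hidden_per_table)
                  for row in table["rows"]]
    hidden_headers = []
    for i, column in enumerate(zip(*hidden_per_table)):
        labels = Counter(class_db[v] for v in column if v in class_db)
        hidden_headers.append(labels.most_common(1)[0][0] if labels
                              else f"hidden_{i + 1}")
    return headers + hidden_headers, union_rows


# Invented example: two per-city tables plus one unrelated table.
tables = [
    {"headers": ["Month", "Rainfall_mm"], "rows": [["Jan", "85"], ["Feb", "83"]],
     "context": "Rainfall by month in Boston"},
    {"headers": ["Month", "Rainfall_mm"], "rows": [["Jan", "52"]],
     "context": "Rainfall by month in Chicago"},
    {"headers": ["Name", "Phone"], "rows": [["Ann", "555"]],
     "context": "Staff directory"},
]
class_db = {"Boston": "city", "Chicago": "city"}
union_headers, union_rows = stitch(tables, class_db)
print(union_headers)  # ['Month', 'Rainfall_mm', 'city']
for row in union_rows:
    print(row)
```

    The city name appears only in the page text, not in any table cell, so it surfaces as a hidden attribute once the shared words of the context are aligned away.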

    Identifying Query Aspects
    6.
    Patent application
    Identifying Query Aspects (in force)

    Publication No.: US20130268517A1

    Publication date: 2013-10-10

    Application No.: US13908456

    Filing date: 2013-06-03

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer program products, for generating aspects associated with entities. In some implementations, a method includes receiving data identifying an entity; generating a group of candidate aspects for the entity; modifying the group of candidate aspects to generate a group of modified candidate aspects comprising combining similar candidate aspects and grouping candidate aspects using one or more aspect classes each associated with one or more candidate aspects; ranking one or more modified candidate aspects in the group of modified candidate aspects based on a diversity score and a popularity score; and storing an association between one or more highest ranked modified candidate aspects and the entity. The aspects can be used to organize and present search results in response to queries for the entity.

    Identifying query aspects
    7.
    Granted patent
    Identifying query aspects (in force)

    Publication No.: US09152676B2

    Publication date: 2015-10-06

    Application No.: US13908456

    Filing date: 2013-06-03

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer program products, for generating aspects associated with entities. In some implementations, a method includes receiving data identifying an entity; generating a group of candidate aspects for the entity; modifying the group of candidate aspects to generate a group of modified candidate aspects comprising combining similar candidate aspects and grouping candidate aspects using one or more aspect classes each associated with one or more candidate aspects; ranking one or more modified candidate aspects in the group of modified candidate aspects based on a diversity score and a popularity score; and storing an association between one or more highest ranked modified candidate aspects and the entity. The aspects can be used to organize and present search results in response to queries for the entity.
