-
Publication number: US09454599B2
Publication date: 2016-09-27
Application number: US14186320
Application date: 2014-02-21
Applicant: GOOGLE INC.
Inventor: Keith Golden, Ben Hutchinson, Amit Behal, Alexander Oliver Marks, Faen Zhang, Yuan Gao, Fei Wu
IPC: G06F17/30
CPC classification number: G06F17/30651, G06F17/30958, G06N5/02
Abstract: A system for automatically generating entity collections comprises a data graph including entities connected by edges and instructions that cause the computer system to determine a set of entities from the data graph and to determine a set of constraints that has a quantity of constraints. A constraint in the set represents a path in the data graph shared by at least two of the entities in the set of entities. The instructions also cause the computer system to generate candidate collection definitions from combinations of the constraints, where each candidate collection definition identifies at least one constraint and no more than the quantity of constraints. The instructions also cause the computer system to determine an information gain for at least some of the candidate collection definitions, and store at least one candidate collection definition that has an information gain that meets a threshold as a candidate collection.
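The pipeline the abstract describes (find shared constraints, combine them into candidate definitions, score each by information gain, keep those that meet a threshold) can be illustrated with a small sketch. Everything below is an assumption made for illustration and is not taken from the patent: the toy graph, the function names, the restriction of a "path" to a single (edge, target) hop, and the choice of binary-entropy information gain over membership in the seed set.

```python
from itertools import combinations
from math import log2

# Hypothetical toy data graph: entity -> set of (edge label, target) pairs.
GRAPH = {
    "ada":   {("profession", "scientist"), ("born_in", "uk")},
    "alan":  {("profession", "scientist"), ("born_in", "uk")},
    "marie": {("profession", "scientist"), ("born_in", "poland")},
    "pablo": {("profession", "painter"), ("born_in", "spain")},
    "frida": {("profession", "painter"), ("born_in", "mexico")},
}

def shared_constraints(graph, entities):
    """Constraints (simplified to one-hop paths) held by at least two entities."""
    counts = {}
    for entity in entities:
        for constraint in graph[entity]:
            counts[constraint] = counts.get(constraint, 0) + 1
    return sorted(c for c, n in counts.items() if n >= 2)

def entropy(positives, total):
    """Binary entropy in bits of a group with `positives` members out of `total`."""
    if total == 0 or positives in (0, total):
        return 0.0
    p = positives / total
    return -(p * log2(p) + (1 - p) * log2(1 - p))

def information_gain(graph, seed, definition):
    """Assumed scoring: entropy reduction of seed membership when the whole
    graph is split into entities that satisfy the definition and the rest."""
    universe = list(graph)
    matched = [e for e in universe if set(definition) <= graph[e]]
    rest = [e for e in universe if e not in matched]

    def in_seed(group):
        return sum(1 for e in group if e in seed)

    before = entropy(in_seed(universe), len(universe))
    after = sum(
        len(group) / len(universe) * entropy(in_seed(group), len(group))
        for group in (matched, rest)
    )
    return before - after

def candidate_collections(graph, seed, threshold):
    """Try every combination of 1..k shared constraints; keep those whose
    information gain meets the threshold."""
    constraints = shared_constraints(graph, seed)
    kept = []
    for size in range(1, len(constraints) + 1):
        for definition in combinations(constraints, size):
            gain = information_gain(graph, seed, definition)
            if gain >= threshold:
                kept.append((definition, gain))
    return kept

SEED = {"ada", "alan", "marie"}
CANDIDATES = candidate_collections(GRAPH, SEED, threshold=0.9)
```

With this toy data, the single constraint `("profession", "scientist")` separates the seed entities from the rest perfectly, so it is the only definition that clears the 0.9 threshold. Enumerating every combination grows exponentially with the number of constraints, so the sketch is only suitable for small constraint sets.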
-
Publication number: US20150100568A1
Publication date: 2015-04-09
Application number: US14186320
Application date: 2014-02-21
Applicant: GOOGLE INC.
Inventor: Keith Golden, Ben Hutchinson, Amit Behal, Alexander Oliver Marks, Faen Zhang, Yuan Gao, Fei Wu
IPC: G06F17/30
CPC classification number: G06F17/30651, G06F17/30958, G06N5/02
Abstract: A system for automatically generating entity collections comprises a data graph including entities connected by edges and instructions that cause the computer system to determine a set of entities from the data graph and to determine a set of constraints that has a quantity of constraints. A constraint in the set represents a path in the data graph shared by at least two of the entities in the set of entities. The instructions also cause the computer system to generate candidate collection definitions from combinations of the constraints, where each candidate collection definition identifies at least one constraint and no more than the quantity of constraints. The instructions also cause the computer system to determine an information gain for at least some of the candidate collection definitions, and store at least one candidate collection definition that has an information gain that meets a threshold as a candidate collection.
-