Generative Models for Discrete Datasets Constrained by a Marginal Distribution Specification

    公开(公告)号:US20240112013A1

    公开(公告)日:2024-04-04

    申请号:US17951889

    申请日:2022-09-23

    Applicant: Google LLC

    CPC classification number: G06N3/08 G06N3/0472

    Abstract: The present disclosure is directed to generative models for datasets constrained by marginal constraints. One method includes receiving a request to generate a target dataset based on a marginal constraint for a source dataset. A first object occurs at a source frequency in the source dataset. The marginal constraint indicates a target frequency for the first object. The source dataset encodes a set of co-occurrence frequencies for a plurality of object pairs. A source generative model is accessed. The source generative model includes a first module and a second module that are trained on the source dataset. The second module is updated based on the marginal constraint. An adapted generative model is generated that includes the first module and the updated second module. The target dataset is generated based on the adapted generative model. The first object occurs at the target frequency in the target dataset. The target dataset encodes the set of co-occurrence frequencies for the plurality of object pairs.

    Autoregressive graph generation machine learning models

    公开(公告)号:US11947503B2

    公开(公告)日:2024-04-02

    申请号:US17351086

    申请日:2021-06-17

    Applicant: Google LLC

    CPC classification number: G06F16/212 G06F16/2237 G06F16/2246 G06N3/045

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating data defining a graph. In one aspect, a method comprises: sequentially generating a respective edge set for each node in the graph, wherein for each of a plurality of nodes after a first node, generating the edge set for the node comprises: receiving a context embedding for the node that summarizes a respective edge set for each node that precedes the node; generating, based on the context embedding for the node: (i) a respective edge set for the node, and (ii) a respective embedding of the edge set for the node; generating a context embedding for a next node in the ordering of the nodes using the embedding of the edge set for the node; and adding the set of edges defined by the edge set for the node to the graph.

    Knowledge Graph Completion and Multi-Hop Reasoning in Knowledge Graphs at Scale

    公开(公告)号:US20230289626A1

    公开(公告)日:2023-09-14

    申请号:US18183410

    申请日:2023-03-14

    Applicant: Google LLC

    CPC classification number: G06N5/022 G06F16/2453

    Abstract: Provided are computing systems, methods, and platforms for negative sampling in knowledge graphs with improved efficiency. A knowledge graph comprising entities and links between the entities can be obtained. A query computation graph comprising nodes and edges can be generated based on the knowledge graph. The nodes of the query computation graph can include anchor nodes, a root node, and intermediate nodes positioned in paths between the anchor nodes and the root node. A node cut of a query of the query computation graph can be determined and can include at least one node that cuts at least one path between each anchor node and the root node of the query computation graph. Negative samples can be identified by bidirectionally traversing the query computation graph in a first direction from the anchor nodes to the node cut and in a second direction from the root node to the node cut.

Patent Agency Ranking