-
公开(公告)号:US20250111201A1
公开(公告)日:2025-04-03
申请号:US18477184
申请日:2023-09-28
Applicant: Nvidia Corporation
Inventor: James Robert Lucas , Derek Lim , Haggai Maron , Marc Teva Law
IPC: G06N3/0455 , G06N3/08
Abstract: Embodiments are disclosed for a generating graph representations of neural networks to be used as input for one or more metanetworks. Architectural information can be extracted from a neural network and used to generate graph a representation. A subgraph can be generated for each layer of the neural network, where each subgraph includes nodes that correspond to neurons and connecting edges that correspond to weights. Each layer of the neural network can be associated with a bias node that is connected to individual nodes of that layer using edges representing bias weights. Various types of neural networks and layers of neural networks can be represented by such graphs, which are then used as inputs for metanetworks. The subgraphs can be combined into a comprehensive graph representation of the neural network, which can be provided as input to a metanetwork to generate network parameters or perform another such operation.
-
公开(公告)号:US20230229916A1
公开(公告)日:2023-07-20
申请号:US18157608
申请日:2023-01-20
Applicant: NVIDIA Corporation
Inventor: Gal Chechik , Eli Alexander Meirom , Haggai Maron , Brucek Kurdo Khailany , Paul Martin Springer , Shie Mannor
Abstract: A method for contracting a tensor network is provided. The method comprises generating a graph representation of the tensor network, processing the graph representation to determine a contraction for the tensor network by an agent that implements a reinforcement learning algorithm, and processing the tensor network in accordance with the contraction to generate a contracted tensor network.
-
公开(公告)号:US20220391781A1
公开(公告)日:2022-12-08
申请号:US17827446
申请日:2022-05-27
Applicant: NVIDIA Corporation
Inventor: Or Litany , Haggai Maron , David Jesus Acuna Marrero , Jan Kautz , Sanja Fidler , Gal Chechik
Abstract: A method performed by a server is provided. The method comprises sending copies of a set of parameters of a hyper network (HN) to at least one client device, receiving from each client device in the at least one client device, a corresponding set of updated parameters of the HN, and determining a next set of parameters of the HN based on the corresponding sets of updated parameters received from the at least one client device. Each client device generates the corresponding set of updated parameters based on a local model architecture of the client device.
-
-