专利检索 aee:"DeepMind Technologies Limited" 第 1 页

1.

发明公开
CONTROLLING AGENTS BY SWITCHING BETWEEN CONTROL POLICIES DURING TASK EPISODES 审中-公开

公开(公告)号：US20240345873A1

公开(公告)日：2024-10-17

申请号：US18294784

申请日：2022-08-03

申请人： DeepMind Technologies Limited

发明人： Tom Schaul , Miruna Pîslar

IPC分类号： G06F9/48

CPC分类号： G06F9/4875

摘要： Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled to perform a task episode by switching the control policy that is used to control the agent at one or more time steps during the task episode.

2.

发明公开
TRAINING AGENT NEURAL NETWORKS THROUGH OPEN-ENDED LEARNING 审中-公开

公开(公告)号：US20240330701A1

公开(公告)日：2024-10-03

申请号：US18577484

申请日：2022-07-27

申请人： DeepMind Technologies Limited

发明人： Maxwell Elliot Jaderberg , Wojciech Czarnecki

IPC分类号： G06N3/092 , G06N3/0985

CPC分类号： G06N3/092 , G06N3/0985

摘要： Methods, systems, and apparatus, including computer programs encoded on computer storage media, for raining an agent neural network for use in controlling an agent to perform a plurality of tasks. One of the methods includes maintaining population data specifying a population of one or more candidate agent neural networks; and training each candidate agent neural network on a respective set of one or more tasks to update the parameter values of the parameters of the candidate agent neural networks in the population data, the training comprising, for each candidate agent neural network: obtaining data identifying a candidate task; obtaining data specifying a control policy for the candidate task; determining whether to train the candidate agent neural network on the candidate task; and in response to determining to train the candidate agent neural network on the candidate task, training the candidate agent neural network on the candidate task.

3.

发明公开
RETRIEVAL AUGMENTED REINFORCEMENT LEARNING 审中-公开

公开(公告)号：US20240320506A1

公开(公告)日：2024-09-26

申请号：US18698890

申请日：2022-10-05

申请人： DeepMind Technologies Limited

发明人： Anirudh Goyal , Andrea Banino , Abram Luke Friesen , Theophane Guillaume Weber , Adrià Puigdomènech Badia , Nan Ke , Simon Osindero , Timothy Paul Lillicrap , Charles Blundell

IPC分类号： G06N3/092 , G06N3/044 , G06N3/0455 , G06N3/084

CPC分类号： G06N3/092 , G06N3/044 , G06N3/0455 , G06N3/084

摘要： Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling a reinforcement learning agent in an environment to perform a task using a retrieval-augmented action selection process. One of the methods includes receiving a current observation characterizing a current state of the environment; processing an encoder network input comprising the current observation to determine a policy neural network hidden state that corresponds to the current observation; maintaining a plurality of trajectories generated as a result of the reinforcement learning agent interacting with the environment; selecting one or more trajectories from the plurality of trajectories; updating the policy neural network hidden state using update data determined from the one or more selected trajectories; and processing the updated hidden state using a policy neural network to generate a policy output that specifies an action to be performed by the agent in response to the current observation.

4.

发明公开
GENERATING ENVIRONMENT MODELS USING IN-CONTEXT ADAPTATION AND EXPLORATION 审中-公开

公开(公告)号：US20240256884A1

公开(公告)日：2024-08-01

申请号：US18424687

申请日：2024-01-26

申请人： DeepMind Technologies Limited

发明人： Hado Philip van Hasselt , Nan Ke , Chentian Jiang

IPC分类号： G06N3/092 , G06N3/042

CPC分类号： G06N3/092 , G06N3/042

摘要： Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling an agent interacting with an environment to perform a task. In one aspect, one of the methods include: maintaining context data; receiving a current observation characterizing a current state of the environment; generating a current graph model that represents the environment; selecting, from a possible set of actions and using the current graph model, a current action to be performed by the agent in response to the current observation; controlling the agent to perform the selected current action to cause the environment to transition from the current state into a new state; and updating the context data to include (i) data identifying the selected current action and (ii) a new observation characterizing the new state of the environment.

5.

发明公开
Using Hierarchical Representations for Neural Network Architecture Searching 审中-公开

公开(公告)号：US20240249146A1

公开(公告)日：2024-07-25

申请号：US18415376

申请日：2024-01-17

申请人： DeepMind Technologies Limited

发明人： Chrisantha Thomas Fernando , Karen Simonyan , Koray Kavukcuoglu , Hanxiao Liu , Oriol Vinyals

IPC分类号： G06N3/086 , G06F16/901 , G06F17/15 , G06N3/045

CPC分类号： G06N3/086 , G06F16/9024 , G06N3/045 , G06F17/15

摘要： A computer-implemented method for automatically determining a neural network architecture represents a neural network architecture as a data structure defining a hierarchical set of directed acyclic graphs in multiple levels. Each graph has an input, an output, and a plurality of nodes between the input and the output. At each level, a corresponding set of the nodes are connected pairwise by directed edges which indicate operations performed on outputs of one node to generate an input to another node. Each level is associated with a corresponding set of operations. At a lowest level, the operations associated with each edge are selected from a set of primitive operations. The method includes repeatedly generating new sample neural network architectures, and evaluating their fitness. The modification is performed by selecting a level, selecting two nodes at that level, and modifying, removing or adding an edge between those nodes according to operations associated with lower levels of the hierarchy.

6.

发明授权
Compressed sensing using neural networks 有权

公开(公告)号：US12032523B2

公开(公告)日：2024-07-09

申请号：US16818895

申请日：2020-03-13

申请人： DeepMind Technologies Limited

发明人： Yan Wu , Timothy Paul Lillicrap , Mihaela Rosca

IPC分类号： G06F16/174 , G06N3/045 , G06N3/08

CPC分类号： G06F16/1744 , G06N3/045 , G06N3/08

摘要： Methods, systems, and apparatus, including computer programs encoded on computer storage media, for compressed sensing using neural networks. One of the methods includes receiving an input measurement of an input data item; for each of one or more optimization steps: processing a latent representation using a generator neural network to generate a candidate reconstructed data item, processing the candidate reconstructed data item using a measurement neural network to generate a measurement of the candidate reconstructed data item, and updating the latent representation to reduce an error between the measurement and the input measurement; and processing the latent representation after the one or more optimization steps using the generator neural network to generate a reconstruction of the input data item.

7.

发明授权
Neural networks for scalable continual learning in domains with sequentially learned tasks 有权

公开(公告)号：US12020164B2

公开(公告)日：2024-06-25

申请号：US17048023

申请日：2019-04-18

申请人： DEEPMIND TECHNOLOGIES LIMITED

发明人： Jonathan Schwarz , Razvan Pascanu , Raia Thais Hadsell , Wojciech Czarnecki , Yee Whye Teh , Jelena Luketina

IPC分类号： G06N3/08 , G06F18/22 , G06N3/084 , G06N5/02 , G06N20/20

CPC分类号： G06N3/084 , G06F18/22 , G06N3/08 , G06N5/02 , G06N20/20

摘要： Methods, systems, and apparatus, including computer programs encoded on computer storage media, for scalable continual learning using neural networks. One of the methods includes receiving new training data for a new machine learning task; training an active subnetwork on the new training data to determine trained values of the active network parameters from initial values of the active network parameters while holding current values of the knowledge parameters fixed; and training a knowledge subnetwork on the new training data to determine updated values of the knowledge parameters from the current values of the knowledge parameters by training the knowledge subnetwork to generate knowledge outputs for the new training inputs that match active outputs generated by the trained active subnetwork for the new training inputs.

8.

发明授权
Low latency multi-constraint ranking of content items 有权

公开(公告)号：US12001484B2

公开(公告)日：2024-06-04

申请号：US17177097

申请日：2021-02-16

申请人： DeepMind Technologies Limited

发明人： Timothy Arthur Mann , Ivan Lobov , Anton Zhernov , Krishnamurthy Dvijotham , Xiaohong Gong , Dan-Andrei Calian

IPC分类号： G06F16/95 , G06F16/903 , G06F17/11 , G06F17/16

CPC分类号： G06F16/90335 , G06F17/11 , G06F17/16

摘要： Methods and systems for low-latency multi-constraint ranking of content items. One of the methods includes receiving a request to rank a plurality of content items for presentation to a user to maximize a primary objective subject to a plurality of constraints; initializing a dual variable vector; updating the dual variable vector, comprising: determining an overall objective score for the dual variable vector; identifying a plurality of candidate dual variable vectors that includes one or more neighboring node dual variable vectors; determining respective overall objective scores for each of the one or more candidate dual variable vectors; identifying the candidate with the best overall objective score; and determining whether to update the dual variable vector based on whether the identified candidate has a better overall objective score than the dual variable vector; and determining a final ranking for the content items based on the dual variable vector.

9.

发明公开
TRAINING GRAPH NEURAL NETWORKS USING A DE-NOISING OBJECTIVE 审中-公开

公开(公告)号：US20240176982A1

公开(公告)日：2024-05-30

申请号：US18283131

申请日：2022-05-30

申请人： DeepMind Technologies Limited

发明人： Jonathan William Godwin , Peter William Battaglia , Kevin Michael Schaarschmidt , Alvaro Sanchez

IPC分类号： G06N3/04 , G06N3/084

CPC分类号： G06N3/04 , G06N3/084

摘要： Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network that includes one or more graph neural network layers. In one aspect, a method comprises: generating data defining a graph, comprising: generating a respective final feature representation for each node, wherein, for each of one or more of the nodes, the respective final feature representation is a modified feature representation that is generated from a respective feature representation for the node using respective noise; processing the data defining the graph using one or more of the graph neural network layers of the neural network to generate a respective updated node embedding of each node; and processing, for each of one or more of the nodes having modified feature representations, the updated node embedding of the node to generate a respective de-noising prediction for the node.

10.

发明授权
Deep neural network system for similarity-based graph representations 有权

公开(公告)号：US11983269B2

公开(公告)日：2024-05-14

申请号：US18087704

申请日：2022-12-22

申请人： DeepMind Technologies Limited

发明人： Yujia Li , Chenjie Gu , Thomas Dullien , Oriol Vinyals , Pushmeet Kohli

IPC分类号： G06F21/56 , G06F16/901 , G06F17/16 , G06F18/22 , G06F21/57 , G06N3/04 , G06V10/426 , G06V10/82 , G06V30/196

CPC分类号： G06F21/563 , G06F16/9024 , G06F17/16 , G06F18/22 , G06F21/577 , G06N3/04 , G06V10/426 , G06V10/82 , G06V30/1988

摘要： There is described a neural network system implemented by one or more computers for determining graph similarity. The neural network system comprises one or more neural networks configured to process an input graph to generate a node state representation vector for each node of the input graph and an edge representation vector for each edge of the input graph; and process the node state representation vectors and the edge representation vectors to generate a vector representation of the input graph. The neural network system further comprises one or more processors configured to: receive a first graph; receive a second graph; generate a vector representation of the first graph; generate a vector representation of the second graph; determine a similarity score for the first graph and the second graph based upon the vector representations of the first graph and the second graph.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类