-
公开(公告)号:US20210027425A1
公开(公告)日:2021-01-28
申请号:US16324061
申请日:2018-02-26
Applicant: DeepMind Technologies Limited
Inventor: Nal Emmerich Kalchbrenner , Daniel Belov , Sergio Gomez Colmenarejo , Aaron Gerard Antonius van den Oord , Ziyu Wang , Joao Gomes de Freitas , Scott Ellison Reed
Abstract: A method of generating an output image having an output resolution of N pixels×N pixels, each pixel in the output image having a respective color value for each of a plurality of color channels, the method comprising: obtaining a low-resolution version of the output image; and upscaling the low-resolution version of the output image to generate the output image having the output resolution by repeatedly performing the following operations: obtaining a current version of the output image having a current K×K resolution; and processing the current version of the output image using a set of convolutional neural networks that are specific to the current resolution to generate an updated version of the output image having a 2K×2K resolution.
-
公开(公告)号:US20240394504A1
公开(公告)日:2024-11-28
申请号:US18637279
申请日:2024-04-16
Applicant: DeepMind Technologies Limited
Inventor: Misha Man Ray Denil , Sergio Gomez Colmenarejo , Serkan Cabi , David William Saxton , Joao Ferdinando Gomes de Freitas
Abstract: A reinforcement learning system is proposed comprising a plurality of property detector neural networks. Each property detector neural network is arranged to receive data representing an object within an environment, and to generate property data associated with a property of the object. A processor is arranged to receive an instruction indicating a task associated with an object having an associated property, and process the output of the plurality of property detector neural networks based upon the instruction to generate a relevance data item. The relevance data item indicates objects within the environment associated with the task. The processor also generates a plurality of weights based upon the relevance data item, and, based on the weights, generates modified data representing the plurality of objects within the environment. A neural network is arranged to receive the modified data and to output an action associated with the task.
-
公开(公告)号:US11361403B2
公开(公告)日:2022-06-14
申请号:US16324061
申请日:2018-02-26
Applicant: DeepMind Technologies Limited
Inventor: Nal Emmerich Kalchbrenner , Daniel Belov , Sergio Gomez Colmenarejo , Aaron Gerard Antonius van den Oord , Ziyu Wang , Joao Ferdinando Gomes de Freitas , Scott Ellison Reed
Abstract: A method of generating an output image having an output resolution of N pixels×N pixels, each pixel in the output image having a respective color value for each of a plurality of color channels, the method comprising: obtaining a low-resolution version of the output image; and upscaling the low-resolution version of the output image to generate the output image having the output resolution by repeatedly performing the following operations: obtaining a current version of the output image having a current K×K resolution; and processing the current version of the output image using a set of convolutional neural networks that are specific to the current resolution to generate an updated version of the output image having a 2K×2K resolution.
-
4.
公开(公告)号:US11615310B2
公开(公告)日:2023-03-28
申请号:US16302592
申请日:2017-05-19
Applicant: DEEPMIND TECHNOLOGIES LIMITED
Inventor: Misha Man Ray Denil , Tom Schaul , Marcin Andrychowicz , Joao Ferdinando Gomes de Freitas , Sergio Gomez Colmenarejo , Matthew William Hoffman , David Benjamin Pfau
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for training machine learning models. One method includes obtaining a machine learning model, wherein the machine learning model comprises one or more model parameters, and the machine learning model is trained using gradient descent techniques to optimize an objective function; determining an update rule for the model parameters using a recurrent neural network (RNN); and applying a determined update rule for a final time step in a sequence of multiple time steps to the model parameters.
-
公开(公告)号:US20200167633A1
公开(公告)日:2020-05-28
申请号:US16615061
申请日:2018-05-22
Applicant: DEEPMIND TECHNOLOGIES LIMITED
Inventor: Misha Man Ray Denil , Sergio Gomez Colmenarejo , Serkan Cabi , David William Saxton , Joao Ferdinando Gomes de Freitas
Abstract: A reinforcement learning system is proposed comprising a plurality of property detector neural networks. Each property detector neural network is arranged to receive data representing an object within an environment, and to generate property data associated with a property of the object. A processor is arranged to receive an instruction indicating a task associated with an object having an associated property, and process the output of the plurality of property detector neural networks based upon the instruction to generate a relevance data item. The relevance data item indicates objects within the environment associated with the task. The processor also generates a plurality of weights based upon the relevance data item, and, based on the weights, generates modified data representing the plurality of objects within the environment. A neural network is arranged to receive the modified data and to output an action associated with the task.
-
公开(公告)号:US20230376771A1
公开(公告)日:2023-11-23
申请号:US18180754
申请日:2023-03-08
Applicant: DeepMind Technologies Limited
Inventor: Misha Man Ray Denil , Tom Schaul , Marcin Andrychowicz , Joao Ferdinando Gomes de Freitas , Sergio Gomez Colmenarejo , Matthew William Hoffman , David Benjamin Pfau
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for training machine learning models. One method includes obtaining a machine learning model, wherein the machine learning model comprises one or more model parameters, and the machine learning model is trained using gradient descent techniques to optimize an objective function; determining an update rule for the model parameters using a recurrent neural network (RNN); and applying a determined update rule for a final time step in a sequence of multiple time steps to the model parameters.
-
公开(公告)号:US11734797B2
公开(公告)日:2023-08-22
申请号:US17751359
申请日:2022-05-23
Applicant: DeepMind Technologies Limited
Inventor: Nal Emmerich Kalchbrenner , Daniel Belov , Sergio Gomez Colmenarejo , Aaron Gerard Antonius van den Oord , Ziyu Wang , Joao Ferdinando Gomes de Freitas , Scott Ellison Reed
CPC classification number: G06T3/4046 , G06N3/045 , G06N20/00 , G06T3/4076
Abstract: A method of generating an output image having an output resolution of N pixels×N pixels, each pixel in the output image having a respective color value for each of a plurality of color channels, the method comprising: obtaining a low-resolution version of the output image; and upscaling the low-resolution version of the output image to generate the output image having the output resolution by repeatedly performing the following operations: obtaining a current version of the output image having a current K×K resolution; and processing the current version of the output image using a set of convolutional neural networks that are specific to the current resolution to generate an updated version of the output image having a 2K×2K resolution.
-
公开(公告)号:US20220284546A1
公开(公告)日:2022-09-08
申请号:US17751359
申请日:2022-05-23
Applicant: DeepMind Technologies Limited
Inventor: Nal Emmerich Kalchbrenner , Daniel Belov , Sergio Gomez Colmenarejo , Aaron Gerard Antonius van den Oord , Ziyu Wang , Joao Ferdinando Gomes de Freitas , Scott Ellison Reed
Abstract: A method of generating an output image having an output resolution of N pixels×N pixels, each pixel in the output image having a respective color value for each of a plurality of color channels, the method comprising: obtaining a low-resolution version of the output image; and upscaling the low-resolution version of the output image to generate the output image having the output resolution by repeatedly performing the following operations: obtaining a current version of the output image having a current K×K resolution; and processing the current version of the output image using a set of convolutional neural networks that are specific to the current resolution to generate an updated version of the output image having a 2K×2K resolution.
-
9.
公开(公告)号:US20220261639A1
公开(公告)日:2022-08-18
申请号:US17625361
申请日:2020-07-16
Applicant: DeepMind Technologies Limited
Inventor: Konrad Zolna , Scott Ellison Reed , Ziyu Wang , Alexander Novikov , Sergio Gomez Colmenarejo , Joao Ferdinando Gomes de Freitas , David Budden , Serkan Cabi
IPC: G06N3/08
Abstract: A method is proposed of training a neural network to generate action data for controlling an agent to perform a task in an environment. The method includes obtaining, for each of a plurality of performances of the task, one or more first tuple datasets, each first tuple dataset comprising state data characterizing a state of the environment at a corresponding time during the performance of the task; and a concurrent process of training the neural network and a discriminator network. The training process comprises a plurality of neural network update steps and a plurality of discriminator network update steps. Each neural network update step comprises: receiving state data characterizing a current state of the environment; using the neural network and the state data to generate action data indicative of an action to be performed by the agent; forming a second tuple dataset comprising the state data; using the second tuple dataset to generate a reward value, wherein the reward value comprises an imitation value generated by the discriminator network based on the second tuple dataset; and updating one or more parameters of the neural network based on the reward value. Each discriminator network update step comprises updating the discriminator network based on a plurality of the first tuple datasets and a plurality of the second tuple datasets, the update being to increase respective imitation values which the discriminator network generates upon receiving any of the plurality of the first tuple datasets compared to respective imitation values which the discriminator network generates upon receiving any of the plurality of the second tuple datasets. The updating process is performed subject to a constraint that the updated discriminator network, upon receiving any of at least a certain proportion of a first subset of the first tuple datasets and/or any of at least a certain proportion of a second subset of the second tuple datasets, does not generate imitation values which correctly indicate that those tuple datasets are first or second tuple datasets.
-
公开(公告)号:US12271823B2
公开(公告)日:2025-04-08
申请号:US18180754
申请日:2023-03-08
Applicant: DeepMind Technologies Limited
Inventor: Misha Man Ray Denil , Tom Schaul , Marcin Andrychowicz , Joao Ferdinando Gomes de Freitas , Sergio Gomez Colmenarejo , Matthew William Hoffman , David Benjamin Pfau
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for training machine learning models. One method includes obtaining a machine learning model, wherein the machine learning model comprises one or more model parameters, and the machine learning model is trained using gradient descent techniques to optimize an objective function; determining an update rule for the model parameters using a recurrent neural network (RNN); and applying a determined update rule for a final time step in a sequence of multiple time steps to the model parameters.
-
-
-
-
-
-
-
-
-