-
公开(公告)号:US20220292404A1
公开(公告)日:2022-09-15
申请号:US17830286
申请日:2022-06-01
Applicant: DeepMind Technologies Limited
Inventor: Yutian Chen , Joao Ferdinando Gomes de Freitas
Abstract: Methods and systems for determining an optimized setting for one or more process parameters of a machine learning training process. One of the methods includes processing a current network input using a recurrent neural network in accordance with first values of the network parameters to obtain a current network output, obtaining a measure of the performance of the machine learning training process with an updated setting defined by the current network output, and generating a new network input that comprises (i) the updated setting defined by the current network output and (ii) the measure of the performance of the training process with the updated setting defined by the current network output.
-
公开(公告)号:US20220284546A1
公开(公告)日:2022-09-08
申请号:US17751359
申请日:2022-05-23
Applicant: DeepMind Technologies Limited
Inventor: Nal Emmerich Kalchbrenner , Daniel Belov , Sergio Gomez Colmenarejo , Aaron Gerard Antonius van den Oord , Ziyu Wang , Joao Ferdinando Gomes de Freitas , Scott Ellison Reed
Abstract: A method of generating an output image having an output resolution of N pixels×N pixels, each pixel in the output image having a respective color value for each of a plurality of color channels, the method comprising: obtaining a low-resolution version of the output image; and upscaling the low-resolution version of the output image to generate the output image having the output resolution by repeatedly performing the following operations: obtaining a current version of the output image having a current K×K resolution; and processing the current version of the output image using a set of convolutional neural networks that are specific to the current resolution to generate an updated version of the output image having a 2K×2K resolution.
-
23.
公开(公告)号:US20220261639A1
公开(公告)日:2022-08-18
申请号:US17625361
申请日:2020-07-16
Applicant: DeepMind Technologies Limited
Inventor: Konrad Zolna , Scott Ellison Reed , Ziyu Wang , Alexander Novikov , Sergio Gomez Colmenarejo , Joao Ferdinando Gomes de Freitas , David Budden , Serkan Cabi
IPC: G06N3/08
Abstract: A method is proposed of training a neural network to generate action data for controlling an agent to perform a task in an environment. The method includes obtaining, for each of a plurality of performances of the task, one or more first tuple datasets, each first tuple dataset comprising state data characterizing a state of the environment at a corresponding time during the performance of the task; and a concurrent process of training the neural network and a discriminator network. The training process comprises a plurality of neural network update steps and a plurality of discriminator network update steps. Each neural network update step comprises: receiving state data characterizing a current state of the environment; using the neural network and the state data to generate action data indicative of an action to be performed by the agent; forming a second tuple dataset comprising the state data; using the second tuple dataset to generate a reward value, wherein the reward value comprises an imitation value generated by the discriminator network based on the second tuple dataset; and updating one or more parameters of the neural network based on the reward value. Each discriminator network update step comprises updating the discriminator network based on a plurality of the first tuple datasets and a plurality of the second tuple datasets, the update being to increase respective imitation values which the discriminator network generates upon receiving any of the plurality of the first tuple datasets compared to respective imitation values which the discriminator network generates upon receiving any of the plurality of the second tuple datasets. The updating process is performed subject to a constraint that the updated discriminator network, upon receiving any of at least a certain proportion of a first subset of the first tuple datasets and/or any of at least a certain proportion of a second subset of the second tuple datasets, does not generate imitation values which correctly indicate that those tuple datasets are first or second tuple datasets.
-
公开(公告)号:US11386900B2
公开(公告)日:2022-07-12
申请号:US17043846
申请日:2019-05-20
Applicant: DeepMind Technologies Limited
IPC: G10L15/00 , G10L15/25 , G06K9/62 , G06N3/08 , G06T7/00 , G10L15/02 , G10L15/16 , G10L15/197 , G06V20/40 , G06V40/20 , G06V40/16
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing visual speech recognition. In one aspect, a method comprises receiving a video comprising a plurality of video frames, wherein each video frame depicts a pair of lips; processing the video using a visual speech recognition neural network to generate, for each output position in an output sequence, a respective output score for each token in a vocabulary of possible tokens, wherein the visual speech recognition neural network comprises one or more volumetric convolutional neural network layers and one or more time-aggregation neural network layers; wherein the vocabulary of possible tokens comprises a plurality of phonemes; and determining a sequence of words expressed by the pair of lips depicted in the video using the output scores.
-
25.
公开(公告)号:US20200250528A1
公开(公告)日:2020-08-06
申请号:US16758461
申请日:2018-10-25
Applicant: DeepMind Technologies Limited
Inventor: Aaron Gerard Antonius van den Oord , Yutian Chen , Danilo Jimenez Rezende , Oriol Vinyals , Joao Ferdinando Gomes de Freitas , Scott Ellison Reed
IPC: G06N3/08
Abstract: A system comprising a causal convolutional neural network to autoregressively generate a succession of values of a data item conditioned upon previously generated values of the data item. The system includes support memory for a set of support data patches each of which comprises an encoding of an example data item. A soft attention mechanism attends to one or more patches when generating the current item value. The soft attention mechanism determines a set of scores for the support data patches, for example in the form of a soft attention query vector dependent upon the previously generated values of the data item. The soft attention query vector is used to query the memory. When generating the value of the data item at a current iteration layers of the causal convolutional neural network are conditioned upon the support data patches weighted by the scores.
-
公开(公告)号:US20200160843A1
公开(公告)日:2020-05-21
申请号:US16687558
申请日:2019-11-18
Applicant: DeepMind Technologies Limited
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a video speech recognition model having a plurality of model parameters on a set of unlabeled video-audio data and using a trained speech recognition model. During the training, the values of the parameters of the trained audio speech recognition model fixed are generally fixed and only the values of the video speech recognition model are adjusted. Once being trained, the video speech recognition model can be used to recognize speech from video when corresponding audio is not available.
-
公开(公告)号:US20180260689A1
公开(公告)日:2018-09-13
申请号:US15977913
申请日:2018-05-11
Applicant: DeepMind Technologies Limited
Inventor: Ziyu Wang , Joao Ferdinando Gomes de Freitas , Marc Lanctot
CPC classification number: G06N3/0472 , G06N3/02 , G06N3/0454 , G06N3/08 , G06N3/088 , Y04S10/54
Abstract: Systems, methods, and apparatus, including computer programs encoded on a computer storage medium, for selecting an actions from a set of actions to be performed by an agent interacting with an environment. In one aspect, the system includes a dueling deep neural network. The dueling deep neural network includes a value subnetwork, an advantage subnetwork, and a combining layer. The value subnetwork processes a representation of an observation to generate a value estimate. The advantage subnetwork processes the representation of the observation to generate an advantage estimate for each action in the set of actions. The combining layer combines the value estimate and the respective advantage estimate for each action to generate a respective Q value for the action. The system selects an action to be performed by the agent in response to the observation using the respective Q values for the actions in the set of actions.
-
公开(公告)号:US12271823B2
公开(公告)日:2025-04-08
申请号:US18180754
申请日:2023-03-08
Applicant: DeepMind Technologies Limited
Inventor: Misha Man Ray Denil , Tom Schaul , Marcin Andrychowicz , Joao Ferdinando Gomes de Freitas , Sergio Gomez Colmenarejo , Matthew William Hoffman , David Benjamin Pfau
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for training machine learning models. One method includes obtaining a machine learning model, wherein the machine learning model comprises one or more model parameters, and the machine learning model is trained using gradient descent techniques to optimize an objective function; determining an update rule for the model parameters using a recurrent neural network (RNN); and applying a determined update rule for a final time step in a sequence of multiple time steps to the model parameters.
-
公开(公告)号:US12008445B2
公开(公告)日:2024-06-11
申请号:US17830286
申请日:2022-06-01
Applicant: DeepMind Technologies Limited
Inventor: Yutian Chen , Joao Ferdinando Gomes de Freitas
Abstract: Methods and systems for determining an optimized setting for one or more process parameters of a machine learning training process. One of the methods includes processing a current network input using a recurrent neural network in accordance with first values of the network parameters to obtain a current network output, obtaining a measure of the performance of the machine learning training process with an updated setting defined by the current network output, and generating a new network input that comprises (i) the updated setting defined by the current network output and (ii) the measure of the performance of the training process with the updated setting defined by the current network output.
-
公开(公告)号:US20240177001A1
公开(公告)日:2024-05-30
申请号:US18497924
申请日:2023-10-30
Applicant: DeepMind Technologies Limited
Inventor: Scott Ellison Reed , Joao Ferdinando Gomes de Freitas
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for neural programming. One of the methods includes processing a current neural network input using a core recurrent neural network to generate a neural network output; determining, from the neural network output, whether or not to end a currently invoked program and to return to a calling program from the set of programs; determining, from the neural network output, a next program to be called; determining, from the neural network output, contents of arguments to the next program to be called; receiving a representation of a current state of the environment; and generating a next neural network input from an embedding for the next program to be called and the representation of the current state of the environment.
-
-
-
-
-
-
-
-
-