-
公开(公告)号:US20250045583A1
公开(公告)日:2025-02-06
申请号:US18805367
申请日:2024-08-14
Applicant: DeepMind Technologies Limited
Inventor: Tom Schaul , John Quan , David Silver
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network used to select actions performed by a reinforcement learning agent interacting with an environment. In one aspect, a method includes maintaining a replay memory, where the replay memory stores pieces of experience data generated as a result of the reinforcement learning agent interacting with the environment. Each piece of experience data is associated with a respective expected learning progress measure that is a measure of an expected amount of progress made in the training of the neural network if the neural network is trained on the piece of experience data. The method further includes selecting a piece of experience data from the replay memory by prioritizing for selection pieces of experience data having relatively higher expected learning progress measures and training the neural network on the selected piece of experience data.
-
公开(公告)号:US20250036958A1
公开(公告)日:2025-01-30
申请号:US18358920
申请日:2023-07-25
Applicant: DeepMind Technologies Limited
Inventor: Caglar Gulcehre , Thomas Le Paine , Srivatsan Srinivasan , Ksenia Konyushkova , Lotte Petronella Jacoba Weerts , Abhishek Sharma , Aditya Siddhant , Orhan Firat
IPC: G06N3/092 , G06N3/0475
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a generative neural network. One of the methods includes training a generative neural network by performing a sequence of a plurality training stages each generating an expanded training data set. The method also involves performing a sequence of improve steps, each comprising training the generative neural network on the training examples in a corresponding subset of the expanded training data set.
-
公开(公告)号:US12211488B2
公开(公告)日:2025-01-28
申请号:US18571553
申请日:2022-06-15
Applicant: DeepMind Technologies Limited
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing video data using an adaptive visual speech recognition model. One of the methods includes receiving a video that includes a plurality of video frames that depict a first speaker: obtaining a first embedding characterizing the first speaker; and processing a first input comprising (i) the video and (ii) the first embedding using a visual speech recognition neural network having a plurality of parameters, wherein the visual speech recognition neural network is configured to process the video and the first embedding in accordance with trained values of the parameters to generate a speech recognition output that defines a sequence of one or more words being spoken by the first speaker in the video.
-
公开(公告)号:US20250022476A1
公开(公告)日:2025-01-16
申请号:US18780377
申请日:2024-07-22
Applicant: DeepMind Technologies Limited
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for bandwidth extension. One of the methods includes obtaining a low-resolution version of an input, the low-resolution version of the input comprising a first number of samples at a first sample rate over a first time period; and generating, from the low-resolution version of the input, a high-resolution version of the input comprising a second, larger number of samples at a second, higher sample rate over the first time period. Generating the high-resolution version includes generating a representation of the low-resolution version of the input; processing the representation of the low-resolution version of the input through a conditioning neural network to generate a conditioning input; and processing the conditioning input using a generative neural network to generate the high-resolution version of the input.
-
公开(公告)号:US12175737B2
公开(公告)日:2024-12-24
申请号:US17773789
申请日:2020-11-13
Applicant: DEEPMIND TECHNOLOGIES LIMITED
Inventor: Viorica Patraucean , Bilal Piot , Joao Carreira , Volodymyr Mnih , Simon Osindero
Abstract: A system that is configured to receive a sequence of task inputs and to perform a machine learning task is described. The system includes a reinforcement learning (RL) neural network and a task neural network. The RL neural network is configured to: generate, for each task input of the sequence of task inputs, a respective decision that determines whether to encode the task input or to skip the task input, and provide the respective decision of each task input to the task neural network. The task neural network is configured to: receive the sequence of task inputs, receive, from the RL neural network, for each task input of the sequence of task inputs, a respective decision that determines whether to encode the task input or to skip the task input, process each of the un-skipped task inputs in the sequence of task inputs to generate a respective accumulated feature for the un-skipped task input, wherein the respective accumulated feature characterizes features of the un-skipped task input and of previous un-skipped task inputs in the sequence, and generate a machine learning task output for the machine learning task based on the last accumulated feature generated for the last un-skipped task input in the sequence.
-
6.
公开(公告)号:US20240412809A1
公开(公告)日:2024-12-12
申请号:US18813386
申请日:2024-08-23
Applicant: DeepMind Technologies Limited
Inventor: John Jumper , Andrew W. Senior , Richard Andrew Evans , Russell James Bates , Mikhail Figurnov , Alexander Pritzel , Timothy Frederick Goldie Green
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining a predicted structure of a protein that is specified by an amino acid sequence. In one aspect, a method comprises: obtaining a multiple sequence alignment for the protein; determining, from the multiple sequence alignment and for each pair of amino acids in the amino acid sequence of the protein, a respective initial embedding of the pair of amino acids; processing the initial embeddings of the pairs of amino acids using a pair embedding neural network comprising a plurality of self-attention neural network layers to generate a final embedding of each pair of amino acids; and determining the predicted structure of the protein based on the final embedding of each pair of amino acids.
-
公开(公告)号:US20240412063A1
公开(公告)日:2024-12-12
申请号:US18698218
申请日:2022-10-05
Applicant: DeepMind Technologies Limited
Inventor: Oleg O. Sushkov , Todor Bozhinov Davchev , Jonathan Karl Scholz
IPC: G06N3/08
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a reinforcement learning system to select actions to be performed by an agent interacting with an environment to perform a particular task. In one aspect, one of the methods includes obtaining a training sequence comprising a respective training observations at each of a plurality of time steps; obtaining demonstration data comprising one or more demonstration sequences; generating a new training sequence from the training sequence and the demonstration data; and training the goal-conditioned policy neural network on the new training sequence through reinforcement learning.
-
公开(公告)号:US20240403652A1
公开(公告)日:2024-12-05
申请号:US18699012
申请日:2022-10-05
Applicant: DeepMind Technologies Limited
Inventor: Dushyant ` Rao , Fereshteh Sadeghi , Leonard Hasenclever , Markus Wulfmeier , Martina Zambelli , Giulia Vezzani , Dhruva Tirumala Bukkapatnam , Yusuf Aytar , Joshua Merel , Nicolas Manfred Otto Heess , Raia Thais Hadsell
IPC: G06N3/092
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled using a hierarchical controller that includes a high-level controller neural network, a mid-level controller neural network, and a low-level controller neural network.
-
公开(公告)号:US12151171B2
公开(公告)日:2024-11-26
申请号:US17963113
申请日:2022-10-10
Applicant: DeepMind Technologies Limited
Inventor: Luke Christopher Marris
IPC: A63F13/798 , A63F13/822 , G06F17/18
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for rating tasks and policies using conditional probability distributions derived from equilibrium-based solutions of games. One of the methods includes: determining, for each action selection policy in a pool of action selection policies, a respective performance measure of the action selection policy on each task in a pool of tasks, processing the performance measures of the action selection policies on the tasks to generate data defining a joint probability distribution over a set of action selection policy-task pairs, and processing the joint probability distribution over the set of action selection policy-task pairs to generate a respective rating for each action selection policy in the pool of action selection policies, where the respective rating for each action selection policy characterizes a utility of the action selection policy in performing tasks from the pool of tasks.
-
10.
公开(公告)号:US20240378439A1
公开(公告)日:2024-11-14
申请号:US18642641
申请日:2024-04-22
Applicant: DeepMind Technologies Limited
Inventor: Aaron Gerard Antonius van den Oord , Yutian Chen , Danilo Jimenez Rezende , Oriol Vinyals , Joao Ferdinando Gomes de Freitas , Scott Ellison Reed
Abstract: A system comprising a causal convolutional neural network to autoregressively generate a succession of values of a data item conditioned upon previously generated values of the data item. The system includes support memory for a set of support data patches each of which comprises an encoding of an example data item. A soft attention mechanism attends to one or more patches when generating the current item value. The soft attention mechanism determines a set of scores for the support data patches, for example in the form of a soft attention query vector dependent upon the previously generated values of the data item. The soft attention query vector is used to query the memory. When generating the value of the data item at a current iteration layers of the causal convolutional neural network are conditioned upon the support data patches weighted by the scores.
-
-
-
-
-
-
-
-
-