    TRAINING NEURAL NETWORKS USING A PRIORITIZED EXPERIENCE MEMORY

    Publication Number: US20250045583A1

    Publication Date: 2025-02-06

    Application Number: US18805367

    Application Date: 2024-08-14

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network used to select actions performed by a reinforcement learning agent interacting with an environment. In one aspect, a method includes maintaining a replay memory, where the replay memory stores pieces of experience data generated as a result of the reinforcement learning agent interacting with the environment. Each piece of experience data is associated with a respective expected learning progress measure that is a measure of an expected amount of progress made in the training of the neural network if the neural network is trained on the piece of experience data. The method further includes selecting a piece of experience data from the replay memory by prioritizing for selection pieces of experience data having relatively higher expected learning progress measures and training the neural network on the selected piece of experience data.
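
    As a rough illustration of the selection scheme this abstract describes, the following Python sketch samples experience in proportion to a priority derived from the TD-error magnitude, a common stand-in for an expected learning progress measure. The class and method names are illustrative assumptions, not taken from the patent.

        import random

        class PrioritizedReplayMemory:
            def __init__(self, capacity, alpha=0.6, epsilon=1e-6):
                self.capacity = capacity    # maximum number of stored transitions
                self.alpha = alpha          # how strongly priorities skew sampling
                self.epsilon = epsilon      # keeps every priority strictly positive
                self.data = []              # stored pieces of experience data
                self.priorities = []        # one priority per stored piece

            def add(self, transition, td_error):
                # A new piece of experience enters with a priority derived
                # from its TD error.
                if len(self.data) >= self.capacity:
                    self.data.pop(0)
                    self.priorities.pop(0)
                self.data.append(transition)
                self.priorities.append((abs(td_error) + self.epsilon) ** self.alpha)

            def sample(self, batch_size):
                # Pieces with higher expected learning progress are drawn more often.
                indices = random.choices(range(len(self.data)),
                                         weights=self.priorities, k=batch_size)
                return [self.data[i] for i in indices], indices

            def update_priorities(self, indices, td_errors):
                # After a training step, refresh priorities with the new TD errors.
                for i, err in zip(indices, td_errors):
                    self.priorities[i] = (abs(err) + self.epsilon) ** self.alpha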

    Adaptive visual speech recognition

    Publication Number: US12211488B2

    Publication Date: 2025-01-28

    Application Number: US18571553

    Application Date: 2022-06-15

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing video data using an adaptive visual speech recognition model. One of the methods includes receiving a video that includes a plurality of video frames that depict a first speaker; obtaining a first embedding characterizing the first speaker; and processing a first input comprising (i) the video and (ii) the first embedding using a visual speech recognition neural network having a plurality of parameters, wherein the visual speech recognition neural network is configured to process the video and the first embedding in accordance with trained values of the parameters to generate a speech recognition output that defines a sequence of one or more words being spoken by the first speaker in the video.
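
    A minimal PyTorch sketch of this conditioning pattern follows: the speaker embedding is tiled across time and fused with per-frame visual features before decoding into per-frame vocabulary logits. The layer sizes, the GRU encoder, and the names SpeakerConditionedVSR and to_vocab are illustrative assumptions, not the patented architecture.

        import torch
        import torch.nn as nn

        class SpeakerConditionedVSR(nn.Module):
            def __init__(self, frame_dim=512, embed_dim=64, hidden_dim=256,
                         vocab_size=1000):
                super().__init__()
                # Fuse each frame feature with the speaker embedding, then decode.
                self.encoder = nn.GRU(frame_dim + embed_dim, hidden_dim,
                                      batch_first=True)
                self.to_vocab = nn.Linear(hidden_dim, vocab_size)

            def forward(self, frame_features, speaker_embedding):
                # frame_features: (batch, time, frame_dim) per-frame visual features
                # speaker_embedding: (batch, embed_dim) characterizes the speaker
                t = frame_features.size(1)
                tiled = speaker_embedding.unsqueeze(1).expand(-1, t, -1)
                fused = torch.cat([frame_features, tiled], dim=-1)
                hidden, _ = self.encoder(fused)
                return self.to_vocab(hidden)  # per-frame logits (e.g., for CTC)

        model = SpeakerConditionedVSR()
        frames = torch.randn(2, 75, 512)   # precomputed features for 75 frames
        speaker = torch.randn(2, 64)       # first-speaker embedding
        logits = model(frames, speaker)    # shape (2, 75, 1000)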

    BANDWIDTH EXTENSION OF INCOMING DATA USING NEURAL NETWORKS

    Publication Number: US20250022476A1

    Publication Date: 2025-01-16

    Application Number: US18780377

    Application Date: 2024-07-22

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for bandwidth extension. One of the methods includes obtaining a low-resolution version of an input, the low-resolution version of the input comprising a first number of samples at a first sample rate over a first time period; and generating, from the low-resolution version of the input, a high-resolution version of the input comprising a second, larger number of samples at a second, higher sample rate over the first time period. Generating the high-resolution version includes generating a representation of the low-resolution version of the input; processing the representation of the low-resolution version of the input through a conditioning neural network to generate a conditioning input; and processing the conditioning input using a generative neural network to generate the high-resolution version of the input.
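
    The two-stage pipeline in this abstract can be sketched in PyTorch as below: a conditioning network summarizes the low-resolution input, and a generative network upsamples that summary in time. The 4x upsampling factor, the convolutional layers, and the names ConditioningNetwork and GenerativeNetwork are illustrative assumptions, not the patented design.

        import torch
        import torch.nn as nn

        class ConditioningNetwork(nn.Module):
            def __init__(self, channels=64):
                super().__init__()
                self.net = nn.Sequential(
                    nn.Conv1d(1, channels, kernel_size=9, padding=4), nn.ReLU(),
                    nn.Conv1d(channels, channels, kernel_size=9, padding=4),
                    nn.ReLU(),
                )

            def forward(self, low_res):
                # low_res: (batch, 1, num_samples) -> conditioning input
                return self.net(low_res)

        class GenerativeNetwork(nn.Module):
            def __init__(self, channels=64, factor=4):
                super().__init__()
                # A transposed convolution stretches the conditioning input
                # in time by the upsampling factor.
                self.upsample = nn.ConvTranspose1d(channels, channels,
                                                   kernel_size=factor,
                                                   stride=factor)
                self.head = nn.Conv1d(channels, 1, kernel_size=9, padding=4)

            def forward(self, conditioning):
                return self.head(torch.relu(self.upsample(conditioning)))

        cond_net, gen_net = ConditioningNetwork(), GenerativeNetwork()
        low = torch.randn(1, 1, 4000)   # e.g., 0.5 s of samples at 8 kHz
        high = gen_net(cond_net(low))   # (1, 1, 16000): same period, 4x samples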

    Reinforcement learning for active sequence processing

    Publication Number: US12175737B2

    Publication Date: 2024-12-24

    Application Number: US17773789

    Application Date: 2020-11-13

    Abstract: A system that is configured to receive a sequence of task inputs and to perform a machine learning task is described. The system includes a reinforcement learning (RL) neural network and a task neural network. The RL neural network is configured to: generate, for each task input of the sequence of task inputs, a respective decision that determines whether to encode the task input or to skip the task input, and provide the respective decision of each task input to the task neural network. The task neural network is configured to: receive the sequence of task inputs, receive, from the RL neural network, for each task input of the sequence of task inputs, a respective decision that determines whether to encode the task input or to skip the task input, process each of the un-skipped task inputs in the sequence of task inputs to generate a respective accumulated feature for the un-skipped task input, wherein the respective accumulated feature characterizes features of the un-skipped task input and of previous un-skipped task inputs in the sequence, and generate a machine learning task output for the machine learning task based on the last accumulated feature generated for the last un-skipped task input in the sequence.
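
    The encode-or-skip control loop can be sketched in PyTorch as follows, with a per-step policy standing in for the RL neural network and a GRU hidden state standing in for the accumulated feature. The names SkipPolicy and TaskNetwork, and all sizes, are illustrative assumptions.

        import torch
        import torch.nn as nn

        class SkipPolicy(nn.Module):
            def __init__(self, input_dim=32):
                super().__init__()
                self.scorer = nn.Linear(input_dim, 2)  # logits for [skip, encode]

            def forward(self, task_input):
                logits = self.scorer(task_input)
                return torch.distributions.Categorical(logits=logits).sample()

        class TaskNetwork(nn.Module):
            def __init__(self, input_dim=32, hidden_dim=64, num_classes=10):
                super().__init__()
                self.cell = nn.GRUCell(input_dim, hidden_dim)
                self.head = nn.Linear(hidden_dim, num_classes)

            def forward(self, sequence, decisions):
                h = torch.zeros(sequence.size(0), self.cell.hidden_size)
                for t in range(sequence.size(1)):
                    if decisions[t] == 1:  # encode only the un-skipped inputs
                        # h accumulates features of all un-skipped inputs so far.
                        h = self.cell(sequence[:, t], h)
                return self.head(h)        # output from the last accumulated feature

        policy, task_net = SkipPolicy(), TaskNetwork()
        sequence = torch.randn(1, 20, 32)  # a sequence of 20 task inputs
        decisions = [policy(sequence[:, t]).item() for t in range(20)]
        output = task_net(sequence, decisions)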

    DEMONSTRATION-DRIVEN REINFORCEMENT LEARNING

    Publication Number: US20240412063A1

    Publication Date: 2024-12-12

    Application Number: US18698218

    Application Date: 2022-10-05

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a reinforcement learning system to select actions to be performed by an agent interacting with an environment to perform a particular task. In one aspect, one of the methods includes obtaining a training sequence comprising a respective training observation at each of a plurality of time steps; obtaining demonstration data comprising one or more demonstration sequences; generating a new training sequence from the training sequence and the demonstration data; and training a goal-conditioned policy neural network on the new training sequence through reinforcement learning.
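
    One plausible reading of "generating a new training sequence from the training sequence and the demonstration data" is hindsight-style goal relabeling, sketched below in Python: the agent's trajectory is relabeled with a goal reached in a demonstration, so a goal-conditioned policy can be trained on it. The function name and data layout are illustrative assumptions, not the patented procedure.

        import random

        def make_new_training_sequence(training_sequence, demonstration_sequences):
            """Each sequence is a list of (observation, action) pairs."""
            demo = random.choice(demonstration_sequences)
            # Treat the demonstration's final observation as the goal.
            goal = demo[-1][0]
            # Pair every training step with the demonstration-derived goal so a
            # goal-conditioned policy can be trained on the new sequence.
            return [{"observation": obs, "action": act, "goal": goal}
                    for obs, act in training_sequence]

        training_sequence = [((i, i + 1), 0) for i in range(5)]  # toy trajectory
        demos = [[((9, 9), 1), ((10, 10), 1)]]                   # toy demonstration
        new_sequence = make_new_training_sequence(training_sequence, demos)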

    Rating tasks and policies using conditional probability distributions derived from equilibrium-based solutions of games

    Publication Number: US12151171B2

    Publication Date: 2024-11-26

    Application Number: US17963113

    Application Date: 2022-10-10

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for rating tasks and policies using conditional probability distributions derived from equilibrium-based solutions of games. One of the methods includes: determining, for each action selection policy in a pool of action selection policies, a respective performance measure of the action selection policy on each task in a pool of tasks, processing the performance measures of the action selection policies on the tasks to generate data defining a joint probability distribution over a set of action selection policy-task pairs, and processing the joint probability distribution over the set of action selection policy-task pairs to generate a respective rating for each action selection policy in the pool of action selection policies, where the respective rating for each action selection policy characterizes a utility of the action selection policy in performing tasks from the pool of tasks.
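
    The overall data flow of this method can be sketched in Python with numpy as below. For brevity, the joint distribution over policy-task pairs is built with a softmax over the performance measures, a stand-in for the equilibrium-based game solution the patent actually uses; only the flow from performance matrix to joint distribution to per-policy ratings mirrors the abstract.

        import numpy as np

        def rate_policies(performance, temperature=1.0):
            # performance[i, j]: measure of policy i on task j.
            logits = performance / temperature
            joint = np.exp(logits - logits.max())
            joint /= joint.sum()               # joint distribution over pairs
            task_marginal = joint.sum(axis=0)  # weight carried by each task
            # Rate each policy by its task-weighted expected performance.
            return performance @ task_marginal

        performance = np.array([[1.0, 0.2],    # policy 0 on tasks 0 and 1
                                [0.4, 0.9]])   # policy 1 on tasks 0 and 1
        print(rate_policies(performance))      # one rating per policy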
