Patent search ap:("DEEPMIND TECHNOLOGIES LIMITED") AND inv:"Aaron Gerard Antonius van den Oord" Page 2

11.

发明授权
Learning observation representations by predicting the future in latent space 有权

公开(公告)号：US11995528B2

公开(公告)日：2024-05-28

申请号：US18090243

申请日：2022-12-28

Applicant: DeepMind Technologies Limited

Inventor： Aaron Gerard Antonius van den Oord , Yazhe Li , Oriol Vinyals

IPC: G06N3/006 , G06F17/16 , G06F18/22 , G06N3/045 , G06N3/048 , G06N3/08 , G06V10/764 , G06V10/77 , G06V10/82

CPC classification number: G06N3/006 , G06F17/16 , G06F18/22 , G06N3/045 , G06N3/048 , G06N3/08 , G06V10/764 , G06V10/7715 , G06V10/82

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an encoder neural network that is configured to process an input observation to generate a latent representation of the input observation. In one aspect, a method includes: obtaining a sequence of observations; for each observation in the sequence of observations, processing the observation using the encoder neural network to generate a latent representation of the observation; for each of one or more given observations in the sequence of observations: generating a context latent representation of the given observation; and generating, from the context latent representation of the given observation, a respective estimate of the latent representations of one or more particular observations that are after the given observation in the sequence of observations.

12.

发明授权
Processing sequences using convolutional neural networks 有权

公开(公告)号：US11948066B2

公开(公告)日：2024-04-02

申请号：US17375250

申请日：2021-07-14

Applicant: DeepMind Technologies Limited

Inventor： Aaron Gerard Antonius van den Oord , Sander Etienne Lea Dieleman , Nal Emmerich Kalchbrenner , Karen Simonyan , Oriol Vinyals , Lasse Espeholt

IPC: G06N3/04 , G06F40/279 , G06F40/44 , G06N3/044 , G06N3/045 , G06N3/047 , G06N3/084 , G10L13/04 , G10L13/08 , G10L15/16 , G10L25/30 , G06F17/18

CPC classification number: G06N3/047 , G06F40/279 , G06F40/44 , G06N3/044 , G06N3/045 , G06N3/084 , G10L13/04 , G10L13/086 , G10L15/16 , G10L25/30 , G06F17/18 , G10H2250/311

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing sequences using convolutional neural networks. One of the methods includes, for each of the time steps: providing a current sequence of audio data as input to a convolutional subnetwork, wherein the current sequence comprises the respective audio sample at each time step that precedes the time step in the output sequence, and wherein the convolutional subnetwork is configured to process the current sequence of audio data to generate an alternative representation for the time step; and providing the alternative representation for the time step as input to an output layer, wherein the output layer is configured to: process the alternative representation to generate an output that defines a score distribution over a plurality of possible audio samples for the time step.

13.

发明授权
Action selection neural network training using imitation learning in latent space 有权

公开(公告)号：US11663441B2

公开(公告)日：2023-05-30

申请号：US16586437

申请日：2019-09-27

Applicant: DeepMind Technologies Limited

Inventor： Scott Ellison Reed , Yusuf Aytar , Ziyu Wang , Tom Paine , Sergio Gomez Colmenarejo , David Budden , Tobias Pfaff , Aaron Gerard Antonius van den Oord , Oriol Vinyals , Alexander Novikov

IPC: G06N3/006 , G06F17/16 , G06N3/08 , G06F18/22 , G06N3/045 , G06N3/048 , G06V10/764 , G06V10/77 , G06V10/82

CPC classification number: G06N3/006 , G06F17/16 , G06F18/22 , G06N3/045 , G06N3/048 , G06N3/08 , G06V10/764 , G06V10/7715 , G06V10/82

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an action selection policy neural network, wherein the action selection policy neural network is configured to process an observation characterizing a state of an environment to generate an action selection policy output, wherein the action selection policy output is used to select an action to be performed by an agent interacting with an environment. In one aspect, a method comprises: obtaining an observation characterizing a state of the environment subsequent to the agent performing a selected action; generating a latent representation of the observation; processing the latent representation of the observation using a discriminator neural network to generate an imitation score; determining a reward from the imitation score; and adjusting the current values of the action selection policy neural network parameters based on the reward using a reinforcement learning training technique.

14.

发明授权
Speech recognition using convolutional neural networks 有权

公开(公告)号：US11069345B2

公开(公告)日：2021-07-20

申请号：US16719424

申请日：2019-12-18

Applicant: DeepMind Technologies Limited

Inventor： Aaron Gerard Antonius van den Oord , Sander Etienne Lea Dieleman , Nal Emmerich Kalchbrenner , Karen Simonyan , Oriol Vinyals , Lasse Espeholt

IPC: G10L15/16 , G06N3/04 , G06N3/08 , G10L15/02 , G10L15/22

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing speech recognition by generating a neural network output from an audio data input sequence, where the neural network output characterizes words spoken in the audio data input sequence. One of the methods includes, for each of the audio data inputs, providing a current audio data input sequence that comprises the audio data input and the audio data inputs preceding the audio data input in the audio data input sequence to a convolutional subnetwork comprising a plurality of dilated convolutional neural network layers, wherein the convolutional subnetwork is configured to, for each of the plurality of audio data inputs: receive the current audio data input sequence for the audio data input, and process the current audio data input sequence to generate an alternative representation for the audio data input.

15.

发明授权
Sample-efficient adaptive text-to-speech 有权

公开(公告)号：US10810993B2

公开(公告)日：2020-10-20

申请号：US16666043

申请日：2019-10-28

Applicant: DeepMind Technologies Limited

Inventor： Yutian Chen , Scott Ellison Reed , Aaron Gerard Antonius van den Oord , Oriol Vinyals , Heiga Zen , Ioannis Alexandros Assael , Brendan Shillingford , Joao Ferdinando Gomes de Freitas

IPC: G10L13/047 , G06N3/08 , G10L13/033 , G10L13/04

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating an adaptive audio-generation model. One of the methods includes generating an adaptive audio-generation model including learning a plurality of embedding vectors and parameter values of a neural network using training data comprising first text and audio data representing a plurality of different individual speakers speaking portions of the first text, wherein the plurality of embedding vectors represent respective voice characteristics of the plurality of different individual speakers. The adaptive audio-generation model is adapted for a new individual speaker using adaptation data comprising second text and audio data representing the new individual speaker speaking portions of the second text, the new individual speaker being different from each of the plurality of individual speakers, wherein adapting the audio-generation model includes learning a new embedding vector for the new individual speaker.

16.

发明授权
Speech recognition using convolutional neural networks 有权

公开(公告)号：US10586531B2

公开(公告)日：2020-03-10

申请号：US16209661

申请日：2018-12-04

Applicant: DeepMind Technologies Limited

Inventor： Aaron Gerard Antonius van den Oord , Sander Etienne Lea Dieleman , Nal Emmerich Kalchbrenner , Karen Simonyan , Oriol Vinyals , Lasse Espeholt

IPC: G10L15/16 , G06N3/04 , G10L15/02 , G10L15/22 , G06N3/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing speech recognition by generating a neural network output from an audio data input sequence, where the neural network output characterizes words spoken in the audio data input sequence. One of the methods includes, for each of the audio data inputs, providing a current audio data input sequence that comprises the audio data input and the audio data inputs preceding the audio data input in the audio data input sequence to a convolutional subnetwork comprising a plurality of dilated convolutional neural network layers, wherein the convolutional subnetwork is configured to, for each of the plurality of audio data inputs: receive the current audio data input sequence for the audio data input, and process the current audio data input sequence to generate an alternative representation for the audio data input.

17.

发明授权
Generating images using neural networks 有权

公开(公告)号：US10402700B2

公开(公告)日：2019-09-03

申请号：US15721089

申请日：2017-09-29

Applicant: DeepMind Technologies Limited

Inventor： Aaron Gerard Antonius van den Oord , Nal Emmerich Kalchbrenner , Karen Simonyan

IPC: G06K9/66 , H04N19/50 , H04N19/52 , G06N3/04 , G06N3/08 , G06K9/46 , G06K9/62 , H04N19/186 , H04N19/172 , H04N19/182

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating images using neural networks. One of the methods includes generating the output image pixel by pixel from a sequence of pixels taken from the output image, comprising, for each pixel in the output image, generating a respective score distribution over a discrete set of possible color values for each of the plurality of color channels.

18.

发明授权
Generating images using neural networks 有权

公开(公告)号：US12267518B2

公开(公告)日：2025-04-01

申请号：US18406837

申请日：2024-01-08

Applicant: DeepMind Technologies Limited

Inventor： Aaron Gerard Antonius van den Oord , Nal Emmerich Kalchbrenner , Karen Simonyan

IPC: H04N19/50 , G06F18/2113 , G06N3/04 , G06N3/044 , G06N3/045 , G06N3/08 , G06N3/084 , G06V10/56 , G06V30/194 , H04N19/172 , H04N19/182 , H04N19/186 , H04N19/52

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating images using neural networks. One of the methods includes generating the output image pixel by pixel from a sequence of pixels taken from the output image, comprising, for each pixel in the output image, generating a respective score distribution over a discrete set of possible color values for each of the plurality of color channels.

19.

发明公开
GENERATING IMAGES USING NEURAL NETWORKS 审中-公开

公开(公告)号：US20240146948A1

公开(公告)日：2024-05-02

申请号：US18406837

申请日：2024-01-08

Applicant: DeepMind Technologies Limited

Inventor： Aaron Gerard Antonius van den Oord , Nal Emmerich Kalchbrenner , Karen Simonyan

IPC: H04N19/50 , G06F18/2113 , G06N3/04 , G06N3/044 , G06N3/045 , G06N3/08 , G06N3/084 , G06V10/56 , G06V30/194 , H04N19/52

CPC classification number: H04N19/50 , G06F18/2113 , G06N3/04 , G06N3/044 , G06N3/045 , G06N3/08 , G06N3/084 , G06V10/56 , G06V30/194 , H04N19/52 , H04N19/186

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating images using neural networks. One of the methods includes generating the output image pixel by pixel from a sequence of pixels taken from the output image, comprising, for each pixel in the output image, generating a respective score distribution over a discrete set of possible color values for each of the plurality of color channels.

20.

发明授权
Iterative multiscale image generation using neural networks 有权

公开(公告)号：US11734797B2

公开(公告)日：2023-08-22

申请号：US17751359

申请日：2022-05-23

Applicant: DeepMind Technologies Limited

Inventor： Nal Emmerich Kalchbrenner , Daniel Belov , Sergio Gomez Colmenarejo , Aaron Gerard Antonius van den Oord , Ziyu Wang , Joao Ferdinando Gomes de Freitas , Scott Ellison Reed

IPC: G06T3/40 , G06N20/00 , G06N3/045

CPC classification number: G06T3/4046 , G06N3/045 , G06N20/00 , G06T3/4076

Abstract: A method of generating an output image having an output resolution of N pixels×N pixels, each pixel in the output image having a respective color value for each of a plurality of color channels, the method comprising: obtaining a low-resolution version of the output image; and upscaling the low-resolution version of the output image to generate the output image having the output resolution by repeatedly performing the following operations: obtaining a current version of the output image having a current K×K resolution; and processing the current version of the output image using a set of convolutional neural networks that are specific to the current resolution to generate an updated version of the output image having a 2K×2K resolution.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification