-
公开(公告)号:US10824946B2
公开(公告)日:2020-11-03
申请号:US16511496
申请日:2019-07-15
Applicant: DeepMind Technologies Limited
Inventor: Meire Fortunato , Charles Blundell , Oriol Vinyals
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network. In one aspect, a method includes maintaining data specifying, for each of the network parameters, current values of a respective set of distribution parameters that define a posterior distribution over possible values for the network parameter. A respective current training value for each of the network parameters is determined from a respective temporary gradient value for the network parameter. The current values of the respective sets of distribution parameters for the network parameters are updated in accordance with the respective current training values for the network parameters. The trained values of the network parameters are determined based on the updated current values of the respective sets of distribution parameters.
-
公开(公告)号:US10810993B2
公开(公告)日:2020-10-20
申请号:US16666043
申请日:2019-10-28
Applicant: DeepMind Technologies Limited
Inventor: Yutian Chen , Scott Ellison Reed , Aaron Gerard Antonius van den Oord , Oriol Vinyals , Heiga Zen , Ioannis Alexandros Assael , Brendan Shillingford , Joao Ferdinando Gomes de Freitas
IPC: G10L13/047 , G06N3/08 , G10L13/033 , G10L13/04
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating an adaptive audio-generation model. One of the methods includes generating an adaptive audio-generation model including learning a plurality of embedding vectors and parameter values of a neural network using training data comprising first text and audio data representing a plurality of different individual speakers speaking portions of the first text, wherein the plurality of embedding vectors represent respective voice characteristics of the plurality of different individual speakers. The adaptive audio-generation model is adapted for a new individual speaker using adaptation data comprising second text and audio data representing the new individual speaker speaking portions of the second text, the new individual speaker being different from each of the plurality of individual speakers, wherein adapting the audio-generation model includes learning a new embedding vector for the new individual speaker.
-
公开(公告)号:US10586531B2
公开(公告)日:2020-03-10
申请号:US16209661
申请日:2018-12-04
Applicant: DeepMind Technologies Limited
Inventor: Aaron Gerard Antonius van den Oord , Sander Etienne Lea Dieleman , Nal Emmerich Kalchbrenner , Karen Simonyan , Oriol Vinyals , Lasse Espeholt
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing speech recognition by generating a neural network output from an audio data input sequence, where the neural network output characterizes words spoken in the audio data input sequence. One of the methods includes, for each of the audio data inputs, providing a current audio data input sequence that comprises the audio data input and the audio data inputs preceding the audio data input in the audio data input sequence to a convolutional subnetwork comprising a plurality of dilated convolutional neural network layers, wherein the convolutional subnetwork is configured to, for each of the plurality of audio data inputs: receive the current audio data input sequence for the audio data input, and process the current audio data input sequence to generate an alternative representation for the audio data input.
-
公开(公告)号:US20240354566A1
公开(公告)日:2024-10-24
申请号:US18623952
申请日:2024-04-01
Applicant: DeepMind Technologies Limited
Inventor: Koray Kavukcuoglu , Aaron Gerard Antonius van den Oord , Oriol Vinyals
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating discrete latent representations of input data items. One of the methods includes receiving an input data item; providing the input data item as input to an encoder neural network to obtain an encoder output for the input data item; and generating a discrete latent representation of the input data item from the encoder output, comprising: for each of the latent variables, determining, from a set of latent embedding vectors in the memory, a latent embedding vector that is nearest to the encoded vector for the latent variable.
-
公开(公告)号:US20240282094A1
公开(公告)日:2024-08-22
申请号:US18568561
申请日:2022-06-08
Applicant: DeepMind Technologies Limited
Inventor: Maria Rafailia Tsimpoukelli , Jacob Lee Menick , Serkan Cabi , Felix George Hill , Seyed Mohammadali Eslami , Oriol Vinyals
IPC: G06V10/82 , G06F40/284 , G06V20/70
CPC classification number: G06V10/82 , G06F40/284 , G06V20/70
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing multi-modal inputs using language models. In particular, the inputs include an image, and the image is encoded by an image encoder neural network to generate a sequence of image embeddings representing the image. The sequence of image embeddings is provided as at least part of an input sequence to that is processed by a language model neural network.
-
公开(公告)号:US11948075B2
公开(公告)日:2024-04-02
申请号:US16620815
申请日:2018-06-11
Applicant: DEEPMIND TECHNOLOGIES LIMITED
Inventor: Koray Kavukcuoglu , Aaron Gerard Antonius van den Oord , Oriol Vinyals
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating discrete latent representations of input data items. One of the methods includes receiving an input data item; providing the input data item as input to an encoder neural network to obtain an encoder output for the input data item; and generating a discrete latent representation of the input data item from the encoder output, comprising: for each of the latent variables, determining, from a set of latent embedding vectors in the memory, a latent embedding vector that is nearest to the encoded vector for the latent variable.
-
公开(公告)号:US11934935B2
公开(公告)日:2024-03-19
申请号:US15985463
申请日:2018-05-21
Applicant: DeepMind Technologies Limited
Inventor: Aaron Gerard Antonius van den Oord , Karen Simonyan , Oriol Vinyals
Abstract: A feedforward generative neural network that generates an output example that includes multiple output samples of a particular type in a single neural network inference. Optionally, the generation may be conditioned on a context input. For example, the feedforward generative neural network may generate a speech waveform that is a verbalization of an input text segment conditioned on linguistic features of the text segment.
-
公开(公告)号:US11836630B2
公开(公告)日:2023-12-05
申请号:US17024217
申请日:2020-09-17
Applicant: DeepMind Technologies Limited
Inventor: Meire Fortunato , Charles Blundell , Oriol Vinyals
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network. In one aspect, a method includes maintaining data specifying, for each of the network parameters, current values of a respective set of distribution parameters that define a posterior distribution over possible values for the network parameter. A respective current training value for each of the network parameters is determined from a respective temporary gradient value for the network parameter. The current values of the respective sets of distribution parameters for the network parameters are updated in accordance with the respective current training values for the network parameters. The trained values of the network parameters are determined based on the updated current values of the respective sets of distribution parameters.
-
公开(公告)号:US20230244452A1
公开(公告)日:2023-08-03
申请号:US18105211
申请日:2023-02-02
Applicant: DeepMind Technologies Limited
Inventor: Yujia Li , David Hugo Choi , Junyoung Chung , Nathaniel Arthur Kushman , Julian Schrittwieser , Rémi Leblond , Thomas Edward Eccles , James Thomas Keeling , Felix Axel Gimeno Gil , Agustín Matías Dal Lago , Thomas Keisuke Hubert , Peter Choy , Cyprien de Masson d'Autume , Esme Sutherland Robson , Oriol Vinyals
IPC: G06F8/30
CPC classification number: G06F8/30
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating computer code using neural networks. One of the methods includes receiving description data describing a computer programming task; receiving a first set of inputs for the computer programming task; generating a plurality of candidate computer programs by sampling a plurality of output sequences from a set of one or more generative neural networks; for each candidate computer program in a subset of the candidate computer programs and for each input in the first set: executing the candidate computer program on the input to generate an output; and selecting, from the candidate computer programs, one or more computer programs as synthesized computer programs for performing the computer programming task based at least in part on the outputs generated by executing the candidate computer programs in the subset on the inputs in the first set of inputs.
-
公开(公告)号:US11704541B2
公开(公告)日:2023-07-18
申请号:US16759525
申请日:2018-10-29
Applicant: DeepMind Technologies Limited
Inventor: Yujia Li , Christopher James Dyer , Oriol Vinyals
CPC classification number: G06N3/047 , G06F16/9024 , G06F17/18 , G06N3/045 , G06N3/08
Abstract: There is described a neural network system for generating a graph, the graph comprising a set of nodes and edges. The system comprises one or more neural networks configured to represent a probability distribution over sequences of node generating decisions and/or edge generating decisions, and one or more computers configured to sample the probability distribution represented by the one or more neural networks to generate a graph.
-
-
-
-
-
-
-
-
-