-
公开(公告)号:US12288547B2
公开(公告)日:2025-04-29
申请号:US17339834
申请日:2021-06-04
Applicant: DeepMind Technologies Limited
Inventor: Jeffrey Donahue , Karen Simonyan , Sander Etienne Lea Dieleman , Mikolaj Binkowski , Erich Konrad Elsen
IPC: G10L13/047 , G06N3/04 , G06N3/08
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using a generative neural network to convert conditioning text inputs to audio outputs. The generative neural network includes an alignment neural network that is configured to receive a generative input that includes the conditioning text input and to process the generative input to generate an aligned conditioning sequence that comprises a respective feature representation at each of a plurality of first time steps and that is temporally aligned with the audio output.
-
公开(公告)号:US20250103856A1
公开(公告)日:2025-03-27
申请号:US18832817
申请日:2023-01-30
Applicant: DeepMind Technologies Limited
Inventor: Joao Carreira , Andrew Coulter Jaegle , Skanda Kumar Koppula , Daniel Zoran , Adrià Recasens Continente , Catalin-Dumitru Ionescu , Olivier Jean Hénaff , Evan Gerard Shelhamer , Relja Arandjelovic , Matthew Botvinick , Oriol Vinyals , Karen Simonyan , Andrew Zisserman
IPC: G06N3/045
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for using a neural network to generate a network output that characterizes an entity. In one aspect, a method includes: obtaining a representation of the entity as a set of data element embeddings, obtaining a set of latent embeddings, and processing: (i) the set of data element embeddings, and (ii) the set of latent embeddings, using the neural network to generate the network output. The neural network includes a sequence of neural network blocks including: (i) one or more local cross-attention blocks, and (ii) an output block. Each local cross-attention block partitions the set of latent embeddings and the set of data element embeddings into proper subsets, and updates each proper subset of the set of latent embeddings using attention over only the corresponding proper subset of the set of data element embeddings.
-
公开(公告)号:US12147899B2
公开(公告)日:2024-11-19
申请号:US18528640
申请日:2023-12-04
Applicant: DeepMind Technologies Limited
Inventor: Karen Simonyan , David Silver , Julian Schrittwieser
Abstract: Methods, systems and apparatus, including computer programs encoded on computer storage media, for training an action selection neural network. One of the methods includes receiving an observation characterizing a current state of the environment; determining a target network output for the observation by performing a look ahead search of possible future states of the environment starting from the current state until the environment reaches a possible future state that satisfies one or more termination criteria, wherein the look ahead search is guided by the neural network in accordance with current values of the network parameters; selecting an action to be performed by the agent in response to the observation using the target network output generated by performing the look ahead search; and storing, in an exploration history data store, the target network output in association with the observation for use in updating the current values of the network parameters.
-
公开(公告)号:US20240185070A1
公开(公告)日:2024-06-06
申请号:US18528640
申请日:2023-12-04
Applicant: DeepMind Technologies Limited
Inventor: Karen Simonyan , David Silver , Julian Schrittwieser
Abstract: Methods, systems and apparatus, including computer programs encoded on computer storage media, for training an action selection neural network. One of the methods includes receiving an observation characterizing a current state of the environment; determining a target network output for the observation by performing a look ahead search of possible future states of the environment starting from the current state until the environment reaches a possible future state that satisfies one or more termination criteria, wherein the look ahead search is guided by the neural network in accordance with current values of the network parameters; selecting an action to be performed by the agent in response to the observation using the target network output generated by performing the look ahead search; and storing, in an exploration history data store, the target network output in association with the observation for use in updating the current values of the network parameters.
-
公开(公告)号:US20240127586A1
公开(公告)日:2024-04-18
申请号:US18275087
申请日:2022-02-02
Applicant: DeepMind Technologies Limited
Inventor: Andrew Brock , Soham De , Samuel Laurence Smith , Karen Simonyan
IPC: G06V10/82 , G06V10/776
CPC classification number: G06V10/82 , G06V10/776
Abstract: There is disclosed a computer-implemented method for training a neural network. The method comprises determining a gradient associated with a parameter of the neural network. The method further comprises determining a ratio of a gradient norm to parameter norm and comparing the ratio to a threshold. In response to determining that the ratio exceeds the threshold, the value of the gradient is reduced such that the ratio is equal to or below the threshold. The value of the parameter is updated based upon the reduced gradient value.
-
公开(公告)号:US11870947B2
公开(公告)日:2024-01-09
申请号:US17959132
申请日:2022-10-03
Applicant: DeepMind Technologies Limited
IPC: H04N19/50 , H04N19/52 , H04N19/56 , G06N3/04 , G06N3/08 , G06F18/21 , G06V10/56 , G06V30/19 , G06N3/084 , G06F18/2113 , G06N3/044 , G06N3/045 , G06V30/194 , H04N19/186 , H04N19/172 , H04N19/182
CPC classification number: H04N19/50 , G06F18/2113 , G06N3/04 , G06N3/044 , G06N3/045 , G06N3/08 , G06N3/084 , G06V10/56 , G06V30/194 , H04N19/52 , H04N19/172 , H04N19/182 , H04N19/186
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating images using neural networks. One of the methods includes generating the output image pixel by pixel from a sequence of pixels taken from the output image, comprising, for each pixel in the output image, generating a respective score distribution over a discrete set of possible color values for each of the plurality of color channels.
-
公开(公告)号:US11853861B2
公开(公告)日:2023-12-26
申请号:US17962881
申请日:2022-10-10
Applicant: DeepMind Technologies Limited
Inventor: Nal Emmerich Kalchbrenner , Karen Simonyan , Erich Konrad Elsen
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating output examples using neural networks. One of the methods includes receiving a request to generate an output example of a particular type, accessing dependency data, and generating the output example by, at each of a plurality of generation time steps: identifying one or more current blocks for the generation time step, wherein each current block is a block for which the values of the bits in all of the other blocks identified in the dependency for the block have already been generated; and generating the values of the bits in the current blocks for the generation time step conditioned on, for each current block, the already generated values of the bits in the other blocks identified in the dependency for the current block.
-
公开(公告)号:US20230084700A1
公开(公告)日:2023-03-16
申请号:US17948016
申请日:2022-09-19
Applicant: DeepMind Technologies Limited
Inventor: Karen Simonyan , David Silver , Julian Schrittwieser
Abstract: Methods, systems and apparatus, including computer programs encoded on computer storage media, for training an action selection neural network. One of the methods includes receiving an observation characterizing a current state of the environment; determining a target network output for the observation by performing a look ahead search of possible future states of the environment starting from the current state until the environment reaches a possible future state that satisfies one or more termination criteria, wherein the look ahead search is guided by the neural network in accordance with current values of the network parameters; selecting an action to be performed by the agent in response to the observation using the target network output generated by performing the look ahead search; and storing, in an exploration history data store, the target network output in association with the observation for use in updating the current values of the network parameters.
-
公开(公告)号:US11462034B2
公开(公告)日:2022-10-04
申请号:US17198096
申请日:2021-03-10
Applicant: DeepMind Technologies Limited
IPC: G06V30/194 , G06N3/04 , G06N3/08 , H04N19/52 , H04N19/50 , G06V10/56 , H04N19/186 , H04N19/172 , H04N19/182 , G06K9/62
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating images using neural networks. One of the methods includes generating the output image pixel by pixel from a sequence of pixels taken from the output image, comprising, for each pixel in the output image, generating a respective score distribution over a discrete set of possible color values for each of the plurality of color channels.
-
公开(公告)号:US20210383789A1
公开(公告)日:2021-12-09
申请号:US17339834
申请日:2021-06-04
Applicant: DeepMind Technologies Limited
Inventor: Jeffrey Donahue , Karen Simonyan , Sander Etienne Lea Dieleman , Mikolaj Binkowski , Erich Konrad Elsen
IPC: G10L13/047 , G06N3/08 , G06N3/04
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using a generative neural network to convert conditioning text inputs to audio outputs. The generative neural network includes an alignment neural network that is configured to receive a generative input that includes the conditioning text input and to process the generative input to generate an aligned conditioning sequence that comprises a respective feature representation at each of a plurality of first time steps and that is temporally aligned with the audio output.
-
-
-
-
-
-
-
-
-