-
Publication No.: US20210089908A1
Publication date: 2021-03-25
Application No.: US17032562
Filing date: 2020-09-25
Applicant: DeepMind Technologies Limited
Inventor: Tom Schaul, Diana Luiza Borsa, Fengning Ding, David Szepesvari, Georg Ostrovski, Simon Osindero, William Clinton Dabney
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling an agent. One of the methods includes sampling a behavior modulation in accordance with a current probability distribution; for each of one or more time steps: processing an input comprising an observation characterizing a current state of the environment at the time step using an action selection neural network to generate a respective action score for each action in a set of possible actions that can be performed by the agent; modifying the action scores using the sampled behavior modulation; and selecting the action to be performed by the agent at the time step based on the modified action scores; determining a fitness measure corresponding to the sampled behavior modulation; and updating the current probability distribution over the set of possible behavior modulations using the fitness measure corresponding to the behavior modulation.
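The loop described in this abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the behavior modulation is taken to be a softmax temperature, the fitness measure is the mean reward of an episode, the distribution update is a gradient-bandit style rule, and `toy_action_scores` and `toy_reward` are hypothetical stand-ins for the action selection neural network and the environment.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def select_action(scores, temperature, rng):
    """Modify the action scores with the sampled modulation, then sample an action."""
    # Assumed modulation: divide the scores by a temperature.
    modified = [s / temperature for s in scores]
    probs = softmax(modified)
    return rng.choices(range(len(scores)), weights=probs, k=1)[0]


def update_distribution(preferences, index, fitness, step_size=0.1):
    """Gradient-bandit style update of the preferences (an assumed update rule)."""
    probs = softmax(preferences)
    return [
        p + step_size * fitness * ((1.0 if i == index else 0.0) - probs[i])
        for i, p in enumerate(preferences)
    ]


def toy_action_scores(observation):
    """Hypothetical stand-in for the action selection neural network."""
    return [0.0, 1.0, 0.5]


def toy_reward(action):
    """Hypothetical environment: only action 1 is rewarded."""
    return 1.0 if action == 1 else 0.0


def run(num_episodes=200, steps_per_episode=10, seed=0):
    rng = random.Random(seed)
    modulations = [0.1, 1.0, 10.0]          # candidate temperatures
    preferences = [0.0] * len(modulations)  # parameters of the current distribution
    for _ in range(num_episodes):
        # Sample a behavior modulation from the current distribution.
        dist = softmax(preferences)
        idx = rng.choices(range(len(modulations)), weights=dist, k=1)[0]
        episode_return = 0.0
        for _ in range(steps_per_episode):
            scores = toy_action_scores(observation=None)
            action = select_action(scores, modulations[idx], rng)
            episode_return += toy_reward(action)
        # Fitness of the sampled modulation, then the distribution update.
        fitness = episode_return / steps_per_episode
        preferences = update_distribution(preferences, idx, fitness)
    return softmax(preferences)


if __name__ == "__main__":
    print(run())
```

Calling `run()` returns the final probability distribution over the three candidate temperatures. In a real system the scores would come from a trained network and the fitness measure could be defined differently.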
-
Publication No.: US12061964B2
Publication date: 2024-08-13
Application No.: US17032562
Filing date: 2020-09-25
Applicant: DeepMind Technologies Limited
Inventor: Tom Schaul, Diana Luiza Borsa, Fengning Ding, David Szepesvari, Georg Ostrovski, Simon Osindero, William Clinton Dabney
IPC: G06N3/006, G06F18/214, G06F18/2415, G06N3/08, G06V10/764, G06V10/82, G06V40/20
CPC classification number: G06N3/006, G06F18/2148, G06F18/2415, G06N3/08, G06V10/764, G06V10/82, G06V40/20
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling an agent. One of the methods includes sampling a behavior modulation in accordance with a current probability distribution; for each of one or more time steps: processing an input comprising an observation characterizing a current state of the environment at the time step using an action selection neural network to generate a respective action score for each action in a set of possible actions that can be performed by the agent; modifying the action scores using the sampled behavior modulation; and selecting the action to be performed by the agent at the time step based on the modified action scores; determining a fitness measure corresponding to the sampled behavior modulation; and updating the current probability distribution over the set of possible behavior modulations using the fitness measure corresponding to the behavior modulation.
-
3.
Publication No.: US20240232580A1
Publication date: 2024-07-11
Application No.: US18284595
Filing date: 2022-05-27
Applicant: DEEPMIND TECHNOLOGIES LIMITED
Inventor: Andrew Coulter Jaegle, Jean-Baptiste Alayrac, Sebastian Borgeaud Dit Avocat, Catalin-Dumitru Ionescu, Carl Doersch, Fengning Ding, Oriol Vinyals, Olivier Jean Hénaff, Skanda Kumar Koppula, Daniel Zoran, Andrew Brock, Evan Gerard Shelhamer, Andrew Zisserman, Joao Carreira
IPC: G06N3/0455
CPC classification number: G06N3/0455
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a network output using a neural network. In one aspect, a method comprises: obtaining: (i) a network input to a neural network, and (ii) a set of query embeddings; processing the network input using the neural network to generate a network output that comprises a respective dimension corresponding to each query embedding in the set of query embeddings, comprising: processing the network input using an encoder block of the neural network to generate a representation of the network input as a set of latent embeddings; and processing: (i) the set of latent embeddings, and (ii) the set of query embeddings, using a cross-attention block that generates each dimension of the network output by cross-attention of a corresponding query embedding over the set of latent embeddings.
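The decoding step in this abstract, in which each query embedding attends over the set of latent embeddings to produce one element of the output, can be sketched as follows. This is a simplified illustration under stated assumptions, not the claimed architecture: attention is single-head, the learned query, key and value projections are replaced by the identity, and the latent embeddings are passed in directly rather than produced by an encoder block. The function name `cross_attend` is hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def cross_attend(queries, latents):
    """Return one output vector per query embedding.

    Each query attends over all latent embeddings using scaled dot-product
    attention; keys and values are the latents themselves (identity
    projections, an assumption made to keep the sketch short).
    """
    dim = len(latents[0])
    scale = 1.0 / math.sqrt(dim)
    outputs = []
    for q in queries:
        weights = softmax([dot(q, k) * scale for k in latents])
        outputs.append([
            sum(w * v[j] for w, v in zip(weights, latents))
            for j in range(dim)
        ])
    return outputs


if __name__ == "__main__":
    latents = [[1.0, 0.0], [0.0, 1.0]]   # stand-in for the encoder output
    queries = [[0.0, 0.0], [4.0, 0.0], [0.0, 4.0]]
    for out in cross_attend(queries, latents):
        print(out)
```

The number of output vectors always equals the number of query embeddings, independent of how many latent embeddings there are, which is the property the abstract relies on to shape the network output.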
-
4.
Publication No.: US20240020972A1
Publication date: 2024-01-18
Application No.: US18029980
Filing date: 2021-10-01
Applicant: DeepMind Technologies Limited
Inventor: Fengning Ding, Adam Anthony Santoro, Felix George Hill, Matthew Botvinick, Luis Piloto
IPC: G06V20/40, G06V10/26, G06V10/82, G06V10/776
CPC classification number: G06V20/41, G06V10/26, G06V10/82, G06V10/776
Abstract: A video processing system configured to analyze a sequence of video frames to detect objects in the video frames and provide information relating to the detected objects in response to a query. The query may comprise, for example, a request for a prediction of a future event, or of the location of an object, or a request for a prediction of what would happen if an object were modified. The system uses a transformer neural network subsystem to process representations of objects in the video.
-
Publication No.: US20230401835A1
Publication date: 2023-12-14
Application No.: US18199896
Filing date: 2023-05-19
Applicant: DeepMind Technologies Limited
Inventor: Aaditya K. Singh, Fengning Ding, Felix George Hill, Andrew Kyle Lampinen
CPC classification number: G06V10/82, G06V20/635
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a speaker neural network using one or more listener neural networks.
-