-
公开(公告)号:US20170140270A1
公开(公告)日:2017-05-18
申请号:US15349950
申请日:2016-11-11
Applicant: Google Inc.
Inventor: Volodymyr Mnih , Adrià Puigdomènech Badia , Alexander Benjamin Graves , Timothy James Alexander Harley , David Silver , Koray Kavukcuoglu
CPC classification number: G06N3/08 , G06N3/04 , G06N3/0454
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for asynchronous deep reinforcement learning. One of the systems includes a plurality of workers, wherein each worker is configured to operate independently of each other worker, and wherein each worker is associated with a respective actor that interacts with a respective replica of the environment during the training of the deep neural network.