- Title: REINFORCEMENT LEARNING SYSTEM
- Application number: US16291400
- Filing date: 2019-03-04
- Publication number: US20200005130A1
- Publication date: 2020-01-02
- Inventors: Yoshifumi Nishi, Radu Berdan, Takao Marukame, Kumiko Nomura
- Applicant: Kabushiki Kaisha Toshiba
- Applicant address: JP Minato-ku
- Patent holder: Kabushiki Kaisha Toshiba
- Current patent holder: Kabushiki Kaisha Toshiba
- Current patent holder address: JP Minato-ku
- Priority: JP2018-125761 20180702
- Main classification: G06N3/063
- IPC classifications: G06N3/063; G06N3/04; G11C11/54; G11C13/00; G06N3/08
Abstract:
According to an embodiment, a reinforcement learning system includes: a memristor array in which each of a plurality of first direction lines corresponds to one of a plurality of states and each of a plurality of second direction lines corresponds to one of a plurality of actions; a first voltage application unit that individually applies voltage to the first direction lines; a second voltage application unit that individually applies voltage to the second direction lines; an action decision circuit that decides an action to be selected by an agent in a state corresponding to a first direction line to which a readout voltage is applied; an action storage unit that stores the action selected by the agent in each state that can be caused in an environment; and a trace storage unit that stores a time at which the state is caused by the action selected by the agent.
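The arrangement in the abstract can be pictured with a small software stand-in. This is only an illustrative sketch, not the patented circuit. The class name `MemristorArraySketch`, the conductance values, the readout voltage, and the rule that the decision circuit picks the column carrying the largest current are all assumptions made for the example; the abstract does not state how the action decision circuit chooses.

```python
import random


class MemristorArraySketch:
    """Hypothetical software model: rows are states, columns are actions."""

    def __init__(self, n_states, n_actions, seed=0):
        rng = random.Random(seed)
        # Assumed conductances in siemens; the abstract gives no device values.
        self.conductance = [
            [rng.uniform(1e-6, 1e-5) for _ in range(n_actions)]
            for _ in range(n_states)
        ]
        self.action_store = {}  # state -> action selected in that state
        self.trace_store = {}   # state -> time at which the state occurred

    def column_currents(self, state, v_read=0.1):
        # Ohm's law per cell on the row that receives the readout voltage.
        return [g * v_read for g in self.conductance[state]]

    def decide_action(self, state, v_read=0.1):
        # Assumption: select the column (action) with the largest current.
        currents = self.column_currents(state, v_read)
        return max(range(len(currents)), key=currents.__getitem__)

    def step(self, state, t):
        # Read out the row, then record the chosen action and the time.
        action = self.decide_action(state)
        self.action_store[state] = action
        self.trace_store[state] = t
        return action
```

Calling `step(state, t)` mimics one readout: the chosen action lands in `action_store` and the time in `trace_store`, mirroring the two storage units named in the abstract. How the stored actions and times are later used to update the memristors is not covered by this sketch.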
Publication/grant documents
- US11586897B2 Reinforcement learning system, publication/grant date: 2023-02-21