REINFORCEMENT LEARNING SYSTEM

发明申请

US20200005130A1 REINFORCEMENT LEARNING SYSTEM 审中-公开

请登陆查看更多内容

专利标题： REINFORCEMENT LEARNING SYSTEM
申请号： US16291400

申请日： 2019-03-04
公开(公告)号： US20200005130A1

公开(公告)日： 2020-01-02
发明人: Yoshifumi Nishi , Radu Berdan , Takao Marukame , Kumiko Nomura
申请人： Kabushiki Kaisha Toshiba
申请人地址： JP Minato-ku
专利权人： Kabushiki Kaisha Toshiba
当前专利权人： Kabushiki Kaisha Toshiba
当前专利权人地址： JP Minato-ku
优先权： JP2018-125761 20180702
主分类号： G06N3/063
IPC分类号： G06N3/063 ; G06N3/04 ; G11C11/54 ; G11C13/00 ; G06N3/08

摘要：

According to an embodiment, a reinforcement learning system includes a memristor array in which each of a plurality of first direction lines corresponds to one of a plurality of states, and each of a plurality of second direction lines corresponds to one of a plurality of actions, a first voltage application unit that individually applies voltage to the first direction lines, a second voltage application unit that individually applies voltage to the second direction lines, a action decision circuit that decides action to be selected by an agent in a state corresponding to a first direction line to which a readout voltage is applied, a action storage unit that stores action selected by the agent in each state that can be caused in an environment, and a trace storage unit that stores a time at which the state is caused by action selected by the agent.

公开/授权文献

US11586897B2 Reinforcement learning system 公开/授权日：2023-02-21

信息查询

Global Dossier Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/06	..物理实现，即神经网络、神经元或神经元部分的硬件实现
G06N3/063	...采用电的