Information processing apparatus, and method
Abstract:
The present disclosure relates to an information processing apparatus, a method, and a program capable of causing a system to efficiently learn a method of controlling a person. A control learning system calculates a reward based on an input objective state of a control target and a state of the control target based on a sensing result of the control target. The control learning system performs reinforcement learning using the calculated reward and the state of the control target to select a better action for bringing the control target closer to the objective state. The control learning system executes the selected action for the control target. For example, the present disclosure can be applied to a control learning system including a terminal and a cloud system.
Public/Granted literature
Information query
Patent Agency Ranking
0/0