Abstract:
An information processing apparatus includes a network learning portion that performs learning of an appearance/position recognition network by constraining first to third weights and using a learning image, wherein the appearance/position recognition network has a foreground layer including a position node, a background layer including a background node, and an image layer including a pixel node, and is a neural network in which the position node, the background node, and the pixel node are connected to each other, and wherein the first weight is a connection weight between the position node and the pixel node, the second weight is a connection weight between the position node and the background node, and the third weight is a connection weight between the background node and the pixel node.