摘要:
A plural number of letters or characters, inferred from the results of letter/character recognition of an image photographed by a CCD camera (20), a plural number of kana readings inferred from the letters or characters and the way of pronunciation corresponding to the kana readings are generated in an pronunciation information generating unit (150) and the plural readings obtained are matched to the pronunciation from the user acquired by a microphone (23) to specify one kana reading and the way of pronunciation (reading) from among the plural generated candidates.
摘要:
An information processing apparatus includes: model learning means for self-organizing, on the basis of a state transition model having a state and state transition to be learned by using time series data as data in time series, an internal state from an observation signal obtained by a sensor; and controller learning means for performing learning for allocating a controller, which outputs an action, to each of transitions of a state or each of transition destination states in the state transition model indicating the internal state self-organized by the model learning means.
摘要:
An information processing apparatus includes: model learning means for self-organizing, on the basis of a state transition model having a state and state transition to be learned by using time series data as data in time series, an internal state from an observation signal obtained by a sensor; and controller learning means for performing learning for allocating a controller, which outputs an action, to each of transitions of a state or each of transition destination states in the state transition model indicating the internal state self-organized by the model learning means.
摘要:
An apparatus, method and program for performing a speech recognition process utilizing contextual information that comprises an estimation of the intention of an utterance of a user. The recognition process includes calculating a pre-score based on observed contextual information according intention models which correspond to a plurality of types of intention information and combining the pre-scoring results with acoustic and linguistic scores to obtain an improved recognition or comprehension of the intent of a user utterance.
摘要:
In a conventional voice dialogue system, there is a case where it is difficult to perform a natural dialogue with the user. Therefore, we designed to perform speech recognition on the user's utterance, to control a dialogue with the user according to a scenario previously given, based on the speech recognition result to generate an answering sentence corresponding to the contents of the user's utterance as the occasion demands, and to perform voice synthesis processing to one sentence in the reproduced scenario or the generated answering sentence.
摘要:
An information processing apparatus includes a storage unit configured to store a node holding dynamics; an input-weight-coefficient adjuster configured to adjust input-weight coefficients on a dimension-by-dimension basis, the input-weight coefficients being weight coefficients for individual dimensions of input data input to input units of the node, the input data being observed time-series data having a plurality of dimensions; and an output-weight-coefficient adjuster configured to adjust output-weight coefficients on a dimension-by-dimension basis, the output-weight coefficients being weight coefficients for individual dimensions of output data having a plurality of dimensions and output from output units of the node.
摘要:
An information processing apparatus includes a storage unit configured to store a node holding dynamics; an input-weight-coefficient adjuster configured to adjust input-weight coefficients on a dimension-by-dimension basis, the input-weight coefficients being weight coefficients for individual dimensions of input data input to input units of the node, the input data being observed time-series data having a plurality of dimensions; and an output-weight-coefficient adjuster configured to adjust output-weight coefficients on a dimension-by-dimension basis, the output-weight coefficients being weight coefficients for individual dimensions of output data having a plurality of dimensions and output from output units of the node.
摘要:
There is proposed a method that may be universally used for controlling a man-machine interface unit. A learning sample is used in order at least to derive and/or initialize a target action (t) to be carried out and to lead the user from an optional current status (ec) to an optional desired target status (et) as the final status (ef). This learning sample (l) is formed by a data triple made up by an initial status (ei) before an optional action (a) carried out by the user, a final status (ef) after the action taken place (a).
摘要:
A learning system is provided, which includes network storage means for storing a network including a plurality of nodes, each of which holds a dynamics; and learning means for self-organizationally updating the dynamics of the network on the basis of measured time-series data.
摘要:
A learning apparatus includes a storage unit configured to store a network formed by a plurality of nodes each holding dynamics; a learning unit configured to learn the dynamics of the network in a self-organizing manner on the basis of observed time-series data; a winner-node determiner configured to determine a winner node, the winner node being a node having dynamics that best match the time-series data; and a weight determiner configured to determine learning weights for the dynamics held by the individual nodes according to distances of the individual nodes from the winner node. The learning unit is configured to learn the dynamics of the network in a self-organizing manner by degrees corresponding to the learning weights.