摘要:
An image capturing device is equipped with an external parameter estimating unit for estimating external parameters using a distance image obtained by a TOF camera and a luminance image obtained by a CCD camera, a corresponding pixel determining unit for determining a correspondence relationship between pixel positions in the distance image and pixel positions in the luminance image, using previously stored internal parameters of the TOF camera and the CCD camera or the CCD camera, and the external parameters, and an occlusion searching unit for searching for an occlusion region in the distance image, using the correspondence relationship between pixel positions in the distance image and pixel positions in the luminance image.
摘要:
A command recognition device includes: an utterance understanding unit that determines or selects word sequence information from speech information; speech confidence degree calculating unit that calculates degree of speech confidence based on the speech information and the word sequence information; a phrase confidence degree calculating unit that calculates a degree of phrase confidence based on image information and phrase information included in the word sequence information; and a motion control instructing unit that determines whether a command of the word sequence information should be executed based on the degree of speech confidence and the degree of phrase confidence.
摘要:
A speech understanding apparatus includes a speech recognition unit which performs speech recognition of an utterance using multiple language models, and outputs multiple speech recognition results obtained by the speech recognition, a language understanding unit which uses multiple language understanding models to perform language understanding for each of the multiple speech recognition results output from the speech recognition unit, and outputs multiple speech understanding results obtained from the language understanding, and an integrating unit which calculates, based on values representing features of the speech understanding results, utterance batch confidences that numerically express accuracy of the speech understanding results for each of the multiple speech understanding results output from the language understanding unit, and selects one of the speech understanding results with a highest utterance batch confidence among the calculated utterance batch confidences.
摘要:
A question and answer database expansion apparatus includes: a question and answer database in which questions and answers corresponding to the questions are registered in association with each other, a first speech recognition unit which carries out speech recognition for an input sound signal by using a language model based on the question and answer database, and outputs a first speech recognition result as the recognition result, a second speech recognition unit which carries out speech recognition for the input sound signal by using a language model based on a large vocabulary database, and outputs a second speech recognition result as the recognition result, and a question detection unit which detects an unregistered utterance, which is not registered in the question and answer database, from the input sound based on the first speech recognition result and the second speech recognition result, and outputs the detected unregistered utterance.
摘要:
A referring expression processor which uses a probabilistic model and in which referring expressions including descriptive, anaphoric and deictic expressions are understood and generated in the course of dialogue is provided. The referring expression processor according to the present invention includes: a referring expression processing section which performs at least one of understanding and generation of referring expressions using a probabilistic model constructed with a referring expression Bayesian network, each referring expression Bayesian network representing relationships between a reference domain (D) which is a set of possible referents, a referent (X) in the reference domain, a concept (C) concerning the referent and a word (W) which represents the concept; and a memory which stores data necessary for constructing the referring expression Bayesian network.
摘要:
A speech recognition apparatus includes a speech input unit that receives input speech, a phoneme recognition unit that recognizes phonemes of the input speech and generates a first phoneme sequence representing corrected speech, a matching unit that matches the first phoneme sequence with a second phoneme sequence representing original speech, and a phoneme correcting unit that corrects phonemes of the second phoneme sequence based on the matching result.
摘要:
A language processor according to the present invention includes a probability calculating section (103) having a probabilistic model by which a probability that a set of partial referring expression pairs represents an object, each pair consisting of an attribute value expression and a part expression modified by the attribute value expression, the set of referring expression pairs including a pair concerning the whole object, is represented by a product of probabilities each of which is a probability that each partial referring expression pair represents the object. The probability that each partial referring expression pair represents the object is obtained using a probability that the object is chosen, a probability that represents the salience of a part of the object, and a probability representing relevance between the part of the object and an attribute value, a probability that the part expression is used for a part of the object, and a probability that the attribute value expression is used for an attribute value. It further includes at least one memory section (101) for storing values of probabilities used for the probabilistic model. The probability calculating section calculates the probability that the set of partial referring expression pairs represents the object using the values of probabilities stored in the at least one memory section and the probabilistic model.