摘要:
A call other than a conversion partner call and various sounds are detected by input audio signals from plural microphones without deteriorating a voice recognition precision. A hearing aid apparatus according to the present invention corrects a frequency characteristic of the call voice other than the conversation partner voice based on an arrival direction of the call voice other than the conversation partner voice, which is estimated based on the audio signal converted by the plural microphones, checks a call word standard pattern representing features of a phoneme and a syllabic sound based on other voice data picked up by using the microphones having one characteristic against a call voice other than the conversation partner voice in which the frequency characteristic is corrected by the frequency characteristic correction processing unit to determine whether the call voice is a call word, and forms a directivity in the direction other than the arrival direction of the voice of the conversation partner. Then, the hearing aid apparatus according to the present invention corrects the frequency characteristic of the call voice other than the conversation partner voice so as to provide the same characteristic as that of the microphones at the time of creating the audio standard pattern.
摘要:
Provided is a lifestyle collecting apparatus that collects information for determining a lifestyle of a user, and includes: an object information detecting unit configured to detect object information representing an object around the user; a relevance degree calculating unit configured to calculate a relevance degree of the user to the object, using the object information; an appearance information extracting unit configured to extract appearance information from the object information, and add the relevance degree to the extracted appearance information, the appearance information representing an appearance of the object; and a lifestyle database which stores the appearance information to which the relevance degree has been added, as the information for determining the lifestyle of the user.
摘要:
Provided is a viewing terminal apparatus that can present an appropriate result of statistics on viewing of a content for diversified viewing modes. The viewing terminal apparatus includes: a category determining unit that determines, as a viewer category, a relationship between viewers who are viewing a content displayed on a display; a transmitting unit that transmits, to the viewing statistics-gathering apparatus, first viewing status information indicating the content that is being viewed by the viewers and the viewer category determined by the category determining unit, the content being associated with the viewer category; and a viewing statistics presenting unit that obtains viewing statistics information from the viewing statistics-gathering apparatus, and presents a result of statistics that is (i) indicated by the obtained viewing statistics information and (ii) a result of statistics on viewing of a content only by viewers who belong to a predetermined viewer category.
摘要:
To classify moving images using audio signals. An audio signal is acquired, a section feature relating to an audio frequency distribution is extracted with respect to each of a plurality of sections each having a predetermined length contained in the acquired audio signal, each extracted section feature is compared with each of reference section features to calculate a section similarity indicating a degree of correlation between each section feature and each reference section feature. An integrated feature relating to the plurality of sections and being calculated based on the section similarity calculated with respect to each of the plurality of sections is extracted from the acquired audio signal. The extracted integrated feature is compared with each of one or more reference integrated features, and the audio signal is classified based on comparison result. Then, classification result is used for moving image classification.
摘要:
Provided is a viewing terminal apparatus that can present an appropriate result of statistics on viewing of a content for diversified viewing modes. The viewing terminal apparatus includes: a category determining unit that determines, as a viewer category, a relationship between viewers who are viewing a content displayed on a display; a transmitting unit that transmits, to the viewing statistics-gathering apparatus, first viewing status information indicating the content that is being viewed by the viewers and the viewer category determined by the category determining unit, the content being associated with the viewer category; and a viewing statistics presenting unit that obtains viewing statistics information from the viewing statistics-gathering apparatus, and presents a result of statistics that is (i) indicated by the obtained viewing statistics information and (ii) a result of statistics on viewing of a content only by viewers who belong to a predetermined viewer category.
摘要:
A program guidance apparatus includes a recognition word storage unit (105) operable to store a past recognition word that is recognized by speech recognition in the past, a viewing history word storage unit (106) operable to store viewing history words that are the information of a viewed program and a dictionary creating unit (103) operable to create a customized recognition dictionary that is created by adding the past recognition word and viewing history words that are not included in the basic recognition dictionary to the basic recognition dictionary and another customized recognition dictionary to which weights are assigned using “item weight coefficient” according to the categories of words and “history weight coefficient” according to whether or not the word is recorded as a past recognition word or viewing history words.
摘要:
A method and apparatus for speech recognition of the present application has a process to collate, with an input utterance, an acoustic model corresponding to a hypothesis to be expressed by the connection of utterance segments, such as phonemes or syllables, and developed according to a length of an input utterance by an inter-word connection rule thereby obtaining a recognition score. Within a word of the hypothesis, the similar hypotheses high in utterance score within a predetermined threshold from the maximum value of the score are all held to a word end irrespectively of the number of hypotheses. Meanwhile, at a word end of the hypotheses, the hypotheses are narrowed to a predetermined number of upper ranking in the order of higher score.