摘要:
A call other than a conversion partner call and various sounds are detected by input audio signals from plural microphones without deteriorating a voice recognition precision. A hearing aid apparatus according to the present invention corrects a frequency characteristic of the call voice other than the conversation partner voice based on an arrival direction of the call voice other than the conversation partner voice, which is estimated based on the audio signal converted by the plural microphones, checks a call word standard pattern representing features of a phoneme and a syllabic sound based on other voice data picked up by using the microphones having one characteristic against a call voice other than the conversation partner voice in which the frequency characteristic is corrected by the frequency characteristic correction processing unit to determine whether the call voice is a call word, and forms a directivity in the direction other than the arrival direction of the voice of the conversation partner. Then, the hearing aid apparatus according to the present invention corrects the frequency characteristic of the call voice other than the conversation partner voice so as to provide the same characteristic as that of the microphones at the time of creating the audio standard pattern.
摘要:
A call other than a conversion partner call and various sounds are detected by input audio signals from plural microphones without deteriorating a voice recognition precision. A hearing aid apparatus according to the present invention corrects a frequency characteristic of the call voice other than the conversation partner voice based on an arrival direction of the call voice other than the conversation partner voice, which is estimated based on the audio signal converted by the plural microphones, checks a call word standard pattern representing features of a phoneme and a syllabic sound based on other voice data picked up by using the microphones having one characteristic against a call voice other than the conversation partner voice in which the frequency characteristic is corrected by the frequency characteristic correction processing unit to determine whether the call voice is a call word, and forms a directivity in the direction other than the arrival direction of the voice of the conversation partner. Then, the hearing aid apparatus according to the present invention corrects the frequency characteristic of the call voice other than the conversation partner voice so as to provide the same characteristic as that of the microphones at the time of creating the audio standard pattern.
摘要:
The present invention aims at extracting a keyword of conversation without preparations by advanced anticipation of keywords of conversation. A keyword extracting device of the present invention includes an audio input section 101 by way of which a speech sound made by a speaker is input; a speech segment determination section 102 that determines a speech segment for each speaker in connection with the input speech sound; a speech recognition section 103 that recognizes a speech sound of the determined speech segment for each speaker; an interrupt detection section 104 that detects a feature of a speech response suggesting presence of a keyword on the basis of a response of another speaker to speech sounds of respective speakers; namely, an interrupt where a preceding speech and a subsequent speech overlap; a keyword extraction section 105 that extracts the keyword from the speech in the speech segment specified on the basis of an interrupt; a keyword search section 106 that performs keyword search by means of the keyword; and a display section 107 that displays a result of keyword search.
摘要:
The present invention aims at extracting a keyword of conversation without preparations by advanced anticipation of keywords of conversation. A keyword extracting device of the present invention includes an audio input section 101 by way of which a speech sound made by a speaker is input; a speech segment determination section 102 that determines a speech segment for each speaker in connection with the input speech sound; a speech recognition section 103 that recognizes a speech sound of the determined speech segment for each speaker; an interrupt detection section 104 that detects a feature of a speech response suggesting presence of a keyword on the basis of a response of another speaker to speech sounds of respective speakers; namely, an interrupt where a preceding speech and a subsequent speech overlap; a keyword extraction section 105 that extracts the keyword from the speech in the speech segment specified on the basis of an interrupt; a keyword search section 106 that performs keyword search by means of the keyword; and a display section 107 that displays a result of keyword search.
摘要:
Provided is a lifestyle collecting apparatus that collects information for determining a lifestyle of a user, and includes: an object information detecting unit configured to detect object information representing an object around the user; a relevance degree calculating unit configured to calculate a relevance degree of the user to the object, using the object information; an appearance information extracting unit configured to extract appearance information from the object information, and add the relevance degree to the extracted appearance information, the appearance information representing an appearance of the object; and a lifestyle database which stores the appearance information to which the relevance degree has been added, as the information for determining the lifestyle of the user.
摘要:
Provided is a lifestyle collecting apparatus that collects information for determining a lifestyle of a user, and includes: an object information detecting unit configured to detect object information representing an object around the user; a relevance degree calculating unit configured to calculate a relevance degree of the user to the object, using the object information; an appearance information extracting unit configured to extract appearance information from the object information, and add the relevance degree to the extracted appearance information, the appearance information representing an appearance of the object; and a lifestyle database which stores the appearance information to which the relevance degree has been added, as the information for determining the lifestyle of the user.
摘要:
Provided is a viewing terminal apparatus that can present an appropriate result of statistics on viewing of a content for diversified viewing modes. The viewing terminal apparatus includes: a category determining unit that determines, as a viewer category, a relationship between viewers who are viewing a content displayed on a display; a transmitting unit that transmits, to the viewing statistics-gathering apparatus, first viewing status information indicating the content that is being viewed by the viewers and the viewer category determined by the category determining unit, the content being associated with the viewer category; and a viewing statistics presenting unit that obtains viewing statistics information from the viewing statistics-gathering apparatus, and presents a result of statistics that is (i) indicated by the obtained viewing statistics information and (ii) a result of statistics on viewing of a content only by viewers who belong to a predetermined viewer category.
摘要:
Provided is a viewing terminal apparatus that can present an appropriate result of statistics on viewing of a content for diversified viewing modes. The viewing terminal apparatus includes: a category determining unit that determines, as a viewer category, a relationship between viewers who are viewing a content displayed on a display; a transmitting unit that transmits, to the viewing statistics-gathering apparatus, first viewing status information indicating the content that is being viewed by the viewers and the viewer category determined by the category determining unit, the content being associated with the viewer category; and a viewing statistics presenting unit that obtains viewing statistics information from the viewing statistics-gathering apparatus, and presents a result of statistics that is (i) indicated by the obtained viewing statistics information and (ii) a result of statistics on viewing of a content only by viewers who belong to a predetermined viewer category.
摘要:
A hearing aid, signal processing method and program enables TV sound to be made easier for a hearing aid user to hear when wishing to watch TV, and a person's voice to be made easier to hear when wishing to converse with that person. The hearing aid includes another-person's speech detection section that detects speech of a speaker other than the wearer using detected sound source direction information, an own-speech detection result, and a TV sound detection result, and a per-sound-source frequency calculation section that calculates the frequency of each sound source using an own-speech detection result, TV sound detection result, other-speaker's speech detection result, and sound source direction information. A scene determination section determines a scene using sound source direction information and a per-sound-source frequency; and an output sound control section controls hearing of the hearing aid according to a determined scene.
摘要:
Provided are a hearing aid, signal processing method, and program that enable TV sound to be made easier for a hearing aid user to hear when wishing to watch TV, and a person's voice to be made easier to hear when wishing to converse with that person. A hearing aid (100) is provided with an other-person's speech detection section (150) that detects speech of a speaker other than the wearer using detected sound source direction information, an own-speech detection result, and a TV sound detection result, and a per-sound-source frequency calculation section (160) that calculates the frequency of each sound source using an own-speech detection result, TV sound detection result, other-speaker's speech detection result, and sound source direction information. A scene determination section (170) determines a scene to be a “conversation scene,” “TV viewing scene,” and “‘TV viewing while . . . ’ scene” using sound source direction information and the per-sound-source frequency, and an output sound control section (180) controls hearing of the hearing aid (100) according to a determined scene.