Abstract:
An azimuth and distance calculator computes the relative direction and distance to the next intersection on the guided route, based on intersection information supplied from a store of received information on the object to be guided to and on information about the user's movement history. The calculator then converts the relative direction into a horizontal angle and the distance into an elevation angle, and passes both angles to a stereophony generator. The stereophony generator creates output sound information whose sound image is localized outside the headphone and outputs that information to the headphone. In this manner, the user can accurately understand the distance to the object.
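The conversion described above can be sketched as follows. This is a minimal illustration only: the abstract does not state how distance is mapped to elevation, so the linear, clamped mapping and the names `guidance_angles`, `max_distance_m`, and `max_elevation_deg` are assumptions made for the example, as is the flat east/north coordinate frame.

```python
import math

def relative_bearing_deg(heading_deg, user_xy, target_xy):
    """Bearing of the target relative to the user's heading, in [-180, 180).

    Coordinates are (east, north) in metres; heading 0 means facing north,
    and positive angles are clockwise (to the user's right).
    """
    dx = target_xy[0] - user_xy[0]  # east offset
    dy = target_xy[1] - user_xy[1]  # north offset
    absolute = math.degrees(math.atan2(dx, dy))
    return (absolute - heading_deg + 180.0) % 360.0 - 180.0

def distance_to_elevation_deg(distance_m, max_distance_m=500.0,
                              max_elevation_deg=60.0):
    """Hypothetical mapping: elevation grows linearly with distance, clamped."""
    clamped = min(max(distance_m, 0.0), max_distance_m)
    return max_elevation_deg * clamped / max_distance_m

def guidance_angles(heading_deg, user_xy, target_xy):
    """Return (horizontal_angle_deg, elevation_angle_deg) for the sound image."""
    distance = math.hypot(target_xy[0] - user_xy[0], target_xy[1] - user_xy[1])
    horizontal = relative_bearing_deg(heading_deg, user_xy, target_xy)
    return horizontal, distance_to_elevation_deg(distance)
```

The two returned angles are what would be handed to the stereophony generator; the sound rendering itself is outside the scope of this sketch.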
Abstract:
The present invention aims to extract keywords from a conversation without advance preparation that anticipates those keywords. A keyword extracting device of the present invention includes an audio input section 101 through which a speech sound made by a speaker is input; a speech segment determination section 102 that determines a speech segment for each speaker from the input speech sound; a speech recognition section 103 that recognizes the speech sound of the determined speech segment for each speaker; an interrupt detection section 104 that detects, on the basis of another speaker's response to the speech sounds of the respective speakers, a feature of a speech response suggesting the presence of a keyword, namely an interrupt in which a preceding speech and a subsequent speech overlap; a keyword extraction section 105 that extracts the keyword from the speech in the speech segment specified on the basis of the interrupt; a keyword search section 106 that performs a keyword search using the keyword; and a display section 107 that displays a result of the keyword search.
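The interrupt condition, where a subsequent speech by another speaker starts before the preceding speech ends, can be sketched as a simple overlap test on per-speaker segments. The `Segment` type and `find_interrupts` function are hypothetical names for this illustration; the abstract does not specify the data layout or which of the two overlapping segments the keyword is taken from.

```python
from typing import List, NamedTuple, Tuple

class Segment(NamedTuple):
    speaker: str
    start: float  # seconds
    end: float    # seconds

def find_interrupts(segments: List[Segment]) -> List[Tuple[Segment, Segment]]:
    """Return (preceding, interrupting) pairs whose speech overlaps in time.

    Only overlaps between different speakers count as interrupts.
    """
    ordered = sorted(segments, key=lambda s: s.start)
    interrupts = []
    for i, prev in enumerate(ordered):
        for nxt in ordered[i + 1:]:
            if nxt.start >= prev.end:
                break  # later segments start even later, so no more overlap
            if nxt.speaker != prev.speaker:
                interrupts.append((prev, nxt))
    return interrupts
```

A downstream step would then pass the recognized text of the segment selected from each pair to keyword extraction.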
Abstract:
Provided is a lifestyle collecting apparatus that collects information for determining a lifestyle of a user, and includes: an object information detecting unit configured to detect object information representing an object around the user; a relevance degree calculating unit configured to calculate a relevance degree of the user to the object, using the object information; an appearance information extracting unit configured to extract appearance information from the object information, and add the relevance degree to the extracted appearance information, the appearance information representing an appearance of the object; and a lifestyle database which stores the appearance information to which the relevance degree has been added, as the information for determining the lifestyle of the user.
Abstract:
A voice output apparatus enhances the robustness of the interface between a user and the apparatus by transmitting information to the user via both a text message and a voice message. The voice output apparatus includes a display unit (107) that displays a text message, which is apparatus-transmitting information to be transmitted to the user, and a delay unit (105) and a voice output unit (106) that estimate the delay time necessary for the user's action of visually identifying the text message displayed by the display unit (107) and output the apparatus-transmitting information via voice message once the delay time (T) has passed after the text message is displayed.
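The timing relation between the two outputs can be sketched as a small schedule builder. How the delay time T is estimated is not described in the abstract, so this example takes T as an input; `schedule_outputs` and the tuple layout are hypothetical.

```python
def schedule_outputs(message, display_time_s, delay_t_s):
    """Return (time_s, channel, message) events in chronological order.

    The text is shown at display_time_s; the same information is spoken
    only after the estimated delay T has elapsed.
    """
    if delay_t_s < 0:
        raise ValueError("delay time T must be non-negative")
    return [
        (display_time_s, "text", message),
        (display_time_s + delay_t_s, "voice", message),
    ]
```

In a real device the second event would trigger speech synthesis; here it is only a record of when the voice message is due.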
Abstract:
The voice output apparatus, which enhances the robustness of the interface between a user and the apparatus by transmitting information to the user via both text message and voice message, comprises: a display unit (107) for displaying a text message that is apparatus-transmitting information to be transmitted to the user; and a delay unit (105) and a voice output unit (106) for estimating the delay time necessary for the user's action of visually identifying the text message displayed by the display unit (107), and for outputting the apparatus-transmitting information via voice message once the delay time (T) has passed after the text message is displayed.
Abstract:
An interesting section extracting device extracts, from a video file, an interesting section that is of interest to a user and that includes a specified time, with reference to an audio signal included in the video file. The device includes an interface device that obtains the specified time, and a likelihood vector generating unit that calculates, for each first unit section of the audio signal, likelihoods for anchor models that respectively represent features of a plurality of types of sound pieces, and generates likelihood vectors having the calculated likelihoods as components. An interesting section extracting unit uses the likelihood vectors to calculate a first feature section as a candidate for the interesting section to be extracted, and extracts, as the interesting section, a part of the first feature section that includes the specified time.
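The likelihood vector step can be illustrated with a small sketch. The abstract does not say what form the anchor models take, so this example assumes each anchor is a single diagonal-covariance Gaussian over a feature vector, and that each first unit section is summarized by one feature vector; `gaussian_likelihood` and `likelihood_vectors` are hypothetical names.

```python
import math

def gaussian_likelihood(x, mean, var):
    """Likelihood of feature vector x under a diagonal-covariance Gaussian."""
    log_p = 0.0
    for xi, mi, vi in zip(x, mean, var):
        log_p += -0.5 * (math.log(2.0 * math.pi * vi) + (xi - mi) ** 2 / vi)
    return math.exp(log_p)

def likelihood_vectors(unit_features, anchors):
    """One likelihood vector per unit section, one component per anchor model.

    unit_features: list of feature vectors, one per unit section.
    anchors: list of (mean, var) pairs, one per anchor model (assumed form).
    """
    return [
        [gaussian_likelihood(f, mean, var) for mean, var in anchors]
        for f in unit_features
    ]
```

Selecting the feature section that contains the specified time would operate on the sequence of vectors this function returns; that step is not sketched here because the abstract gives no detail on it.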
Abstract:
A call voice other than that of the conversation partner and various sounds are detected from audio signals input from plural microphones without deteriorating voice recognition precision. A hearing aid apparatus according to the present invention corrects a frequency characteristic of the call voice other than the conversation partner's voice based on the arrival direction of that call voice, which is estimated from the audio signals converted by the plural microphones. It checks the call voice whose frequency characteristic has been corrected by the frequency characteristic correction processing unit against a call word standard pattern, which represents features of phonemes and syllabic sounds based on other voice data picked up by microphones having one characteristic, to determine whether the call voice is a call word, and forms a directivity in the direction other than the arrival direction of the conversation partner's voice. The hearing aid apparatus corrects the frequency characteristic of the call voice other than the conversation partner's voice so that it has the same characteristic as that of the microphones used when the audio standard pattern was created.
Abstract:
An image information processing apparatus comprising: an extraction unit that extracts an object from a photographed image; a calculation unit that calculates an orientation of the object as exhibited in the image; and a provision unit that provides a tag to the image according to the orientation of the object.