摘要:
A method, medium, and system generating a video abstract with high processing speeds, may include a detecting of an event candidate section from video data, based on audio information, a detecting of shot change information from the detected event candidate section, a detecting of final event sections from the detected event candidate section, based on the detected shot change information and visual information, and a generating of video abstract information by merging the extracted final event sections.
摘要:
A personalized service method using a user history in a mobile terminal is provided. The personalized service method using the user history in the mobile terminal includes: checking at least one of event information and context information which occurred due to a user; checking a user's location, a user's condition, a user's emotional state, or an event occurrence time when the at least one of event information and context information occur; reflecting the user's location, the user's condition, the user's emotional state, or the event occurrence time in the at least one of event information and context information, and recording the at least one of event information and context information, having reflected the user's location, the user's condition, the user's emotional state, or the event occurrence time, in a database in a diary type; and displaying a representing image corresponding to the at least one of event information and context information, having been recorded in the diary type.
摘要:
A method of recognizing speech is provided. The method includes the operations of (a) dividing first speech that is input to a speech recognizing apparatus into frames; (b) converting the frames of the first speech into frames of second speech by applying conversion rules to the divided frames, respectively; and (c) recognizing, by the speech recognizing apparatus, the frames of the second speech, wherein (b) comprises converting the frames of the first speech into the frames of the second speech by reflecting at least one frame from among the frames that are previously positioned with respect to a frame of the first speech.
摘要:
A personalized service method using a user history in a mobile terminal is provided. The personalized service method using the user history in the mobile terminal includes: checking at least one of event information and context information which occurred due to a user; checking a user's location, a user's condition, a user's emotional state, or an event occurrence time when the at least one of event information and context information occur; reflecting the user's location, the user's condition, the user's emotional state, or the event occurrence time in the at least one of event information and context information, and recording the at least one of event information and context information, having reflected the user's location, the user's condition, the user's emotional state, or the event occurrence time, in a database in a diary type; and displaying a representing image corresponding to the at least one of event information and context information, having been recorded in the diary type.
摘要:
An audio information retrieval method, medium, and system that can rapidly retrieve audio information, even in noisy environments, by extracting a modulation spectrum that is robust against noise, converting features of the extracted modulation spectrum into hash bits, and using a hash table. The audio information retrieval method may include extracting a modulation spectrum from audio data of a compressed domain, converting the extracted modulation spectrum into fingerprint bits, arranging the fingerprint bits in a form of a hash table, converting a received query into an address by a hash function corresponding to the query, and retrieving the audio information by referring to the hash table.
摘要:
A microphone signal compensation apparatus includes a plurality of audio input units to respectively receive a target signal, each audio input unit of the plurality of audio input units including a microphone; a constant filter unit to selectively apply a constant filtering calibration scheme to signals output by the plurality of audio input units to compensate for a difference in at least one characteristic among the audio input units, the constant filtering calibration scheme being estimated from an average value of a ratio of a desired signal to a reference signal among the signals output by the plurality of audio input units; and a noise remover to remove noise from the signals processed by the constant filter unit, and to separate the target signal from the signals from which the noise has been removed.
摘要:
An apparatus for estimating a Direction of Arrival (DOA) of a wideband includes a first signal receiving unit and a second signal receiving unit to receive a wideband signal while satisfying d≦Mc/2fs wherein ‘d’ denotes a distance the first signal receiving unit and the second signal receiving unit are spaced apart from each other, ‘c’ denotes the speed of sound, ‘M’ denotes a number of wideband frequencies being a number of fast Fourier transformation (FFT) points of a wideband signal, and ‘fs’ denotes a sampling frequency, and a DOA calculating unit to calculate a DOA (θ) using a normalized frequency ( f) which is obtained by performing an FFT on the respective wideband signals transmitted from the first signal receiving unit and the second signal receiving unit, and using the distance d.
摘要翻译:用于估计宽带的到达方向(DOA)的装置包括:第一信号接收单元和第二信号接收单元,用于在满足d&nlE的同时接收宽带信号; Mc / 2fs其中'd'表示第一信号接收单元 并且第二信号接收单元彼此间隔开,'c'表示声速,'M'表示宽带频率的数量,是宽带信号的快速傅里叶变换(FFT)点的数目,'fs “表示采样频率,DOA计算单元使用通过对从第一信号接收单元发送的各个宽带信号执行FFT并且接收到的第二信号而获得的归一化频率(f)来计算DOA(& 单位,并使用距离d。
摘要:
An audio information retrieval method, medium, and system that can rapidly retrieve audio information, even in noisy environments, by extracting a modulation spectrum that is robust against noise, converting features of the extracted modulation spectrum into hash bits, and using a hash table. The audio information retrieval method may include extracting a modulation spectrum from audio data of a compressed domain, converting the extracted modulation spectrum into fingerprint bits, arranging the fingerprint bits in a form of a hash table, converting a received query into an address by a hash function corresponding to the query, and retrieving the audio information by referring to the hash table.
摘要:
Provided is an apparatus and method for recognizing characters. The apparatus includes a display unit to display an image in which a region of interest or an error region is indicated, and a character recognition result, a region-of-interest setting unit to set the region of interest in the image displayed on the display unit, a recognition unit to perform character recognition on the region of interest or the error region and provide the character recognition result to the display unit, and an error correction unit to set the error region in the image displayed on the display region, perform image copying on the set error region according to a user input, and provide a handwriting input using the image copying to the recognition unit.
摘要:
A system for playing music is provided. The system includes: a mood categorizer categorizing a mood of a music file; a similar music search module searching for similar music having a mood similar to music which a user desires by referring to the categorized mood; a highlight detector detecting a highlight section of the music file; and a theme categorizer categorizing a theme of the music file.