摘要:
An audio information retrieval method, medium, and system that can rapidly retrieve audio information, even in noisy environments, by extracting a modulation spectrum that is robust against noise, converting features of the extracted modulation spectrum into hash bits, and using a hash table. The audio information retrieval method may include extracting a modulation spectrum from audio data of a compressed domain, converting the extracted modulation spectrum into fingerprint bits, arranging the fingerprint bits in a form of a hash table, converting a received query into an address by a hash function corresponding to the query, and retrieving the audio information by referring to the hash table.
摘要:
An audio information retrieval method, medium, and system that can rapidly retrieve audio information, even in noisy environments, by extracting a modulation spectrum that is robust against noise, converting features of the extracted modulation spectrum into hash bits, and using a hash table. The audio information retrieval method may include extracting a modulation spectrum from audio data of a compressed domain, converting the extracted modulation spectrum into fingerprint bits, arranging the fingerprint bits in a form of a hash table, converting a received query into an address by a hash function corresponding to the query, and retrieving the audio information by referring to the hash table.
摘要:
An audio information retrieval method, medium, and system that can rapidly retrieve audio information, even in noisy environments, by extracting a modulation spectrum that is robust against noise, converting features of the extracted modulation spectrum into hash bits, and using a hash table. The audio information retrieval method may include extracting a modulation spectrum from audio data of a compressed domain, converting the extracted modulation spectrum into fingerprint bits, arranging the fingerprint bits in a form of a hash table, converting a received query into an address by a hash function corresponding to the query, and retrieving the audio information by referring to the hash table.
摘要:
A method and apparatus for classifying mood of music at high speed. The method includes: extracting a Modified Discrete Cosine Transformation-based timbre feature from a compressed domain of a music file; extracting a Modified Discrete Cosine Transformation-based tempo feature from the compressed domain of the music file; and classifying the mood of the music file based on the extracted timbre feature and the extracted tempo feature.
摘要:
Embodiments of the present invention relate to a method, medium, and system for summarizing music. The method includes summarizing a music content by extracting an audio feature value from a compressed segment of music data, tracking change points of the music content using the extracted audio feature value and re-configuring segments, selecting a fixed length fragment from each of the reconfigured segments and clustering the selected fragment so as to measure similarity and redundancy between the respective segments, and generating a summary of the music content using a segment selected based on the measured similarity and redundancy between the respective segments.
摘要:
A method, medium, and system generating a video abstract with high processing speeds, may include a detecting of an event candidate section from video data, based on audio information, a detecting of shot change information from the detected event candidate section, a detecting of final event sections from the detected event candidate section, based on the detected shot change information and visual information, and a generating of video abstract information by merging the extracted final event sections.
摘要:
A broadcast program summary generation system, method and medium are provided. The broadcast program summary generation system includes a format transformation unit to transform a broadcast format of digital broadcast data into a storage format, and a summary generation unit to decode video data of the transformed digital broadcast data, to analyze the decoded video data, to detect an important event by analyzing audio data of the transformed digital broadcast data, and to generate summary information based on the important event.
摘要:
A summary clip generation system according to the present invention includes: an event detection unit detecting a video event and an audio event from multimedia contents; a segment generation unit generating at least one segment by dividing or merging at least one shot which forms the multimedia contents, by referring to the video event; and a segment selection unit selecting a segment whose uprush degree is greater than a predetermined level, from the at least one segment by referring to the uprush degree which is calculated using the video event and the audio event, corresponding to each of the generated segments.
摘要:
A method, system, and medium for indexing an image object. The system of indexing an image object, the system includes: an image input unit receiving an image from a camera of a portable device, and displaying the received image on a display unit; a geographical object identification unit identifying a geographical object included in an object location corresponding to the image; a context information extraction unit extracting context information corresponding to the identified geographical object from a context database; and a display control unit displaying the context information on a position of the image, the position corresponding to the geographical object, and the image being displayed on the display unit.
摘要:
A method, medium, and system generating navigation information of a sports video. The method may include detecting a candidate navigation point by analyzing video data in the sports video, and analyzing a caption from the candidate navigation point and generating the navigation information by determining a navigation section according to a result of the caption analysis.