摘要:
Techniques for repetition detection in media data are provided. Media features of many different types may be extracted from the media data. Query sequences of fingerprints may be selected time intervals that begin at query times. Matched sequences of fingerprints may be determined. A set of offset values may be determined based on the matched sequences of fingerprints. This set of offset values may be further refined into a set of significant time points using a relatively targeted search and comparison method based on the media features of a second type extracted from the media data.
摘要:
An apparatus and a method are provided for automatically extracting a representative sample from audio data. The apparatus includes storage means and audio data processing means. The audio data processing means embody the method by comprising a plurality of audio data processing modules respectively for section processing, chromagram processing tempo processing and metre processing. Each module outputs a respective representative section and the audio data processing means combines the plurality of sections as the representative sample and stores the representative sample in the storage means as an artist/album thumbnail.
摘要:
Techniques for ranking representative segments in media data are provided. Media features of many different types may be extracted from the media data. A plurality of ranking scores may be assigned to a plurality of candidate representative segments. Each individual candidate representative segment in the plurality of candidate representative segments comprises at least one scene in one or more statistical patterns in media features of the media data based on one or more types of features extractable from the media data. Each individual ranking score in the plurality of ranking scores may be assigned to an individual candidate representative segment in the plurality of candidate representative segments. A representative segment to be played to an end user may be selected from the candidate representative segments, based on the plurality of ranking scores.
摘要:
Techniques for ranking representative segments in media data are provided. Media features of many different types may be extracted from the media data. A plurality of ranking scores may be assigned to a plurality of candidate representative segments. Each individual candidate representative segment in the plurality of candidate representative segments comprises at least one scene in one or more statistical patterns in media features of the media data based on one or more types of features extractable from the media data. Each individual ranking score in the plurality of ranking scores may be assigned to an individual candidate representative segment in the plurality of candidate representative segments. A representative segment to be played to an end user may be selected from the candidate representative segments, based on the plurality of ranking scores.
摘要:
An apparatus, system, and method for extracting the structure of song lyrics using a repeated pattern thereof are provided. The apparatus includes a lyric extractor extracting lyric information from metadata related to an audio file, a character string information extractor extracting an interlude section and a repeated character string based on the extracted lyric information, a paragraph extractor extracting a paragraph based on the repeated character string and then a set of paragraphs having the same repeated pattern among the extracted paragraphs, and a lyric structure generator arranging an interlude section, a character string, and a paragraph related to the audio file in a tree structure.