摘要:
The present invention relates to a system and methodology to facilitate automatic generation of mnemonic audio portions or segments referred to as audio thumbnails. A system is provided for summarizing audio information. The system includes an analysis component to determine common features in an audio file and a mnemonic detector to extract fingerprint portions of the audio file based in part on the common features in order to generate a thumbnail of the audio file. The generated thumbnails can then be employed to facilitate browsing or searching audio files in order to mitigate listening to longer portions or segments of such files.
摘要:
The present invention relates to a system and methodology to facilitate automatic management and pruning of audio files residing in a database. Audio fingerprinting is a powerful tool for identifying streaming or file-based audio, using a database of fingerprints. Duplicate detection identifies duplicate audio clips in a set, even if the clips differ in compression quality or duration. The present invention can be provided as a self-contained application that it does not require an external database of fingerprints. Also, a user interface provides various options for managing and pruning the audio files.
摘要:
A “media stream customizer” customizes buffered media streams by inserting one or more media objects into the stream to maintain an approximate buffer level. Specifically, when media objects such as songs, jingles, advertisements, etc., are deleted from the buffered stream (based on some user specified preferences), the buffer level will decrease. Therefore, over time, as more objects are deleted, the amount of the media stream being buffered continues to decrease, thereby limiting the ability to perform additional deletions from the stream. To address this limitation, the media stream customizer automatically chooses one or more media objects to insert back into the stream, and ensures that the inserted objects are consistent with any surrounding content of the media stream, thereby maintaining an approximate buffer level. In addition, the buffered content can also be stretched using pitch preserving audio stretching techniques to further compensate for deletions from the buffered stream.
摘要:
A “Music Mapper” automatically constructs a set coordinate vectors for use in inferring similarity between various pieces of music. In particular, given a music similarity graph expressed as links between various artists, albums, songs, etc., the Music Mapper applies a recursive embedding process to embed each of the graphs music entries into a multi-dimensional space. This recursive embedding process also embeds new music items added to the music similarity graph without reembedding existing entries so long a convergent embedding solution is achieved. Given this embedding, coordinate vectors are then computed for each of the embedded musical items. The similarity between any two musical items is then determined as either a function of the distance between the two corresponding vectors. In various embodiments, this similarity is then used in constructing music playlists given one or more random or user selected seed songs or in a statistical music clustering process.
摘要:
A “Music Mapper” automatically constructs a set coordinate vectors for use in inferring similarity between various pieces of music. In particular, given a music similarity graph expressed as links between various artists, albums, songs, etc., the Music Mapper applies a recursive embedding process to embed each of the graphs music entries into a multi-dimensional space. This recursive embedding process also embeds new music items added to the music similarity graph without reembedding existing entries so long a convergent embedding solution is achieved. Given this embedding, coordinate vectors are then computed for each of the embedded musical items. The similarity between any two musical items is then determined as either a function of the distance between the two corresponding vectors. In various embodiments, this similarity is then used in constructing music playlists given one or more random or user selected seed songs or in a statistical music clustering process.
摘要:
A “media stream customizer” customizes buffered media streams by inserting one or more media objects into the stream to maintain an approximate buffer level. Specifically, when media objects such as songs, jingles, advertisements, etc., are deleted from the buffered stream (based on some user specified preferences), the buffer level will decrease. Therefore, over time, as more objects are deleted, the amount of the media stream being buffered continues to decrease, thereby limiting the ability to perform additional deletions from the stream. To address this limitation, the media stream customizer automatically chooses one or more media objects to insert back into the stream, and ensures that the inserted objects are consistent with any surrounding content of the media stream, thereby maintaining an approximate buffer level. In addition, the buffered content can also be stretched using pitch preserving audio stretching techniques to further compensate for deletions from the buffered stream.
摘要:
A “Music Mapper” automatically constructs a set coordinate vectors for use in inferring similarity between various pieces of music. In particular, given a music similarity graph expressed as links between various artists, albums, songs, etc., the Music Mapper applies a recursive embedding process to embed each of the graphs music entries into a multi-dimensional space. This recursive embedding process also embeds new music items added to the music similarity graph without reembedding existing entries so long a convergent embedding solution is achieved. Given this embedding, coordinate vectors are then computed for each of the embedded musical items. The similarity between any two musical items is then determined as either a function of the distance between the two corresponding vectors. In various embodiments, this similarity is then used in constructing music playlists given one or more random or user selected seed songs or in a statistical music clustering process.
摘要:
A “Music Mapper” automatically constructs a set coordinate vectors for use in inferring similarity between various pieces of music. In particular, given a music similarity graph expressed as links between various artists, albums, songs, etc., the Music Mapper applies a recursive embedding process to embed each of the graphs music entries into a multi-dimensional space. This recursive embedding process also embeds new music items added to the music similarity graph without reembedding existing entries so long a convergent embedding solution is achieved. Given this embedding, coordinate vectors are then computed for each of the embedded musical items. The similarity between any two musical items is then determined as either a function of the distance between the two corresponding vectors. In various embodiments, this similarity is then used in constructing music playlists given one or more random or user selected seed songs or in a statistical music clustering process.
摘要:
A technique for generating high-resolution bitmaps from low-resolution bitmaps. A low-resolution bitmap is magnified to form a magnified image. Edge detection is performed on the magnified image to find high contrast edges. A plurality of image patches of the magnified image are generated. These images patches are analyzed by performing connected components analysis on each of them using the high contrast edges to produce a plurality of foreground and background decisions determining whether a portion of an image patch is a background or a foreground region. Then the contrast of one or more pixels in each of the plurality of image patches is enhanced based on the foreground and background decisions. Finally, the system and method of the invention combines the luminance of the enhanced output pixels with the color values generated by the magnification algorithm. This produces a high-resolution bitmap from the contrast-enhanced pixels.
摘要:
A “Music Mapper” automatically constructs a set coordinate vectors for use in inferring similarity between various pieces of music. In particular, given a music similarity graph expressed as links between various artists, albums, songs, etc., the Music Mapper applies a recursive embedding process to embed each of the graphs music entries into a multi-dimensional space. This recursive embedding process also embeds new music items added to the music similarity graph without reembedding existing entries so long a convergent embedding solution is achieved. Given this embedding, coordinate vectors are then computed for each of the embedded musical items. The similarity between any two musical items is then determined as either a function of the distance between the two corresponding vectors. In various embodiments, this similarity is then used in constructing music playlists given one or more random or user selected seed songs or in a statistical music clustering process.