Abstract:
To characterize an information signal having an amplitude-time waveform with local extreme values, the local extreme values of the information signal are first determined, each local extreme value being defined by a time instant and an amplitude. Next, area information of valleys or mountains of the information signal is ascertained in the case of a one-dimensional amplitude, or volume information of valleys or mountains in the case of a two-dimensional amplitude. A valley or mountain is defined by a temporal section of the information signal that extends from the time instant of a local extreme value to a temporally adjacent value of the information signal having the same amplitude as the local extreme value. Area or volume information of several mountains or valleys is characteristic of the information signal and permits further characterization of the information signal, build-up of an information signal database, or identification of an information signal on the basis of an existing information signal database. Because of its integral nature, this area or volume information is also robust against changes to the information signal in the form of overlays or distortions.
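The area computation for the one-dimensional case can be illustrated with a minimal sketch. It assumes a uniformly sampled signal, strict local extrema, a forward-only search for the adjacent sample of equal amplitude, trapezoidal integration, and linear interpolation at the crossing. None of these choices is stated in the abstract, and the names `local_extrema` and `section_area` are hypothetical.

```python
def local_extrema(s):
    """Return (index, kind) pairs for strict local extrema; kind is 'min' or 'max'."""
    found = []
    for i in range(1, len(s) - 1):
        if s[i] < s[i - 1] and s[i] < s[i + 1]:
            found.append((i, "min"))
        elif s[i] > s[i - 1] and s[i] > s[i + 1]:
            found.append((i, "max"))
    return found


def section_area(s, i, kind, dt=1.0):
    """Area between the signal and the level s[i], measured from sample i
    forward to the point where the signal first returns to that level.

    A 'min' extremum bounds a mountain (signal above the level); a 'max'
    extremum bounds a valley (signal below the level). Returns None if the
    signal never returns to the level (assumption: such sections are skipped).
    """
    level = s[i]
    sign = 1.0 if kind == "min" else -1.0
    area = 0.0
    for j in range(i + 1, len(s)):
        prev = sign * (s[j - 1] - level)  # height at the previous sample, >= 0
        cur = sign * (s[j] - level)       # height at the current sample
        if cur <= 0:
            # The signal crosses the level between j-1 and j: add the
            # remaining triangle up to the linearly interpolated crossing.
            frac = prev / (prev - cur)
            return area + 0.5 * prev * frac * dt
        area += 0.5 * (prev + cur) * dt   # trapezoid between two samples
    return None


signal = [2, 0, 1, 2, 1, 0, 3]
areas = [section_area(signal, i, kind) for i, kind in local_extrema(signal)]
```

The resulting list of areas, one per extremum, is the kind of feature vector the abstract describes as characteristic of the signal. Because each value is an integral over many samples, a small disturbance of individual samples changes it only slightly.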
Abstract:
An information producing apparatus is constructed for producing a combination of object information providing substantial contents and performance information providing a music piece in association with the substantial contents. In the information producing apparatus, a source section provides the object information having substantial contents. An extracting section analyzes the provided object information to extract characteristic information that is characteristic of its substantial contents. An attaching section uses the extracted characteristic information to attach performance information to the provided object information. An information reproducing apparatus, in turn, utilizes the attached performance information to provide a performance of a music piece as a music effect in association with the substantial contents when reproducing the object information transmitted from the information producing apparatus.
Abstract:
The invention provides a method and apparatus for automatically generating a summary or key phrase for a song. The song, or a portion thereof, is digitized and converted into a sequence of feature vectors, such as mel-frequency cepstral coefficients (MFCCs). The feature vectors are then processed in order to decipher the song's structure. Those sections that correspond to different structural elements are then marked with corresponding labels. Once the song is labeled, various heuristics are applied to select a key phrase corresponding to the song's summary. For example, the system may identify the label that appears most frequently within the song, and then select the longest duration of that label as the summary.
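The example heuristic at the end of this abstract can be sketched as follows. The sketch assumes the song has already been labeled frame by frame and that "most frequently" means the label covering the most frames; the abstract does not say whether frames or sections are counted. The function name `select_key_phrase` is hypothetical.

```python
from collections import Counter
from itertools import groupby


def select_key_phrase(labels):
    """Return (label, start, end) for the longest contiguous run of the most
    frequent label; start and end are frame indices, end exclusive."""
    if not labels:
        raise ValueError("labels must not be empty")
    # Assumption: frequency is measured in frames, not in sections.
    top_label, _ = Counter(labels).most_common(1)[0]
    best = None
    pos = 0
    for label, run in groupby(labels):
        length = len(list(run))
        if label == top_label and (best is None or length > best[2] - best[1]):
            best = (label, pos, pos + length)
        pos += length
    return best


# Frames labeled A, B, C stand in for structural elements such as verse or chorus.
summary = select_key_phrase(list("AABBBBAACCBBB"))  # ("B", 2, 6)
```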
Abstract:
In connection with a classification system for classifying media entities that merges perceptual classification techniques and digital signal processing classification techniques for improved classification of media entities, a system and methods are provided for automatically classifying and characterizing sonic properties of media entities. Such a system and methods may be useful for the indexing of a database or other storage collection of media entities, such as media entities that are audio files, or have portions that are audio files. The methods also help to determine media entities that have similar sonic properties by utilizing classification chain techniques that test distances between media entities in terms of their properties. For example, a neighborhood of songs may be determined within which each song has similar sonic properties.
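The idea of a neighborhood of songs with similar sonic properties can be illustrated with a simple radius search. This is not the classification chain technique named in the abstract, whose details are not given here. It is a plain Euclidean distance test over hypothetical feature vectors, and the name `neighborhood` is an assumption.

```python
import math


def neighborhood(features, query, radius):
    """Return the names of all entities whose feature vector lies within
    `radius` (Euclidean distance) of the query entity's vector."""
    q = features[query]
    return sorted(
        name
        for name, vec in features.items()
        if name != query and math.dist(q, vec) <= radius
    )


# Hypothetical two-dimensional sonic properties, for example tempo and brightness.
songs = {"a": (0.0, 0.0), "b": (0.1, 0.0), "c": (1.0, 1.0)}
near_a = neighborhood(songs, "a", 0.5)  # ["b"]
```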
Abstract:
The present invention is directed to classifying a musical piece based on determined characteristics for each of plural notes contained within the piece. Exemplary embodiments accommodate the fact that in a continuous piece of music, the starting and ending points of a note may overlap previous notes, the next note, or notes played in parallel by one or more instruments. This is complicated by the additional fact that different instruments produce notes with dramatically different characteristics. For example, notes with a sustaining stage, such as those produced by a trumpet or flute, possess high energy in the middle of the sustaining stage, while notes without a sustaining stage, such as those produced by a piano or guitar, possess high energy in the attacking stage when the note is first produced. Exemplary embodiments address these complexities to permit the indexing and retrieval of musical pieces in real time, in a database, thus simplifying database management and enhancing the ability to search multimedia assets contained in the database.
Abstract:
Fingerprint data derived from audio or other content is used as an identifier. The fingerprint data can be derived from the content. In one embodiment, fingerprint data supplied from two or more sources is aggregated. The aggregated fingerprint data is used to define a set of audio signals. An audio signal from the set of audio signals is selected based on its probability of matching the fingerprint data. Digital watermarks can also be similarly used to define a set of audio signals.
Abstract:
A method and apparatus for searching for multimedia files in a distributed database and for displaying results of the search based on the context and content of the multimedia files.
Abstract:
A data select apparatus is constructed for selecting a desired data item. In the apparatus, an internal memory device memorizes a first set of data items. A peripheral device is provided for accessing an external memory medium. A detector device presents a detection signal when the peripheral device receives the external memory medium for accessing and when the received external memory medium stores a second set of data items upon accessing. A controller device responds to the detection signal for merging the first set of data items retrieved from the internal memory device and the second set of data items retrieved from the external memory medium with each other and for sorting the merged data items in a predetermined order. A display device operates upon presence of the detection signal for displaying both of the first and second sets of the data items in the predetermined order, and operates upon absence of the detection signal for displaying only the first set of data items. A keyboard device is operated to select a desired data item from the displayed data items.
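The controller and display behavior described above can be sketched in a few lines. The sketch assumes that the detection signal is present exactly when an external medium is inserted and holds at least one data item, and that the predetermined order is ordinary ascending sort order. The name `items_to_display` is hypothetical.

```python
def items_to_display(internal, external=None):
    """Return the list of data items to show to the user.

    `external` is None when no medium is inserted, or a (possibly empty)
    list of the items stored on the inserted medium.
    """
    detection_signal = bool(external)  # medium present and holding items
    if detection_signal:
        # Merge both sets and sort them in the predetermined order.
        return sorted(list(internal) + list(external))
    # No detection signal: show only the internal set, unchanged.
    return list(internal)


merged = items_to_display(["b", "a"], ["c", "aa"])  # ["a", "aa", "b", "c"]
```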
Abstract:
A system and method for controlling an interactive playground in which aspects of the playground are dynamically varied based on input signals from sensors in the playground. The system includes a system supervisor unit that utilizes a rule file, a scene file and a MIDI file in conjunction with a variety of sensor inputs to create an appropriate system response. Output control signals generated by the system supervisor unit are transmitted to other coupled computers to effectuate audio, visual and other effects in an interactive playground environment. The system supervisor can load different scene, rule and MIDI files to create different system behavior in response to sensor stimuli, thereby creating a more adaptive, behavior-based environment.
Abstract:
An information processing system has a music data base. The music data base stores homophonic reference sequences of music notes. The reference sequences are all normalized to the same scale degree so that they can be stored lexicographically. Upon finding a match between a string of input music notes and a particular reference sequence through an N-ary query, the system provides bibliographic information associated with the matching reference sequence.
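A minimal sketch of such a lookup is given below. Several details are assumptions not taken from the abstract: notes are represented as MIDI note numbers, normalization transposes each sequence so that its first note becomes 0, and the N-ary query is replaced by a binary search from Python's `bisect` module. The names `normalize` and `MelodyIndex` are hypothetical.

```python
import bisect


def normalize(notes):
    """Transpose a note sequence so that its first note is 0 (assumed normalization)."""
    base = notes[0]
    return tuple(n - base for n in notes)


class MelodyIndex:
    """Reference sequences stored in lexicographic order with their bibliographic info."""

    def __init__(self, references):
        # references: iterable of (note sequence, bibliographic info string)
        self._entries = sorted((normalize(notes), info) for notes, info in references)
        self._keys = [key for key, _ in self._entries]

    def lookup(self, notes):
        """Return the info of the first reference that starts with the
        normalized input string, or None if there is no match."""
        key = normalize(notes)
        # In lexicographic order, every sequence that has `key` as a prefix
        # sorts at or directly after the insertion point of `key`.
        i = bisect.bisect_left(self._keys, key)
        if i < len(self._keys) and self._keys[i][:len(key)] == key:
            return self._entries[i][1]
        return None


index = MelodyIndex([
    ([60, 62, 64, 65], "Song A"),
    ([67, 67, 74, 74], "Song B"),
])
match = index.lookup([62, 64, 66])  # same contour as Song A, one whole tone higher
```

Storing the normalized sequences in sorted order is what makes the search logarithmic in the number of references, and the normalization lets an input in any key match the stored reference.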