摘要:
In an example context of identifying live audio, an audio processor machine accesses audio data that represents a query sound and creates a spectrogram from the audio data. Each segment of the spectrogram represents a different time slice in the query sound. For each time slice, the audio processor machine determines one or more dominant frequencies and an aggregate energy value that represents a combination of all the energy for that dominant frequency and its harmonics. The machine creates a harmonogram by representing these aggregate energy values at these dominant frequencies in each time slice. The harmonogram thus may represent the strongest harmonic components within the query sound. The machine can identify the query sound by comparing its harmonogram to other harmonograms of other sounds and may respond to a user's submission of the query sound by providing an identifier of the query sound to the user.
摘要:
A method includes determining, at a processor of a device, an unordered match between a set of consecutive portions of a first audio fingerprint and a set of non-consecutive portions of a second audio fingerprint. The method also includes, in response to determining that a length of the unordered match satisfies a length criterion, outputting an indicator that the first audio fingerprint matches the second audio fingerprint.
摘要:
Methods and apparatus to generate signatures representative of media are disclosed. An example method includes transforming a block of samples from a time-domain representation to a frequency-domain representation comprising multiple frequency bands, determining a signature function by fitting a curve to at least a subset of the frequency bands, and calculating signature values for the block. Calculating the tuple includes calculating a first angle between a reference line and a first line that is tangent to the signature function at a first index, calculating a second angle between the reference line and a second line that is tangent to the signature function at a second index, calculating a third angle between the reference line and a third line that is tangent to the signature function at a third index, and creating the signature values based on the first angle, the second angle, and the third angle.
摘要:
Methods and apparatus for detecting a repetitive pattern in a sequence of audio frames are described. Similarity values of a first similarity matrix with first resolution for the sequence are calculated. An adaptive threshold is estimated from the similarity values for classifying the similarity values into repetition or non-repetition. For each of one or more offsets of a second similarity matrix with second resolution higher that the first resolution, similarity values of the second similarity matrix corresponding to the offset are calculated. Then the calculated similarity values are binarized with the adaptive threshold to obtain binarized data. Finally, the repetitive pattern is detected from the binarized data. The requirement on memory may be reduced because less data are stored in detecting the repetitive pattern.
摘要:
A music service application that can be run on a wireless mobile device enables audio data to be progressively downloaded from a remote server and also enables locally stored data to be played efficiently. Audio content that is relevant to a user is identified and downloaded to the user's mobile device, in some cases with minimal or no effort by the user. Continuous play features ensure that the user can experience an uninterrupted music experience, both in online and offline modes. Social features such as playlists and preferences of other users are leveraged, to provide users with popular music that is relevant to their interests.
摘要:
A search engine server supports crawling of third party servers communicatively coupled to the search engine server to gather vectors to web content, wherein the search engine server delivers a report to registered creative work owners by identifying vectors to web content that contain similarities to their works and by providing protection to the copyrighted creative works. The search engine server has components that identify similarities to the works of the registered owners of the creative works and provide protection by reporting to the registered owners as well as host third party servers, in case of textual, image, audio and video creative works. This service is an added value based service of the search engine server to the registered owners of the creative works upon service charge basis. The search engine server also provides additional services that include reporting to the host third party servers that contain web content having similarities to that of creative works of registered owners and assisting the third party servers to delete the content upon consideration.
摘要:
Provided is a search system which is configured to search for a registered vector being similar to an input vector among a plurality of registered vectors, on the basis of a degree of similarity between an input vector and a registered vector. The search system includes a partial similarity calculation unit that calculates a degree of partial similarity which is the degree of similarity concerning some of one or more dimensions of the input vector and the registered vector, a limit calculation unit that calculates, on the basis of the degree of partial similarity, an upper limit of the degree of similarity that is expected when the degree of similarity is calculated, and a rejection decision unit that decides, on the basis of the upper limit of the degree of similarity, whether or not to reject the registered vector from a candidate for a search result.
摘要:
A personalized car radio system comprising: a remote application server comprising: a collector, being a software that scans web sites continuously, for detecting content that corresponds to keywords expressing a driver's preferences; a client application, being a software that schedules displaying collected content in accordance with an alertness rank of the driver and a rhythm of the content; and a client device interacting with the application server by Unicast communication, the client device comprising: a safety module, being a software activated continuously or intermittently, for determining an alertness rank according to (a) metered movement of an organ of the driver, and (b) road condition; and a sounding device and a user interface thereof, for sounding the scheduled content; and a text-to-speech converter, being executed either on the server or the client device, for converting text files to audio files.
摘要:
The invention described herein is generally directed to a method and apparatus for creating and retrieving audio data. In one implementation the invention comprises an annotation system configured to record, store, and retrieve media. The annotation system contains a set of client-processing devices configured to capture media for subsequent playback. Each client-processing device typically contains a record button to initiate the capture and is configured upon performing the capture operation to trigger an association of a unique ID with the media. The client-processing devices are further configured to upload the media and a unique ID to a server for purposes of storage. The server obtains the media and unique ID for subsequent retrieval and provides the media and the unique ID to at least one client-processing device from the set of client processing devices.
摘要:
An excerpt of a media object is extracted by computing, for each bar of an N-bar loop, one or more perceptual quality vectors. For each of the one or more perceptual quality vectors within a search zone (S), one or more distances between bar i and bar i+N is computed and sorted to generate a sorted list of bars.