摘要:
Embodiments of the present invention disclose an audio processing method applied to a cloud interaction system. The cloud interaction system comprises user equipment and a server. The method comprises: a server monitors for calling of an audio interface via an interactive application, and, upon detecting that the audio interface has been called by the interactive application, the server generates, according to the type of audio interface, an audio instruction corresponding to the type, and determines whether there is a sending record indicating that audio data corresponding to the audio instruction has been sent to the user equipment; if there is a sending record, the server sending the audio instruction to the user equipment, wherein the audio instruction instructs the user equipment to use buffered audio data when executing the audio instruction. The audio processing method provided by the embodiments of the present invention can improve audio quality of user equipment, and reduce network traffic between servers and user equipment.
摘要:
Methods and apparatus to audio watermarking and watermark detection and extracted are described herein. An example method includes receiving a media content signal, sampling the media content signal to generate samples, storing the samples in a buffer, determining a first sequence of samples in the buffer, determining a second sequence of samples in the buffer, wherein the second sequence of samples is of substantially equal length as the first sequence of samples, calculating an average of the first sequence of samples and the second sequence of samples to generate an average sequence of samples, extracting an identifier from the average sequence of samples, and storing the identifier in a tangible memory.
摘要:
A method of associating a content object with metadata uses a combination of a content identifier and a bounding identifier to enable handling of disparate sets of content identifiers for content objects with potentially conflicting content identifiers. The method receives a content identifier for a content object from among a set of content identifiers. It provides a unique bounding identifier for the set of content identifiers. This unique bounding identifier is used in combination with the content identifier to form a globally unique identifier for the content object. This globally unique identifier is associated with a metadata source, which enables routing of a user to the metadata source.
摘要:
Systems and methods for the matching of datasets, such as input audio segments, with known datasets in a database are disclosed. In an illustrative embodiment, the use of the presently disclosed systems and methods is described in conjunction with recognizing known network message recordings encountered during an outbound telephone call. The methodologies include creation of a ternary fingerprint bitmap to make the comparison process more efficient. Also disclosed are automated methodologies for creating the database of known datasets from a larger collection of datasets.
摘要:
A computing device, during sampling or playback of a work, receives a command to associate data with the work at a particular point in the work. The computing device generates a digital fingerprint of a segment of the work, wherein the segment corresponds to the particular point in the work. The computing device then associates the data with the digital fingerprint.
摘要:
Die Erfindung betrifft ein Verfahren, wobei ein akustisches Eingangssignal in einem Eingangswandler (14) in ein elektrisches Eingangssignal umgewandelt wird, wobei das elektrische Eingangssignal in einer Signalverarbeitungseinheit (16) in ein elektrisches Ausgangssignal verarbeitet wird, wobei das elektrische Ausgangssignal in einem Ausgangswandler (18, 20) in ein Ausgabesignal umgewandelt wird, wobei durch eine Analyse des elektrischen Eingangssignals ein Takt erkannt wird, und wobei der erkannte Takt einem Hörsystemnutzer wahrnehmbar ausgegeben wird. Des Weiteren betrifft die Erfindung ein entsprechendes Hörsystem (10), mittels welchem im Eingangssignal ein Takt erkannt und wahrnehmbar ausgegeben wird.
摘要:
A device (200) and method for calculating scattering features for audio signal recognition. An interface (240) receives an audio signal that is processed (S610) by a processor (210) to obtain an audio frame. The processor (210) calculates (S620) a first order scattering features from at least one audio frame and then calculates (S630) for the first order scattering features an estimation of whether the first order scattering features comprises sufficient information for accurate audio signal recognition. The processor (240) calculates (S650) a second order scattering features from the first order scattering features only in case the first order scattering features does not comprise sufficient information for accurate audio signal recognition. As second order features are calculated only when it is deemed necessary, less processing power can be used by the device, which can lead to less power used by the device.
摘要:
A user inputs, as a query pattern, a desired search-object rhythm pattern using a control, corresponding to a desired one of a plurality of performance parts constituting a performance data set (automatic accompaniment data set), in a rhythm input device (10). An input rhythm pattern storage section (212) stores the input rhythm pattern (query pattern) into a RAM on the basis of a clock signal output from a bar line clock output section (211) and input trigger data. A part identification section (213) identifies a search-object performance part corresponding to the user-operated control. For the identified performance part, a rhythm pattern search section (214) searches an automatic accompaniment database (221) for an automatic accompaniment data set including a rhythm pattern that matches, i.e. has the highest similarity to, the input rhythm pattern (query pattern).
摘要:
Methods and systems for arranging and searching a database of media content recordings are provided. In one example, a method is provided that comprises receiving a sample of media content, and performing, by a computing device, a content recognition of the sample of media content using a data file including a concatenation of representations for each of a plurality of media content recordings. In other examples, another method is provided that comprises receiving media content recordings, determining a representation for each media content recording, concatenating by a computing device the representation for each media content recording as a data file, and storing by the computing device a mapping between an identifier for a respective media content recording and a global position in the data file that corresponds to the representation of the respective media content recording.
摘要:
It is inter alia disclosed a method comprising: determining a divergence measure between a statistical distribution of audio features of a first audio track and a statistical distribution of audio features of at least one further audio track; determining a divergence measure threshold value from at least the divergence measure between the statistical distribution of audio features of a first audio track and the statistical distribution of audio features of the at least one further audio track; and comparing the divergence measure with the divergence measure threshold value.