Abstract:
A method and an apparatus for separating speech data from background data in an audio communication are suggested. The method comprises: applying a speech model to the audio communication for separating the speech data from the background data of the audio communication; and updating the speech model as a function of the speech data and the background data during the audio communication.
Abstract:
A particular implementation determines parameters of a generative probabilistic model from visual descriptors extracted from at least one image. The extracted visual descriptors are quantized and encoded using the model-based arithmetic encoding to be stored or for transmission to a decoder. The model parameters are also stored to be available to a decoder, or transmitted directly to a decoder. A decoder uses the stored, or received, model parameters to reconstruct the generative probabilistic model and then to decode the visual descriptors. The visual descriptors are used for image analysis tasks, such as image retrieval or object detection. A particular implementation uses a Gaussian mixture model as a generative probabilistic model.
Abstract:
A method and apparatus for processing audio content is described. The method and apparatus include receiving (510) audio content, the audio content including an input audio signal, a first reference audio signal, and a second reference audio signal, determining (550) a processing function for the input audio signal, the processing function determined based on a cost function between the input audio signal, the first reference audio signal and a second reference audio signal, and processing (560) the input audio signal using the determined processing function in order to produce an output audio signal.
Abstract:
A method and an apparatus for selecting or removing one or more audio component types in an audio source associated with a video source at an electronic device are presented. For example, the present invention allows a user of an electronic device to selectively choose or remove, e.g., speech, music, and/or some other component type of an audio source for a selected video program at a receiver. In another embodiment, the selected audio source may be from another program.
Abstract:
A method and a system of audio retrieval and source separation are described. The method comprises the steps of: receiving a textual query; retrieving a preliminary audio sample from an auxiliary audio database; retrieving a target audio sample from a target audio database; and separating the retrieved target audio sample into a plurality of audio source signals. The corresponding system comprises an input unit, a storing unit and a processing unit to implement the method.
Abstract:
A method and apparatus for processing audio content is described. The method and apparatus include receiving (510) audio content, the audio content including an input audio signal, a first reference audio signal, and a second reference audio signal, determining (550) a processing function for the input audio signal, the processing function determined based on a cost function between the input audio signal, the first reference audio signal and a second reference audio signal, and processing (560) the input audio signal using the determined processing function in order to produce an output audio signal.
Abstract:
Separation of speech and background from an audio mixture by using a speech example, generated from a source associated with a speech component in the audio mixture, to guide the separation process.