MULTI-CHANNEL AUDIO CODING
    1.
    发明公开
    MULTI-CHANNEL AUDIO CODING 有权
    多声道音频CODING

    公开(公告)号:EP1866912A1

    公开(公告)日:2007-12-19

    申请号:EP06711112.0

    申请日:2006-03-16

    IPC分类号: G10L19/00 H04S3/00

    摘要: A multi-channel audio encoder (10) for encoding a multi-channel audio signal (101), e.g. a 5.1 channel audio signal, into a spatial down-mix (102), e.g. a stereo signal, and associated parameters (104, 105). The encoder (10) comprises first and second units (110, 120). The first unit (110) encodes the multi-channel audio signal (101) into the spatial down-mix (102) and parameters (104). These parameters (104) enable a multi-channel decoder (20) to reconstruct the multi-channel audio signal (203) from the spatial down-mix (102). The second unit (120) generates, from the spatial down-mix (102), parameters (105) that enable the decoder to reconstruct the spatial down-mix (202) from an alternative down-mix (103), e.g. a so-called artistic down-mix that has been manually mixed in a sound studio. In this way, the decoder (20) can efficiently deal with a situation in which an alternative down-mix (103) is received instead of the regular spatial, down-mix (102). In the decoder (20), first the spatial down-mix (202) is reconstructed from the alternative down-mix (103) and the parameters (105). Next, the spatial down-mix (202) is decoded into the multi-channel audio signal (203).

    A BINAURAL OBJECT-ORIENTED AUDIO DECODER
    3.
    发明公开
    A BINAURAL OBJECT-ORIENTED AUDIO DECODER 审中-公开
    双耳面向对象音频解码器

    公开(公告)号:EP2158791A1

    公开(公告)日:2010-03-03

    申请号:EP08763420.0

    申请日:2008-06-23

    IPC分类号: H04S1/00

    摘要: A binaural object-oriented audio decoder comprising decoding means for decoding and rendering at least one audio object based on head-related transfer function parameters is proposed. Said decoding means are being arranged for positioning an audio object in a virtual three-dimensional space. Said head-related transfer function parameters are being based on an elevation parameter, an azimuth parameter, and a distance parameter. Said parameters are corresponding to the position of the audio object in the virtual three- dimensional space. The binaural object-oriented audio decoder is configured for receiving the head-related transfer function parameters, whereby said received head-related transfer function parameters are varying for the elevation parameter and the azimuth parameter only. Said binaural object-oriented audio decoder is characterized by distance processing means for modifying the received head-related transfer function parameters according to a received desired distance parameter. Said modified head-related transfer function parameters are being used to position the audio object in the three-dimensions at the desired distance. Said modification of the head-related transfer function parameters is based on a predetermined distance parameter for said received head-related function parameters.

    METHOD OF GENERATING AND METHODS OF FILTERING A USER PROFILE
    4.
    发明公开
    METHOD OF GENERATING AND METHODS OF FILTERING A USER PROFILE 审中-公开
    一种用于生成和方法用于过滤用户配置文件

    公开(公告)号:EP1952281A2

    公开(公告)日:2008-08-06

    申请号:EP06821221.6

    申请日:2006-10-27

    IPC分类号: G06F17/30

    摘要: A method of generating a user profile (7;17,19;29) of a user of a device (14,15;23-26) for processing data representative of items of content, a respective recording of at least one perceptible content element being associated with each item of content, includes determining a set (3) containing a plurality of recordings of at least one perceptible content element, each associated with one of a plurality of items of content associated with the user, and generating data representative of a user profile (7;17,19;29) associating preferences of the user with the user, wherein the data representative of the user profile (7;17,19;29) includes one or more parameter values within an at least one-dimensional feature space, each dimension representing a property of at least a section of a recording of at perceptible content element, such that at least one of the dimensions in the feature space represents a quantifiable property of at least a section of a perceptible content element. At least one set of parameter values, each set containing at least one parameter value and quantifying a dimension of the feature space in the user profile (7;17,19;29), is obtained by applying a pre-determined analysis algorithm to each of a plurality of signals, each signal representing at least a section of a recording of at least one perceptible content element, such that the set of parameter values is based on a plurality of the recordings in the set of recordings.

    METHOD AND DEVICE FOR CALCULATING A SIMILARITY METRIC BETWEEN A FIRST FEATURE VECTOR AND A SECOND FEATURE VECTOR
    5.
    发明公开
    METHOD AND DEVICE FOR CALCULATING A SIMILARITY METRIC BETWEEN A FIRST FEATURE VECTOR AND A SECOND FEATURE VECTOR 审中-公开
    方法和设备用于计算相似尺度之间的第一特征矢量和第二特征向量

    公开(公告)号:EP1941400A1

    公开(公告)日:2008-07-09

    申请号:EP06809602.3

    申请日:2006-10-16

    IPC分类号: G06F17/30

    摘要: The method of calculating a similarity metric between a first feature vector of a first audio and/or video signal and a second feature vector of a second audio and/or video signal of the invention comprises the step of calculating a distance between the first feature vector and the second feature vector, both the first feature vector and the second feature vector comprising a feature value of a second dimension, wherein a weighting factor is used that gives a stronger weight to the first dimension than to the second dimension. The electronic device of the invention comprises electronic circuitry, which is operative to perform the method of the invention. The software of the invention makes a programmable device operative to perform the method of the invention.

    GENERATION OF MULTI-CHANNEL AUDIO SIGNALS
    6.
    发明公开
    GENERATION OF MULTI-CHANNEL AUDIO SIGNALS 有权
    生产了多声道音频信号

    公开(公告)号:EP1905006A1

    公开(公告)日:2008-04-02

    申请号:EP06780055.7

    申请日:2006-07-12

    IPC分类号: G10L19/00 H04S3/00

    CPC分类号: H04S3/008 G10L19/008

    摘要: A decoder (115) generates a multi channel audio signal, such as a surround sound signal, from a received first signal. The multi-channel signal comprises a second set of audio channels and the first signal comprises a first set of audio channels. The decoder (115) comprises a receiver (401) which receives the first signal. The receiver (401) is coupled to an estimate processor (405) which generates estimated parametric data for the second set of audio channels in response to characteristics of the first set of audio channels. The estimated parametric data relates characteristics of the second set of audio channels to characteristics of the first set of audio channels. The decoder (115) furthermore comprises a spatial audio decoder (403) which decodes the first signal in response to the estimated parametric data to generate the multi-channel signal comprising the second set of channels. The invention allows use of spatial audio decoding with signals that are not encoded by a spatial audio encoder.

    MULTI-CHANNEL ENCODER
    7.
    发明公开
    MULTI-CHANNEL ENCODER 有权
    多信道编码器

    公开(公告)号:EP1735774A2

    公开(公告)日:2006-12-27

    申请号:EP05718568.8

    申请日:2005-03-25

    IPC分类号: G10L19/00

    CPC分类号: G10L19/008

    摘要: There is described a multi-channel encoder (10) for processing input signals (300, 310, 320, 330, 340) conveyed in N input channels to generate corresponding output signals (480, 490) conveyed in M output channels together with complementary parametric data (370, 430, 450); M and N are integers wherein N>M. The encoder (10) includes a down-mixer for down-mixing the input signals (30, 310, 320, 330, 340) to generate the corresponding output signals (480, 490), the encoder also comprising an analyser for processing the input signals (300, 310, 320, 330, 340) to generate the parameter data (370, 430, 450), said parametric data describing mutual differences between the N channels of input signal to allow for regenerating during decoding one or more of the N channels of input signals from the M channels of output signal. Such an encoder (10) is capable of providing highly efficient data encoding and also of being backwards compatibility with relatively simpler decoders having fewer than N decoding output channels. The invention also concerns decoders compatible with such a multi-channel.

    AUDIO CODING
    8.
    发明授权
    AUDIO CODING 有权
    声音编码

    公开(公告)号:EP1595247B1

    公开(公告)日:2006-09-13

    申请号:EP04709311.7

    申请日:2004-02-09

    IPC分类号: G10L19/00 H04S5/00

    摘要: Parametric stereo coders use perceptually relevant parameters of the input signal to describe spatial properties. One of these parameters is the phase difference between the input signals (ITD or IPD). This time difference only determines the relative time difference between the input signals, without any information about how these time differences should be divided over the output signals in the decoder. An additional parameter is included in the encoded signal that describes how the ITD or IPD should be distributed between the output channels. To this goal the delay between a computed monaural signal and one of the input signals is used.

    ENCODING AUDIO SIGNALS
    9.
    发明公开
    ENCODING AUDIO SIGNALS 有权
    编码音频信号

    公开(公告)号:EP1671316A1

    公开(公告)日:2006-06-21

    申请号:EP04770014.1

    申请日:2004-09-16

    IPC分类号: G10L19/00

    CPC分类号: G10L19/008 H04S2420/03

    摘要: The encoder transforms the audio signals (x(n),y(n)) from the time domain to audio signal (X(k),Y(k)) in the frequency domain, and determines the cross-correlation function (Ri, Pi) in the frequency domain. A complex coherence value (Qi) is calculated by summing the (complex) cross-correlation function values (Ri, Pi) in the frequency domain. The inter-channel phase difference (IPDi) is estimated by the argument of the complex coherence value (Qi), and the inter-channel coherence (ICi) is estimated by the absolute value of the complex coherence value (Qi). In the prior art a computational intensive Inverse Fast Fourier Transformation and search for the maximum value of the cross-correlation function (Ri; Pi) in the time domain are required.

    摘要翻译: 编码器在频域中将音频信号(x(n),y(n))从时域变换为音频信号(X(k),Y(k)),并且确定互相关函数(Ri, Pi)在频域中。 通过在频域中对(复数)互相关函数值(Ri,Pi)求和来计算复相干值(Qi)。 信道间相位差(IPDi)由复相干值(Qi)的自变量估计,并且通道间相干性(ICi)由复相干值(Qi)的绝对值估计。 在现有技术中,需要计算密集的逆快速傅里叶变换和搜索时域中互相关函数(R 1; P i)的最大值。

    METHOD OF DERIVING A SET OF FEATURES FOR AN AUDIO INPUT SIGNAL
    10.
    发明公开
    METHOD OF DERIVING A SET OF FEATURES FOR AN AUDIO INPUT SIGNAL 有权
    FOR领导方法属性的数量AUDIO INPUT

    公开(公告)号:EP1941486A1

    公开(公告)日:2008-07-09

    申请号:EP06809601.5

    申请日:2006-10-16

    IPC分类号: G10H1/00

    摘要: The invention describes a method of deriving a set of features (S) of an audio input signal (M), which method comprises identifying a number of first-order features (f1, f2, ... , ff) of the audio input signal (M), generating a number of correlation values (ρ1 , ρ2, ... , ρI) from at least part of the first-order features (f1, f2, ... , ff), and compiling the set of features (S) for the audio input signal (M) using the correlation values (ρ1, ρ2, ..., ρI). The invention further describes a method of classifying an audio input signal (M) into a group, and a method of comparing audio input signals (M, M') to determine a degree of similarity between the audio input signals (M, M'). The invention also describes a system (1) for deriving a set of features (S) of an audio input signal (M), a classifying system (4) for classifying an audio input signal (M) into a group, and a comparison system (5) for comparing audio input signals (M, M') to determine a degree of similarity between the audio input signals (M, M').