-
公开(公告)号:EP1815462A1
公开(公告)日:2007-08-08
申请号:EP05798851.1
申请日:2005-11-03
发明人: DEN BRINKER, Albertus, C. , RIERA PALOU, Felipe , OOMEN, Arnoldus, W., J. , RAULT, Jean-Bernard, H., M. , VIRETTE, David, S. T. , PHILIPPE, Pierrick, J.-L. M.
IPC分类号: G10L19/08
CPC分类号: G10L19/16 , G10L19/0204 , G10L19/08
摘要: An audio encoding device (100) comprises first encoding means (101, 111) for encoding transient signal components and/or sinusoidal signal components of an audio signal (x(n)) and producing a residual signal (z(n)), and second encoding means for encoding the residual signal. The second encoding means comprise filter means (122) for selecting at least two frequency bands of the residual signal. The selected frequency bands (LF, HF) of the residual signal (z(n)) are encoded by a first encoding unit (123) and a second encoding unit (124) respectively. The first encoding unit (123) may comprise a waveform encoder, such as a time-domain encoder, while the second encoding unit (124) may comprise a noise encoder.
摘要翻译: 音频编码设备(100)包括用于编码音频信号(x(n))的瞬态信号分量和/或正弦信号分量并产生残余信号(z(n))的第一编码装置(101,111),以及 第二编码装置,用于编码残差信号。 第二编码装置包括用于选择剩余信号的至少两个频带的滤波器装置(122)。 第一编码单元(123)和第二编码单元(124)分别对残差信号(z(n))的选定频带(LF,HF)进行编码。 第一编码单元(123)可以包括诸如时域编码器的波形编码器,而第二编码单元(124)可以包括噪声编码器。
-
2.
公开(公告)号:EP1761915B1
公开(公告)日:2008-12-03
申请号:EP05746699.7
申请日:2005-06-14
IPC分类号: G10L19/00
CPC分类号: G10L19/008 , G10L19/04 , G10L25/12 , G10L25/27
摘要: An encoder (100) for encoding a multi-channel audio signal comprises a prediction processor (101) which generates two residual signals for two signal components of the multi-channel signal by linear prediction which is associated with psycho-acoustic prediction filters. A rotation processor (105) rotates the combined signal of the two residual signals to generate a main signal and a side signal. Preferably, the energy of the main signal is maximized and the energy of the side signal is minimized. An encoding processor (109) encodes the main and preferably the side signal and an output processor (111) generates an output signal comprising the encoded main data and preferably the side data, prediction parameters and rotation parameters. The combination of linear prediction, use of psycho-acoustic characteristics and the general encoder (100) for encoding a multi-channel signal comprises a prediction processor (101) which generates two residual signals for two signal components of the multi-channel signal by linear prediction which is associated with psycho-acoustic characteristics and which specifically uses psycho-acoustic prediction filters. A rotation processor (105) rotates the combined signal of the two residual signals to generate a main signal and a side signal. Preferably, the energy of the main signal is maximized and the energy of the side signal is minimized. An encoding processor (109) encodes the main and preferably the side signal and an output processor (111) generates an output signal comprising the encoded main data and preferably the side data, prediction parameters and rotation parameters. The combination of linear prediction, use of psycho-acoustic characteristics and the generation of a main and side signal improves encoding and enhances the flexibility of the encoder for different data rates.
-
公开(公告)号:EP1851752A1
公开(公告)日:2007-11-07
申请号:EP06710801.9
申请日:2006-02-01
发明人: SZCZERBA, Marek , DEN BRINKER, Albertus, C. , GERRITS, Andreas, J. , OOMEN, Arnoldus, W., J. , KLEIN MIDDELINK, Marc
IPC分类号: G10H7/00
CPC分类号: G10H7/00 , G10H1/22 , G10H2230/041 , G10H2250/495
摘要: A device (1) is arranged for synthesizing sound represented by sets of parameters, each set comprising noise parameters (NP) representing noise components of the sound and optionally also other parameters representing other components, such as transients and sinusoids. Each set of parameters may correspond with a sound channel, such as a MIDI voice. In order to reduce the computational load, the device comprises a selection unit (2) for selecting a limited number of sets from the total number of sets on the basis of a perceptual relevance value, such as the amplitude or energy. The device further comprises a synthesizing unit (3) for synthesizing the noise components using the noise parameters of the selected sets only.
-
公开(公告)号:EP1735777A1
公开(公告)日:2006-12-27
申请号:EP05718571.2
申请日:2005-03-25
CPC分类号: G10L19/02 , G10L19/008 , G10L19/0204 , H04S3/008
摘要: There is described a method of encoding input signals (CHI to CH3; 400 to 450) in a multi-channel encoder (5; 15) to generate corresponding output data comprising down-mix output signals (610, 620) together with complementary parametric data (600). The method includes a first step of down-mixing input signals (CHI to CH3; 400 to 450) to generate the corresponding down-mix output signals (610, 620), and a second step of processing the input signals (CHI to CH3; 400 to 450) during down-mixing to generate said parametric data (600) complementary to the down-mix output signals (610, 620). Processing of the input signals (CHI to CH3; 400 to 450) involves including information in the down-mix signals (610, 620) which is useable during subsequent decoding of the down-mix output signals (610, 620) and the parametric data (600) to determine at least some parameter data and thereby enabling representations of the input signals (CHI to CH3; 400 to 450) to be subsequently regenerated. Coders for use in the encoder (5; 15) for performing essential signal processing operations therein are also elucidated.
摘要翻译: 描述了在多通道编码器(5; 15)中对输入信号(CH1至CH3; 400至450)进行编码的方法,以生成相应的输出数据,包括下混合输出信号(610,620)以及互补参数数据 (600)。 该方法包括将输入信号(CH1至GH3; 400至450)下变频以产生相应的下混合输出信号(610,620)的第一步骤,以及处理输入信号(CH1至CH3; 400至450),以产生与下混合输出信号(610,620)互补的所述参数数据(600)。 输入信号(CH1至CH3; 400至450)的处理涉及将下变频信号(610,620)中的信息包括在下混合输出信号(610,620)的随后解码期间可用的信息和参数数据 (600)以确定至少一些参数数据,从而使得能够随后再生所述输入信号(CH1至CH3; 400至450)的表示。 还阐明了用于在编码器(5; 15)中进行基本信号处理操作的编码器。
-
公开(公告)号:EP1692688A1
公开(公告)日:2006-08-23
申请号:EP04799235.9
申请日:2004-11-24
CPC分类号: G10L19/093 , G10L19/10 , G10L19/24
摘要: An audio coder is arranged to process a respective set of sampled signal values for each of a plurality of sequential segments of an audio signal (x). The coder comprises an analyser (TSA) arranged to analyse the sampled signal values to provide one or more sinusoidal codes (Cs) corresponding to respective sinusoidal components of the audio signal. A subtractor subtracts a signal corresponding to the sinusoidal components from the audio signal to provide a first residual signal (r1). A modeller (SEG) models the frequency spectrum of the first residual signal (r1) by determining first filter parameters (Ps) of a filter which has a frequency response approximating a frequency spectrum of the first residual signal. Another subtractor subtracts a signal corresponding to the first filter parameters from the first residual signal to provide a second residual signal (r2). Another modeller (RPE) models a component (r2,r3) of the second residual signal with a pulse train coder (RPE) to provide respective pulse train parameters (L0). A bit stream generator (15) generates an encoded audio stream (AS) including the sinusoidal codes (Cs), the first filter parameters (Ps) and the pulse train parameters (L0).
摘要翻译: 音频编码器被设置为处理音频信号(x)的多个连续段中的每一个的相应的一组采样信号值。 编码器包括分析器(TSA),其被设置为分析采样的信号值以提供与音频信号的相应正弦分量相对应的一个或多个正弦码(Cs)。 减法器从音频信号中减去对应于正弦分量的信号以提供第一残差信号(r1)。 建模器(SEG)通过确定具有近似于第一残差信号的频谱的频率响应的滤波器的第一滤波器参数(Ps)来对第一残差信号(r1)的频谱进行建模。 另一减法器从第一残余信号中减去对应于第一滤波器参数的信号以提供第二残余信号(r2)。 另一个建模器(RPE)用脉冲编码器(RPE)建模第二个剩余信号的分量(r2,r3),以提供相应的脉冲序列参数(L0)。 比特流生成器(15)生成包括正弦码(Cs),第一滤波器参数(Ps)和脉冲串参数(L0)的编码音频流(AS)。
-
公开(公告)号:EP1514262B1
公开(公告)日:2006-08-16
申请号:EP03722975.4
申请日:2003-05-16
IPC分类号: G10L19/04
摘要: A method of encoding ( 14 ) an audio signal (x(n)) is disclosed. The method comprises the step of modelling ( 16 ) the audio signal in accordance with a frequency sensitizing parameter (( ) to provide a set of infinite impulse response (IIR) filter type characteristics (( 0 . . . k-1) of an order K and capable of being linearly combined with the sensitizing parameter (( ) to provide an estimate ( ) for the audio signal (x(n)), the IIR type filter model satisfying the requirements of a minimum phase filter. The set of characteristics (( 0 . . . k-1) of order K are transformed as a function of the sensitizing parameter (( ) to provide a set of characteristics (c 0 . . . k) of order K+1 compatible with finite impulse response (FIR) filter type characteristics satisfying the requirements of a minimum phase filter. The set of characteristics (c 0 . . . k) of order K+1 are normalised to provide a set of characteristics (d 1 . . . k) of order K. An encoded audio stream ( 50 ) is generated to include representations (LAR,LSFs) of the normalised set of characteristics (d 1 . . . k) of order K.
-
公开(公告)号:EP1568012A1
公开(公告)日:2005-08-31
申请号:EP03758591.6
申请日:2003-11-06
IPC分类号: G10L19/08
CPC分类号: G10L19/093
摘要: Coding of an audio signal represented by a respective set of sampled signal values for each of a plurality of sequential segments is disclosed. The sampled signal values are analysed (40) to determine one or more sinusoidal components for each of the plurality of sequential segments. The sinusoidal components are linked (42) across a plurality of sequential segments to provide sinusoidal tracks. For each sinusoidal track, a phase comprising a generally monotonically changing value is determined and an encoded audio stream including sinusoidal codes (r) representing said phase is generated (46).
-
公开(公告)号:EP1522063A1
公开(公告)日:2005-04-13
申请号:EP03735915.5
申请日:2003-06-18
IPC分类号: G10L19/02
CPC分类号: G10L19/093
摘要: Coding (1) an audio signal (x) comprises providing a respective set of sampled signal values for each of a plurality of sequential segments. The sampled signal values are analysed (130) to generate one or more sinusoidal components for each of the plurality of sequential segments. The sinusoidal components are linked across a plurality of sequential segments. Sinusoidal codes (CS) comprise tracks of linked sinusoidal components for each of the plurality of sequential segments. Each track comprises a frequency and amplitude for a sinusoidal component in a starting segment of a track whereas selected tracks include an indicator that no phase is included for said starting segment.
摘要翻译: 编码(1)音频信号(x)包括为多个连续段中的每一个提供相应的一组采样信号值。 对采样的信号值进行分析(130)以针对多个连续段中的每一个生成一个或多个正弦分量。 正弦分量跨多个连续的段连接。 正弦码(CS)包括多个连续段中的每一个的链接正弦分量的轨迹。 每个轨道包括用于轨道的起始段中的正弦分量的频率和幅度,而所选择的轨道包括所述起始段不包括相位的指示符。
-
公开(公告)号:EP1125282A1
公开(公告)日:2001-08-22
申请号:EP00960515.5
申请日:2000-08-24
IPC分类号: G10L19/02
CPC分类号: G10L19/0208
摘要: In a sinusoidal audio encoder it is known to use different time scales for analyzing different parts of the frequency spectrum. In prior art encoders sub-band filtering is used to split the input signal into a number of sub bands. By splitting the input signal into sub-bands, it can happen that a signal component at the boundary of two sub-bands results in a representation in both sub-band signals. This double representation of signal components can lead to several problems when coding these components. According to the present invention it is proposed to use preventing means (46, 48, 58, 68; 88, 92, 96) to avoid signal components to have multiple representations.
-
公开(公告)号:EP2030199B1
公开(公告)日:2009-10-28
申请号:EP07735902.4
申请日:2007-05-15
IPC分类号: G10L19/06
CPC分类号: G10L19/06 , G10L21/0264
摘要: An apparatus for linear predictive coding of an audio signal comprises a segmentation processor (201) which generates signal segments for the audio signal. An autocorrelation processor (401) for generates a first autocorrelation sequence for each signal segment and a modification processor (403) generates a second autocorrelation sequence for each signal segment by modifying the first autocorrelation sequence in response to at least one psychoacoustic characteristic. A prediction coefficient processor (405) determines linear predictive coding coefficients for each signal segment in response to the second autocorrelation sequence. The invention allows a low complexity linear encoding which takes into account psychoacoustic considerations thereby allowing an improved perceived coding quality for a given data rate.
-
-
-
-
-
-
-
-
-