MULTI-CHANNEL AUDIO CODING
    21.
    发明申请
    MULTI-CHANNEL AUDIO CODING 有权
    多声道音频编码

    公开(公告)号:US20100153097A1

    公开(公告)日:2010-06-17

    申请号:US11909730

    申请日:2006-03-16

    IPC分类号: G10L19/00

    摘要: A multi-channel audio encoder (10) for encoding a multi-channel audio signal (101), e.g. a 5.1 channel audio signal, into a spatial down-mix (102), e.g. a stereo signal, and associated parameters (104, 105). The encoder (10) comprises first and second units (110, 120). The first unit (110) encodes the multi-channel audio signal (101) into the spatial down-mix (102) and parameters (104). These parameters (104) enable a multi-channel decoder (20) to reconstruct the multi-channel audio signal (203) from the spatial down-mix (102). The second unit (120) generates, from the spatial down-mix (102), parameters (105) that enable the decoder to reconstruct the spatial down-mix (202) from an alternative down-mix (103), e.g. a so-called artistic down-mix that has been manually mixed in a sound studio. In this way, the decoder (20) can efficiently deal with a situation in which an alternative down-mix (103) is received instead of the regular spatial, down-mix (102). In the decoder (20), first the spatial down-mix (202) is reconstructed from the alternative down-mix (103) and the parameters (105). Next, the spatial down-mix (202) is decoded into the multi-channel audio signal (203).

    摘要翻译: 一种用于对多声道音频信号(101)进行编码的多声道音频编码器(10)。 5.1声道音频信号转换成空间缩混(102)。 立体声信号和相关参数(104,105)。 编码器(10)包括第一和第二单元(110,120)。 第一单元(110)将多声道音频信号(101)编码成空间缩混(102)和参数(104)。 这些参数(104)使得多声道解码器(20)能够从空间缩混(102)重建多声道音频信号(203)。 第二单元(120)从空间缩混(102)生成参数(105),使参数(105)能够使解码器从替代的下混合(103)重构空间缩混(202)。 所谓的艺术混合,已经在一个健全的工作室手工混合。 以这种方式,解码器(20)可以有效地处理接收替代的下混合(103)而不是常规的空间,混合(102)的情况。 在解码器(20)中,首先,从替代的下混合(103)和参数(105)重构空间缩混(202)。 接下来,空间缩混(202)被解码为多声道音频信号(203)。

    Audio coding via creation of sinusoidal tracks and phase determination
    22.
    发明授权
    Audio coding via creation of sinusoidal tracks and phase determination 有权
    通过创建正弦曲线进行音频编码和相位确定

    公开(公告)号:US07664633B2

    公开(公告)日:2010-02-16

    申请号:US10536228

    申请日:2003-11-06

    CPC分类号: G10L19/093

    摘要: Coding of an audio signal represented by a respective set of sampled signal values for each of a plurality of sequential segments is disclosed. The sampled signal values are analyzed (40) to determine one or more sinusoidal components for each of the plurality of sequential segments. The sinusoidal components are linked (42) across a plurality of sequential segments to provide sinusoidal tracks. For each sinusoidal track, a phase comprising a generally monotonically changing value is determined and an encoded audio stream including sinusoidal codes (r) representing said phase is generated (46).

    摘要翻译: 公开了由多个连续段中的每一个的相应的一组采样信号值表示的音频信号的编码。 分析采样信号值(40)以确定多个连续段中的每一个的一个或多个正弦分量。 正弦分量连接(42)跨越多个顺序段以提供正弦曲线。 对于每个正弦曲线,确定包括通常单调变化的值的相位,并且生成包括表示所述相位的正弦码(r)的编码音频流(46)。

    Parametric coding of an audio or speech signal

    公开(公告)号:US07050970B2

    公开(公告)日:2006-05-23

    申请号:US10046472

    申请日:2002-01-14

    IPC分类号: G10L19/04 G10L19/00

    CPC分类号: G10L19/0204

    摘要: An encoder includes a segmentation unit for segmenting an audio or speech signal into at least one segment and a calculation unit for calculating sinusoidal code data in the form of frequency and amplitude data of a given extension from the segment such that the extension approximates the segment for a given criterion. The calculation of the sinusoidal code data θki, dji and eji for the segment x(n) is carried out according to the following extension {circumflex over (x)}: x ⋒ = ∑ i = 1 L ⁢ ⁢ ∑ j = 0 J - 1 ⁢ ⁢ [ d j i ⁢ f j ⁡ ( n ) ⁢ cos ⁡ ( Θ i ⁡ ( n ) ) + e j i ⁢ f j ⁡ ( n ) ⁢ sin ( Θ i ⁡ ( n ) ] . Fig . ⁢ 1.

    Transmission system with improved encoder and decoder that prevents multiple representations of signal components from occurring
    24.
    发明授权
    Transmission system with improved encoder and decoder that prevents multiple representations of signal components from occurring 失效
    具有改进的编码器和解码器的传输系统,防止发生信号分量的多个表示

    公开(公告)号:US06654723B1

    公开(公告)日:2003-11-25

    申请号:US09830377

    申请日:2001-04-25

    IPC分类号: G10L1902

    CPC分类号: G10L19/0208

    摘要: In a sinusoidal audio encoder it is known to use different time scales for analyzing different parts of the frequency spectrum. In prior art encoders sub-band filtering is used to split the input signal into a number of sub bands. By splitting the input signal into sub-bands, it can happen that a signal component at the boundary of two sub-bands results in a representation in both sub-band signals. This double representation of signal components can lead to several problems when coding these components. According to the present invention it is proposed to use preventing means (46, 48, 58, 68; 88, 92, 96) to avoid signal components to have multiple representations.

    摘要翻译: 在正弦音频编码器中,已知使用不同的时标来分析频谱的不同部分。 在现有技术中,编码器使用子带滤波将输入信号分割成多个子带。 通过将输入信号分割为子带,可能发生两个子带边界处的信号分量导致两个子带信号中的表示。 信号分量的这种双重表示可能导致编码这些组件时的几个问题。 根据本发明,提出了使用防止装置(46,48,58,68; 88,92,96)来避免信号分量具有多个表示。

    Audio encoding
    25.
    发明授权
    Audio encoding 失效
    音频编码

    公开(公告)号:US07725310B2

    公开(公告)日:2010-05-25

    申请号:US10575428

    申请日:2004-10-04

    IPC分类号: G10L19/00

    CPC分类号: G10L19/032 G10L19/093

    摘要: Coding of an audio signal (x) represented by a respective set of sampled signal values (x(t)) for each of a plurality of sequential time segments is disclosed. The sampled signal values are analyzed to determine one or more sinusoidal components for each of the plurality of sequential segments. The sinusoidal components are linked across a plurality of sequential segments to provide sinusoidal tracks, where each track comprises a number of frames. An encoded signal (AS) is generated, including sinusoidal codes (Cs) comprising a representation level (r) for each frame or including sinusoidal codes (Cs) where some of these codes comprise a phase (φ), a frequency (ω) and a quantization table (Q) for a given frame when the given frame is designated as a random-access frame. The invention allows random access in a track while avoiding long adaptation of the quantization accuracy in a quantizer and/or the need for a large bit stream while still maintaining improved audio quality.

    摘要翻译: 公开了对于多个连续时间段中的每一个对由相应的一组采样信号值(x(t))表示的音频信号(x)的编码。 分析采样的信号值以确定多个顺序段中的每一个的一个或多个正弦分量。 正弦分量连接在多个顺序段上以提供正弦曲线,其中每个轨道包括多个帧。 生成编码信号(AS),其包括包括用于每个帧的表示级(r)的正弦码(Cs)或包括正弦码(Cs),其中这些码中的一些包括相位(ω),频率(ω) 以及当给定帧被指定为随机接入帧时的给定帧的量化表(Q)。 本发明允许在轨道中随机访问,同时避免量化器中的量化精度的长时间适应性和/或对大比特流的需要,同时仍然保持改善的音频质量。

    PREDICTIVE ENCODING OF A MULTI CHANNEL SIGNAL
    26.
    发明申请
    PREDICTIVE ENCODING OF A MULTI CHANNEL SIGNAL 审中-公开
    多通道信号的预测编码

    公开(公告)号:US20090281798A1

    公开(公告)日:2009-11-12

    申请号:US11915004

    申请日:2006-05-09

    IPC分类号: G10L19/04

    CPC分类号: G10L19/008 G10L19/06

    摘要: A multi channel encoder (100) comprises a multi channel linear predictive analyzer (105) for linear predictive coding of a multi channel signal. A prediction controller (101) comprises a prediction parameter generator (301) which generates linear prediction coding parameter matrices for the multi channel signal which are then mapped to reflection matrices. The reflection matrices may specifically be normalized backward or forward reflection matrices. The reflection matrices are encoded by a reflection parameter encoder (305) and combined with other encoded data in a multiplexer (109) to generate encoded data for the multi channel signal. The reflection parameter encoder (305) may specifically decompose the reflection matrices using an Eigenvalue decomposition or a singular value decomposition and the resulting data may be quantized for transmission. A decoder (200) receives the encoded data and obtains the prediction parameters by performing the inverse operation.

    摘要翻译: 多通道编码器(100)包括用于多通道信号的线性预测编码的多通道线性预测分析器(105)。 预测控制器(101)包括预测参数生成器(301),该预测参数生成器生成多信道信号的线性预测编码参数矩阵,然后映射到反射矩阵。 反射矩阵可以具体地是归一化的反向或正向反射矩阵。 反射矩阵由反射参数编码器(305)编码并与多路复用器(109)中的其它编码数据组合以产生用于多通道信号的编码数据。 反射参数编码器(305)可以使用特征值分解或奇异值分解来具体地分解反射矩阵,并且可以量化所得到的数据以进行传输。 解码器(200)接收编码数据并通过执行逆运算来获得预测参数。

    Quality of decoded audio by adding noise
    27.
    发明授权
    Quality of decoded audio by adding noise 失效
    通过增加噪声对解码音频的质量

    公开(公告)号:US07548852B2

    公开(公告)日:2009-06-16

    申请号:US10562359

    申请日:2004-06-25

    IPC分类号: G10L19/00

    CPC分类号: G10L21/038

    摘要: The present invention relates to a method of encoding and decoding an audio signal. The invention further relates to an arrangement for encoding and decoding an audio signal. The invention further relates to a computer-readable medium comprising a data record indicative of an audio signal and a device for communicating an audio signal having been encoded according to the present invention. By the method of encoding, a double description of the signal is obtained, where the encoding comprises two encoding steps, a first standard encoding and an additional second encoding. The second encoding is able to give a coarse description of the signal, such that a stochastic realization can be made and appropriate parts can be added to the decoded signal from the first decoding. The required description of the second encoder in order to make the realization of a stochastic signal possible requires a relatively low bit rate, while other double/multiple descriptions require a much higher bit rate.

    摘要翻译: 本发明涉及一种对音频信号进行编码和解码的方法。 本发明还涉及用于对音频信号进行编码和解码的装置。 本发明还涉及包括指示音频信号的数据记录的计算机可读介质和用于传送根据本发明已被编码的音频信号的设备。 通过编码的方法,获得对信号的双重描述,其中编码包括两个编码步骤,第一标准编码和附加的第二编码。 第二编码能够给出信号的粗略描述,使得可以进行随机实现,并且可以从第一解码将合适的部分添加到解码信号。 为了实现随机信号,第二编码器的所需描述可能需要相对低的比特率,而其他双重/多重描述需要高得多的比特率。

    Audio Encoding and Decoding
    29.
    发明申请
    Audio Encoding and Decoding 审中-公开
    音频编码和解码

    公开(公告)号:US20080275709A1

    公开(公告)日:2008-11-06

    申请号:US11570539

    申请日:2005-06-15

    IPC分类号: G10L19/00 G06F17/00

    CPC分类号: G10L19/10 G10L19/113

    摘要: A method of encoding a digital audio signal, wherein for each time segment the signal is spectrally flattened to obtain a spectrally flattened signal (r) and possibly spectral flattening parameters (LPP). The spectrally flattened signal is modelled by an excitation signal comprising a first partial excitation signal (px) conforming to an excitation signal generated by an RPE or CELP technique, and a second partial excitation signal (PEp) being a set of extra pulses with arbitrary positions and amplitudes. An audio bit stream as comprising the first and second partial excitation signals is generated. The extra pulses can be added to the excitation signal at positions in time that correspond to the time of occurrence of the spike, or preferably at positions in time of an RPE time grid.

    摘要翻译: 一种对数字音频信号进行编码的方法,其中对于每个时间段,信号被频谱平坦化以获得频谱平坦的信号(r)和可能的频谱平坦化参数(LPP)。 频谱平坦化的信号由包括符合由RPE或CELP技术产生的激励信号的第一部分激励信号(p SUB>)和第二部分激励信号(P p 产生包括第一和第二部分激励信号的音频比特流。 额外的脉冲可以在对应于尖峰发生时间的时间点,或优选地在RPE时间网格的时间位置处被加到激励信号上。