Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field
    1.
    发明授权
    Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field 有权
    用于对2或3维声场的核心表示的连续帧进行编码和解码的方法和装置

    公开(公告)号:US09397771B2

    公开(公告)日:2016-07-19

    申请号:US13333461

    申请日:2011-12-21

    IPC分类号: H04R5/00 H04H20/89 G10L19/008

    CPC分类号: H04H20/89 G10L19/008

    摘要: Representations of spatial audio scenes using higher-order Ambisonics HOA technology typically require a large number of coefficients per time instant. This data rate is too high for most practical applications that require real-time transmission of audio signals. According to the invention, the compression is carried out in spatial domain instead of HOA domain. The (N+1)2 input HOA coefficients are transformed into (N+1)2 equivalent signals in spatial domain, and the resulting (N+1)2 time-domain signals are input to a bank of parallel perceptual codecs. At decoder side, the individual spatial-domain signals are decoded, and the spatial-domain coefficients are transformed back into HOA domain in order to recover the original HOA representation.

    摘要翻译: 使用高阶Ambisonics HOA技术的空间音频场景的表示通常每个时刻需要大量的系数。 对于需要实时传输音频信号的大多数实际应用,此数据速率太高。 根据本发明,压缩在空间域中而不是HOA域进行。 (N + 1)2个输入HOA系数在空间域中变换为(N + 1)2个等效信号,并将得到的(N + 1)2个时域信号输入到一组并行感知编解码器。 在解码器侧,对各个空间域信号进行解码,并将空间域系数变换回到HOA域,以恢复原始的HOA表示。

    METHOD AND APPARATUS FOR ENCODING AND DECODING SUCCESSIVE FRAMES OF AN AMBISONICS REPRESENTATION OF A 2- OR 3-DIMENSIONAL SOUND FIELD
    2.
    发明申请
    METHOD AND APPARATUS FOR ENCODING AND DECODING SUCCESSIVE FRAMES OF AN AMBISONICS REPRESENTATION OF A 2- OR 3-DIMENSIONAL SOUND FIELD 有权
    用于编码和解码二维或三维声场的健康代表的后续框架的方法和装置

    公开(公告)号:US20120155653A1

    公开(公告)日:2012-06-21

    申请号:US13333461

    申请日:2011-12-21

    IPC分类号: H04R5/00

    CPC分类号: H04H20/89 G10L19/008

    摘要: Representations of spatial audio scenes using higher-order Ambisonics HOA technology typically require a large number of coefficients per time instant. This data rate is too high for most practical applications that require real-time transmission of audio signals. According to the invention, the compression is carried out in spatial domain instead of HOA domain. The (N+1)2 input HOA coefficients are transformed into (N+1)2 equivalent signals in spatial domain, and the resulting (N+1)2 time-domain signals are input to a bank of parallel perceptual codecs. At decoder side, the individual spatial-domain signals are decoded, and the spatial-domain coefficients are transformed back into HOA domain in order to recover the original HOA representation.

    摘要翻译: 使用高阶Ambisonics HOA技术的空间音频场景的表示通常每个时刻需要大量的系数。 对于需要实时传输音频信号的大多数实际应用,此数据速率太高。 根据本发明,压缩在空间域中而不是HOA域进行。 (N + 1)2个输入HOA系数在空间域中变换为(N + 1)2个等效信号,并将得到的(N + 1)2个时域信号输入到一组并行感知编解码器。 在解码器侧,对各个空间域信号进行解码,并将空间域系数变换回到HOA域,以恢复原始的HOA表示。

    DATA STRUCTURE FOR HIGHER ORDER AMBISONICS AUDIO DATA
    4.
    发明申请
    DATA STRUCTURE FOR HIGHER ORDER AMBISONICS AUDIO DATA 有权
    数据结构用于更高级别的AMBISONICS音频数据

    公开(公告)号:US20130216070A1

    公开(公告)日:2013-08-22

    申请号:US13883094

    申请日:2011-10-26

    IPC分类号: H04R5/02

    摘要: The invention is related to a data structure for Higher Order Ambisonics HOA audio data, which data structure includes 2D or 3D spatial audio content data for one or more different HOA audio data stream descriptions. The HOA audio data can have on order of greater than ‘3’, and the data structure in addition can include single audio signal source data and/or microphone array audio data from fixed or time-varying spatial positions.

    摘要翻译: 本发明涉及用于高阶组合HOA音频数据的数据结构,该数据结构包括用于一个或多个不同HOA音频数据流描述的2D或3D空间音频内容数据。 HOA音频数据可以具有大于“3”的顺序,并且数据结构另外可以包括来自固定或时变空间位置的单个音频信号源数据和/或麦克风阵列音频数据。

    Method and Apparatus for Lossless Encoding of a Source Signal Using a Lossy Encoded Data Stream and a Lossless Extension Data Stream
    5.
    发明申请
    Method and Apparatus for Lossless Encoding of a Source Signal Using a Lossy Encoded Data Stream and a Lossless Extension Data Stream 有权
    使用有损编码数据流和无损扩展数据流的源信号的无损编码的方法和装置

    公开(公告)号:US20090164226A1

    公开(公告)日:2009-06-25

    申请号:US12226992

    申请日:2007-04-18

    IPC分类号: G10L19/04 G10L19/00

    CPC分类号: G10L19/24 G10L19/0017

    摘要: In lossy based lossless coding a PCM audio signal passes through a lossy encoder to a lossy decoder. The lossy encoder provides a lossy bit stream. The difference signal between the PCM signal and the lossy decoder output is lossless encoded, providing an extension bit stream. The invention facilitates enhancing a lossy perceptual audio encoding/decoding by an extension that enables mathematically exact reproduction of the original waveform using enhanced de-correlation, and provides additional data for reconstructing at decoder site an intermediate-quality audio signal. The lossless extension can be used to extend the widely used mp3 encoding/decoding to lossless encoding/decoding and superior quality mp3 encoding/de-coding.

    摘要翻译: 在基于有损耗的无损编码中,PCM音频信号通过有损编码器到有损解码器。 有损编码器提供有损比特流。 PCM信号和有损解码器输出之间的差分信号是无损编码的,提供扩展位流。 本发明有助于通过扩展来增强有损感知音频编码/解码,其使得能够使用增强的去相关来数学地精确地再现原始波形,并且提供用于在解码器位置重建中等质量音频信号的附加数据。 无损扩展可用于将广泛使用的mp3编码/解码扩展到无损编码/解码以及优质的mp3编码/解码。

    Method and apparatus for lossless encoding of a source signal using a lossy encoded data stream and a lossless extension data stream
    6.
    发明授权
    Method and apparatus for lossless encoding of a source signal using a lossy encoded data stream and a lossless extension data stream 有权
    使用有损编码数据流和无损扩展数据流对源信号进行无损编码的方法和装置

    公开(公告)号:US08428941B2

    公开(公告)日:2013-04-23

    申请号:US12226992

    申请日:2007-04-18

    IPC分类号: G10L19/00 G10L19/12 G10L19/02

    CPC分类号: G10L19/24 G10L19/0017

    摘要: In lossy based lossless coding a PCM audio signal passes through a lossy encoder to a lossy decoder. The lossy encoder provides a lossy bit stream. The difference signal between the PCM signal and the lossy decoder output is lossless encoded, providing an extension bit stream. The invention facilitates enhancing a lossy perceptual audio encoding/decoding by an extension that enables mathematically exact reproduction of the original waveform using enhanced de-correlation, and provides additional data for reconstructing at decoder site an intermediate-quality audio signal. The lossless extension can be used to extend the widely used mp3 encoding/decoding to lossless encoding/decoding and superior quality mp3 encoding/de-coding.

    摘要翻译: 在基于有损耗的无损编码中,PCM音频信号通过有损编码器到有损解码器。 有损编码器提供有损比特流。 PCM信号和有损解码器输出之间的差分信号是无损编码的,提供扩展位流。 本发明有助于通过扩展来增强有损感知音频编码/解码,其使得能够使用增强的去相关来数学地精确地再现原始波形,并且提供用于在解码器位置重建中等质量音频信号的附加数据。 无损扩展可用于将广泛使用的mp3编码/解码扩展到无损编码/解码以及优质的mp3编码/解码。

    Method and apparatus for lossless encoding of a source signal, using a lossy encoded data steam and a lossless extension data stream
    7.
    发明授权
    Method and apparatus for lossless encoding of a source signal, using a lossy encoded data steam and a lossless extension data stream 失效
    使用有损编码数据蒸汽和无损扩展数据流对信号源进行无损编码的方法和装置

    公开(公告)号:US08326618B2

    公开(公告)日:2012-12-04

    申请号:US12227045

    申请日:2007-04-18

    IPC分类号: G10L21/00

    摘要: In lossy based lossless coding a PCM audio signal passes through a lossy encoder to a lossy decoder. The lossy encoder provides a lossy bit stream. The lossy decoder also provides side information that is used to control the coefficients of a prediction filter that de-correlates the difference signal between the PCM signal and the lossy decoder output. The de-correlated difference signal is lossless encoded, providing an extension bit stream. Instead of, or in addition to, de-correlating in the time domain, a de-correlation in the frequency domain using spectral whitening can be performed. The lossy encoded bit stream together with the lossless encoded extension bit stream form a lossless encoded bitstream. The invention facilitates enhancing a lossy perceptual audio encoding/decoding by an extension that enables mathematically exact reproduction of the original waveform, and provides additional data for reconstructing at decoder site an intermediate-quality audio signal. The lossless extension can be used to extend the widely used mp3 encoding/decoding to lossless encoding/decoding and superior quality mp3 encoding/decoding.

    摘要翻译: 在基于有损耗的无损编码中,PCM音频信号通过有损编码器到有损解码器。 有损编码器提供有损比特流。 有损解码器还提供用于控制将PCM信号和有损解码器输出之间的差信号去相关的预测滤波器的系数的侧信息。 去相关差分信号是无损编码的,提供扩展比特流。 代替或者除了在时域中去相关之外,可以执行使用频谱白化的频域中的去相关。 有损编码比特流与无损编码扩展比特流一起形成无损编码比特流。 本发明有助于通过能够在数字上准确再现原始波形的扩展来增强有损感知音频编码/解码,并提供用于在解码器位置重建中等质量音频信号的附加数据。 无损扩展可用于将广泛使用的mp3编码/解码扩展到无损编码/解码以及优质的mp3编码/解码。

    AUDIO BITSTREAM DATA STRUCTURE ARRANGEMENT OF A LOSSY ENCODED SIGNAL TOGETHER WITH LOSSLESS ENCODED EXTENSION DATA FOR SAID SIGNAL
    8.
    发明申请
    AUDIO BITSTREAM DATA STRUCTURE ARRANGEMENT OF A LOSSY ENCODED SIGNAL TOGETHER WITH LOSSLESS ENCODED EXTENSION DATA FOR SAID SIGNAL 有权
    音频比特数据结构编码信号丢失编码信号与无噪声编码扩展数据的信号

    公开(公告)号:US20090240506A1

    公开(公告)日:2009-09-24

    申请号:US12309370

    申请日:2007-07-05

    IPC分类号: G10L21/00

    CPC分类号: G10L19/167

    摘要: Lossless compression algorithms can only exploit redundancies of the original audio signal to reduce the data rate, but not irrelevancies as identified by psycho-acoustics. Lossless audio coding schemes apply a filter or transform for decorrelation and then encode the transformed signal. The encoded bit stream comprises the parameters of the transform or filter, and the lossless representation of the transformed signal. However, in case of lossy based lossless coding the additional amount of information exceeds the amount of data for the base layer by a multiple of the base layer data amount. Therefore the additional data cannot be packed completely into the base layer data stream e.g. as ancillary data. The at least two data streams resulting from the combination of lossy coding format with a lossless coding extension are the base layer containing the lossy coding information and the enhancement data stream for rebuilding the mathematically lossless original input signal. Furthermore several intermediate quality layers are possible. However, these data streams are not independent from each other Every higher layer depends on the lower layers and can only be reasonably decoded in combination with these lower layers. According to the invention, a special combination of one-time header information with repeated header information in a block structure is used, which kind of combination depends on the type of application. Assignment information data identify the different parts or layers of the lossless format belonging to one input signal. Synchronisation data are used to combine the different data streams or parts or layers to a single lossless or intermediate output signal. These features are used in a file format and in a streaming format.

    摘要翻译: 无损压缩算法只能利用原始音频信号的冗余度来降低数据速率,而不是由心理声学识别的无关紧要。 无损音频编码方案应用滤波器或变换进行去相关,然后对经变换的信号进行编码。 编码比特流包括变换或滤波器的参数以及变换信号的无损表示。 然而,在基于有损耗的无损编码的情况下,附加信息量超过基层的数据量乘以基本层数据量的倍数。 因此,附加数据不能完全包装到基本层数据流中。 作为辅助数据。 由有损编码格式与无损编码扩展的组合产生的至少两个数据流是包含用于重建数学无损原始输入信号的有损编码信息和增强数据流的基本层。 此外,几个中等质量的层是可能的。 然而,这些数据流彼此不是独立的每一个更高层依赖于较低的层,并且只能与这些较低的层组合地合理解码。 根据本发明,使用具有块结构中的重复标题信息的一次头信息的特殊组合,哪种组合取决于应用的类型。 分配信息数据标识属于一个输入信号的无损格式的不同部分或多个层。 同步数据用于将不同的数据流或部分或多个层组合成单个无损或中间的输出信号。 这些功能以文件格式和流格式使用。

    Audio data structure for lossy and lossless encoded extension data
    9.
    发明授权
    Audio data structure for lossy and lossless encoded extension data 有权
    用于有损和无损编码扩展数据的音频数据结构

    公开(公告)号:US08326639B2

    公开(公告)日:2012-12-04

    申请号:US12309370

    申请日:2007-07-05

    IPC分类号: G10L19/02 H04B1/66

    CPC分类号: G10L19/167

    摘要: Lossless audio coding performs decorrelation and encodes the transformed signal. The encoded bit stream comprises de-correlation parameters and the lossless representation data of the transformed signal. However, in the case of lossy based lossless coding, the additional amount of information exceeds the base layer amount of data. Therefore the additional data cannot be packed completely into the base layer e.g. as ancillary data. The data streams resulting from the combination of lossy coding format with a lossless coding extension are the base layer containing the lossy coding information and the enhancement data stream for rebuilding the mathematically lossless original input signal. Every higher layer depends on the lower layers and can only be reasonably decoded in combination with these lower layers. According to the invention, a special combination of one-time header information with repeated header information in a block structure is used. Assignment information data identify the different layers.

    摘要翻译: 无损音频编码执行解相关并对变换的信号进行编码。 编码比特流包括去相关参数和变换信号的无损表示数据。 然而,在基于有损耗的无损编码的情况下,附加信息量超过基层数据量。 因此,附加数据不能完全包装到基本层中。 作为辅助数据。 由有损编码格式与无损编码扩展的组合产生的数据流是包含用于重建数学无损原始输入信号的有损编码信息和增强数据流的基本层。 每个更高的层取决于较低的层,并且只能与这些较低层组合地合理解码。 根据本发明,使用具有块结构中的重复标题信息的一次头信息的特殊组合。 分配信息数据标识不同的层。

    Method and Apparatus for Lossless Encoding of a Source Signal, Using a Lossy Encoded Data Steam and a Lossless Extension Data Stream
    10.
    发明申请
    Method and Apparatus for Lossless Encoding of a Source Signal, Using a Lossy Encoded Data Steam and a Lossless Extension Data Stream 失效
    用于信源的无损编码的方法和装置,使用有损编码数据蒸汽和无损扩展数据流

    公开(公告)号:US20090177478A1

    公开(公告)日:2009-07-09

    申请号:US12227045

    申请日:2007-04-18

    IPC分类号: G10L19/14 G10L19/00

    摘要: In lossy based lossless coding a PCM audio signal passes through a lossy encoder to a lossy decoder. The lossy encoder provides a lossy bit stream. The lossy decoder also provides side information that is used to control the coefficients of a prediction filter that de-correlates the difference signal between the PCM signal and the lossy decoder output. The de-correlated difference signal is lossless encoded, providing an extension bit stream. Instead of, or in addition to, de-correlating in the time domain, a de-correlation in the frequency domain using spectral whitening can be performed. The lossy encoded bit stream together with the lossless encoded extension bit stream form a lossless encoded bitstream. The invention facilitates enhancing a lossy perceptual audio encoding/decoding by an extension that enables mathematically exact reproduction of the original waveform, and provides additional data for reconstructing at decoder site an intermediate-quality audio signal. The lossless extension can be used to extend the widely used mp3 encoding/decoding to lossless encoding/decoding and superior quality mp3 encoding/decoding.

    摘要翻译: 在基于有损耗的无损编码中,PCM音频信号通过有损编码器到有损解码器。 有损编码器提供有损比特流。 有损解码器还提供用于控制将PCM信号和有损解码器输出之间的差信号去相关的预测滤波器的系数的侧信息。 去相关差分信号是无损编码的,提供扩展比特流。 代替或者除了在时域中去相关之外,可以执行使用频谱白化的频域中的去相关。 有损编码比特流与无损编码扩展比特流一起形成无损编码比特流。 本发明有助于通过能够在数字上准确再现原始波形的扩展来增强有损感知音频编码/解码,并提供用于在解码器位置重建中等质量音频信号的附加数据。 无损扩展可用于将广泛使用的mp3编码/解码扩展到无损编码/解码以及优质的mp3编码/解码。