DATA STRUCTURE FOR HIGHER ORDER AMBISONICS AUDIO DATA
    1.
    发明申请
    DATA STRUCTURE FOR HIGHER ORDER AMBISONICS AUDIO DATA 有权
    数据结构用于更高级别的AMBISONICS音频数据

    公开(公告)号:US20130216070A1

    公开(公告)日:2013-08-22

    申请号:US13883094

    申请日:2011-10-26

    IPC分类号: H04R5/02

    摘要: The invention is related to a data structure for Higher Order Ambisonics HOA audio data, which data structure includes 2D or 3D spatial audio content data for one or more different HOA audio data stream descriptions. The HOA audio data can have on order of greater than ‘3’, and the data structure in addition can include single audio signal source data and/or microphone array audio data from fixed or time-varying spatial positions.

    摘要翻译: 本发明涉及用于高阶组合HOA音频数据的数据结构,该数据结构包括用于一个或多个不同HOA音频数据流描述的2D或3D空间音频内容数据。 HOA音频数据可以具有大于“3”的顺序,并且数据结构另外可以包括来自固定或时变空间位置的单个音频信号源数据和/或麦克风阵列音频数据。

    Method and device for decoding an audio soundfield representation for audio playback
    3.
    发明授权
    Method and device for decoding an audio soundfield representation for audio playback 有权
    用于解码用于音频回放的音频声场表示的方法和设备

    公开(公告)号:US09100768B2

    公开(公告)日:2015-08-04

    申请号:US13634859

    申请日:2011-03-25

    IPC分类号: H04R5/00 H04S3/02 G10L19/008

    摘要: Soundfield signals as such e.g. Ambisonics carry a representation of a desired sound field. The Ambisonics format is based on spherical harmonic decomposition of the soundfield, and Higher Order Ambisonics uses spherical harmonics of at least 2nd order. However, commonly used loudspeaker setups are irregular and lead to problems in decoder design. A method for improved decoding an audio soundfield representation for audio playback comprises calculating a panning function using a geometrical method based on the positions of a plurality of loudspeakers and a plurality of source directions, calculating a mode matrix Ξ from the loudspeaker positions, calculating a pseudo-inverse mode matrix μ+ and decoding the audio soundfield representation. The decoding is based on a decode matrix that is obtained from the panning function and the pseudo-inverse mode matrix Ξ+.

    摘要翻译: Soundfield信号如此。 Ambisonics携带所需声场的表示。 Ambisonics格式基于声场的球谐函数分解,高阶Ambisonics使用至少二阶的球谐函数。 然而,常用的扬声器设置是不规则的,并导致解码器设计中的问题。 用于改进对用于音频回放的音频声场表示进行解码的方法包括使用基于多个扬声器和多个源方向的位置的几何方法来计算平移功能,计算模式矩阵& Xgr; 从扬声器位置,计算伪逆模式矩阵μ+并解码音频声场表示。 解码基于从平移功能和伪逆模式矩阵& Xgr; +获得的解码矩阵。

    METHOD AND DEVICE FOR DECODING AN AUDIO SOUNDFIELD REPRESENTATION FOR AUDIO PLAYBACK
    4.
    发明申请
    METHOD AND DEVICE FOR DECODING AN AUDIO SOUNDFIELD REPRESENTATION FOR AUDIO PLAYBACK 有权
    用于解码用于音频播放的音频声音表示的方法和设备

    公开(公告)号:US20130010971A1

    公开(公告)日:2013-01-10

    申请号:US13634859

    申请日:2011-03-25

    IPC分类号: H04R5/00

    摘要: Soundfield signals such as e.g. Ambisonics carry a representation of a desired sound field. The Ambisonics format is based on spherical harmonic decomposition of the soundfield, and Higher Order Ambisonics uses spherical harmonics of at least 2nd order. However, commonly used loudspeaker setups are irregular and lead to problems in decoder design, A method for improved decoding an audio soundfield representation for audio playback comprises calculating a panning function using a geometrical method based on the positions of a plurality of loudspeakers and a plurality of source directions, calculating a mode matrix from the loudspeaker positions, calculating a pseudo-inverse mode matrix and decoding the audio soundfield representation. The decoding is based on a decode matrix that is obtained from the panning function and the pseudo-inverse mode matrix.

    摘要翻译: Soundfield信号,例如 Ambisonics携带所需声场的表示。 Ambisonics格式基于声场的球谐函数分解,高阶Ambisonics使用至少二阶的球谐函数。 然而,常用的扬声器设置是不规则的并且导致解码器设计中的问题。用于改进对用于音频播放的音频声场表示进行解码的方法包括使用基于多个扬声器和多个扬声器的位置的几何方法来计算平移功能 源方向,从扬声器位置计算模式矩阵,计算伪逆模式矩阵并解码音频声场表示。 解码基于从平移功能和伪逆模式矩阵获得的解码矩阵。

    Method and Apparatus for Lossless Encoding of a Source Signal Using a Lossy Encoded Data Stream and a Lossless Extension Data Stream
    5.
    发明申请
    Method and Apparatus for Lossless Encoding of a Source Signal Using a Lossy Encoded Data Stream and a Lossless Extension Data Stream 有权
    使用有损编码数据流和无损扩展数据流的源信号的无损编码的方法和装置

    公开(公告)号:US20090164226A1

    公开(公告)日:2009-06-25

    申请号:US12226992

    申请日:2007-04-18

    IPC分类号: G10L19/04 G10L19/00

    CPC分类号: G10L19/24 G10L19/0017

    摘要: In lossy based lossless coding a PCM audio signal passes through a lossy encoder to a lossy decoder. The lossy encoder provides a lossy bit stream. The difference signal between the PCM signal and the lossy decoder output is lossless encoded, providing an extension bit stream. The invention facilitates enhancing a lossy perceptual audio encoding/decoding by an extension that enables mathematically exact reproduction of the original waveform using enhanced de-correlation, and provides additional data for reconstructing at decoder site an intermediate-quality audio signal. The lossless extension can be used to extend the widely used mp3 encoding/decoding to lossless encoding/decoding and superior quality mp3 encoding/de-coding.

    摘要翻译: 在基于有损耗的无损编码中,PCM音频信号通过有损编码器到有损解码器。 有损编码器提供有损比特流。 PCM信号和有损解码器输出之间的差分信号是无损编码的,提供扩展位流。 本发明有助于通过扩展来增强有损感知音频编码/解码,其使得能够使用增强的去相关来数学地精确地再现原始波形,并且提供用于在解码器位置重建中等质量音频信号的附加数据。 无损扩展可用于将广泛使用的mp3编码/解码扩展到无损编码/解码以及优质的mp3编码/解码。

    Method and apparatus for lossless encoding of a source signal using a lossy encoded data stream and a lossless extension data stream
    6.
    发明授权
    Method and apparatus for lossless encoding of a source signal using a lossy encoded data stream and a lossless extension data stream 有权
    使用有损编码数据流和无损扩展数据流对源信号进行无损编码的方法和装置

    公开(公告)号:US08428941B2

    公开(公告)日:2013-04-23

    申请号:US12226992

    申请日:2007-04-18

    IPC分类号: G10L19/00 G10L19/12 G10L19/02

    CPC分类号: G10L19/24 G10L19/0017

    摘要: In lossy based lossless coding a PCM audio signal passes through a lossy encoder to a lossy decoder. The lossy encoder provides a lossy bit stream. The difference signal between the PCM signal and the lossy decoder output is lossless encoded, providing an extension bit stream. The invention facilitates enhancing a lossy perceptual audio encoding/decoding by an extension that enables mathematically exact reproduction of the original waveform using enhanced de-correlation, and provides additional data for reconstructing at decoder site an intermediate-quality audio signal. The lossless extension can be used to extend the widely used mp3 encoding/decoding to lossless encoding/decoding and superior quality mp3 encoding/de-coding.

    摘要翻译: 在基于有损耗的无损编码中,PCM音频信号通过有损编码器到有损解码器。 有损编码器提供有损比特流。 PCM信号和有损解码器输出之间的差分信号是无损编码的,提供扩展位流。 本发明有助于通过扩展来增强有损感知音频编码/解码,其使得能够使用增强的去相关来数学地精确地再现原始波形,并且提供用于在解码器位置重建中等质量音频信号的附加数据。 无损扩展可用于将广泛使用的mp3编码/解码扩展到无损编码/解码以及优质的mp3编码/解码。

    Method and apparatus for lossless encoding of a source signal, using a lossy encoded data steam and a lossless extension data stream
    7.
    发明授权
    Method and apparatus for lossless encoding of a source signal, using a lossy encoded data steam and a lossless extension data stream 失效
    使用有损编码数据蒸汽和无损扩展数据流对信号源进行无损编码的方法和装置

    公开(公告)号:US08326618B2

    公开(公告)日:2012-12-04

    申请号:US12227045

    申请日:2007-04-18

    IPC分类号: G10L21/00

    摘要: In lossy based lossless coding a PCM audio signal passes through a lossy encoder to a lossy decoder. The lossy encoder provides a lossy bit stream. The lossy decoder also provides side information that is used to control the coefficients of a prediction filter that de-correlates the difference signal between the PCM signal and the lossy decoder output. The de-correlated difference signal is lossless encoded, providing an extension bit stream. Instead of, or in addition to, de-correlating in the time domain, a de-correlation in the frequency domain using spectral whitening can be performed. The lossy encoded bit stream together with the lossless encoded extension bit stream form a lossless encoded bitstream. The invention facilitates enhancing a lossy perceptual audio encoding/decoding by an extension that enables mathematically exact reproduction of the original waveform, and provides additional data for reconstructing at decoder site an intermediate-quality audio signal. The lossless extension can be used to extend the widely used mp3 encoding/decoding to lossless encoding/decoding and superior quality mp3 encoding/decoding.

    摘要翻译: 在基于有损耗的无损编码中,PCM音频信号通过有损编码器到有损解码器。 有损编码器提供有损比特流。 有损解码器还提供用于控制将PCM信号和有损解码器输出之间的差信号去相关的预测滤波器的系数的侧信息。 去相关差分信号是无损编码的,提供扩展比特流。 代替或者除了在时域中去相关之外,可以执行使用频谱白化的频域中的去相关。 有损编码比特流与无损编码扩展比特流一起形成无损编码比特流。 本发明有助于通过能够在数字上准确再现原始波形的扩展来增强有损感知音频编码/解码,并提供用于在解码器位置重建中等质量音频信号的附加数据。 无损扩展可用于将广泛使用的mp3编码/解码扩展到无损编码/解码以及优质的mp3编码/解码。

    AUDIO BITSTREAM DATA STRUCTURE ARRANGEMENT OF A LOSSY ENCODED SIGNAL TOGETHER WITH LOSSLESS ENCODED EXTENSION DATA FOR SAID SIGNAL
    8.
    发明申请
    AUDIO BITSTREAM DATA STRUCTURE ARRANGEMENT OF A LOSSY ENCODED SIGNAL TOGETHER WITH LOSSLESS ENCODED EXTENSION DATA FOR SAID SIGNAL 有权
    音频比特数据结构编码信号丢失编码信号与无噪声编码扩展数据的信号

    公开(公告)号:US20090240506A1

    公开(公告)日:2009-09-24

    申请号:US12309370

    申请日:2007-07-05

    IPC分类号: G10L21/00

    CPC分类号: G10L19/167

    摘要: Lossless compression algorithms can only exploit redundancies of the original audio signal to reduce the data rate, but not irrelevancies as identified by psycho-acoustics. Lossless audio coding schemes apply a filter or transform for decorrelation and then encode the transformed signal. The encoded bit stream comprises the parameters of the transform or filter, and the lossless representation of the transformed signal. However, in case of lossy based lossless coding the additional amount of information exceeds the amount of data for the base layer by a multiple of the base layer data amount. Therefore the additional data cannot be packed completely into the base layer data stream e.g. as ancillary data. The at least two data streams resulting from the combination of lossy coding format with a lossless coding extension are the base layer containing the lossy coding information and the enhancement data stream for rebuilding the mathematically lossless original input signal. Furthermore several intermediate quality layers are possible. However, these data streams are not independent from each other Every higher layer depends on the lower layers and can only be reasonably decoded in combination with these lower layers. According to the invention, a special combination of one-time header information with repeated header information in a block structure is used, which kind of combination depends on the type of application. Assignment information data identify the different parts or layers of the lossless format belonging to one input signal. Synchronisation data are used to combine the different data streams or parts or layers to a single lossless or intermediate output signal. These features are used in a file format and in a streaming format.

    摘要翻译: 无损压缩算法只能利用原始音频信号的冗余度来降低数据速率,而不是由心理声学识别的无关紧要。 无损音频编码方案应用滤波器或变换进行去相关,然后对经变换的信号进行编码。 编码比特流包括变换或滤波器的参数以及变换信号的无损表示。 然而,在基于有损耗的无损编码的情况下,附加信息量超过基层的数据量乘以基本层数据量的倍数。 因此,附加数据不能完全包装到基本层数据流中。 作为辅助数据。 由有损编码格式与无损编码扩展的组合产生的至少两个数据流是包含用于重建数学无损原始输入信号的有损编码信息和增强数据流的基本层。 此外,几个中等质量的层是可能的。 然而,这些数据流彼此不是独立的每一个更高层依赖于较低的层,并且只能与这些较低的层组合地合理解码。 根据本发明,使用具有块结构中的重复标题信息的一次头信息的特殊组合,哪种组合取决于应用的类型。 分配信息数据标识属于一个输入信号的无损格式的不同部分或多个层。 同步数据用于将不同的数据流或部分或多个层组合成单个无损或中间的输出信号。 这些功能以文件格式和流格式使用。

    Audio data structure for lossy and lossless encoded extension data
    9.
    发明授权
    Audio data structure for lossy and lossless encoded extension data 有权
    用于有损和无损编码扩展数据的音频数据结构

    公开(公告)号:US08326639B2

    公开(公告)日:2012-12-04

    申请号:US12309370

    申请日:2007-07-05

    IPC分类号: G10L19/02 H04B1/66

    CPC分类号: G10L19/167

    摘要: Lossless audio coding performs decorrelation and encodes the transformed signal. The encoded bit stream comprises de-correlation parameters and the lossless representation data of the transformed signal. However, in the case of lossy based lossless coding, the additional amount of information exceeds the base layer amount of data. Therefore the additional data cannot be packed completely into the base layer e.g. as ancillary data. The data streams resulting from the combination of lossy coding format with a lossless coding extension are the base layer containing the lossy coding information and the enhancement data stream for rebuilding the mathematically lossless original input signal. Every higher layer depends on the lower layers and can only be reasonably decoded in combination with these lower layers. According to the invention, a special combination of one-time header information with repeated header information in a block structure is used. Assignment information data identify the different layers.

    摘要翻译: 无损音频编码执行解相关并对变换的信号进行编码。 编码比特流包括去相关参数和变换信号的无损表示数据。 然而,在基于有损耗的无损编码的情况下,附加信息量超过基层数据量。 因此,附加数据不能完全包装到基本层中。 作为辅助数据。 由有损编码格式与无损编码扩展的组合产生的数据流是包含用于重建数学无损原始输入信号的有损编码信息和增强数据流的基本层。 每个更高的层取决于较低的层,并且只能与这些较低层组合地合理解码。 根据本发明,使用具有块结构中的重复标题信息的一次头信息的特殊组合。 分配信息数据标识不同的层。

    Method and Apparatus for Lossless Encoding of a Source Signal, Using a Lossy Encoded Data Steam and a Lossless Extension Data Stream
    10.
    发明申请
    Method and Apparatus for Lossless Encoding of a Source Signal, Using a Lossy Encoded Data Steam and a Lossless Extension Data Stream 失效
    用于信源的无损编码的方法和装置,使用有损编码数据蒸汽和无损扩展数据流

    公开(公告)号:US20090177478A1

    公开(公告)日:2009-07-09

    申请号:US12227045

    申请日:2007-04-18

    IPC分类号: G10L19/14 G10L19/00

    摘要: In lossy based lossless coding a PCM audio signal passes through a lossy encoder to a lossy decoder. The lossy encoder provides a lossy bit stream. The lossy decoder also provides side information that is used to control the coefficients of a prediction filter that de-correlates the difference signal between the PCM signal and the lossy decoder output. The de-correlated difference signal is lossless encoded, providing an extension bit stream. Instead of, or in addition to, de-correlating in the time domain, a de-correlation in the frequency domain using spectral whitening can be performed. The lossy encoded bit stream together with the lossless encoded extension bit stream form a lossless encoded bitstream. The invention facilitates enhancing a lossy perceptual audio encoding/decoding by an extension that enables mathematically exact reproduction of the original waveform, and provides additional data for reconstructing at decoder site an intermediate-quality audio signal. The lossless extension can be used to extend the widely used mp3 encoding/decoding to lossless encoding/decoding and superior quality mp3 encoding/decoding.

    摘要翻译: 在基于有损耗的无损编码中,PCM音频信号通过有损编码器到有损解码器。 有损编码器提供有损比特流。 有损解码器还提供用于控制将PCM信号和有损解码器输出之间的差信号去相关的预测滤波器的系数的侧信息。 去相关差分信号是无损编码的,提供扩展比特流。 代替或者除了在时域中去相关之外,可以执行使用频谱白化的频域中的去相关。 有损编码比特流与无损编码扩展比特流一起形成无损编码比特流。 本发明有助于通过能够在数字上准确再现原始波形的扩展来增强有损感知音频编码/解码,并提供用于在解码器位置重建中等质量音频信号的附加数据。 无损扩展可用于将广泛使用的mp3编码/解码扩展到无损编码/解码以及优质的mp3编码/解码。