Audio encoding method and system for generating a unified bitstream decodable by decoders implementing different decoding protocols
    1.
    发明授权
    Audio encoding method and system for generating a unified bitstream decodable by decoders implementing different decoding protocols 有权
    音频编码方法和系统,用于通过实现不同解码协议的解码器生成统一的比特流解码

    公开(公告)号:US09378743B2

    公开(公告)日:2016-06-28

    申请号:US14009503

    申请日:2012-04-05

    CPC分类号: G10L19/002 G10L19/167

    摘要: In a class of embodiments, an audio encoding system (typically, a perceptual encoding system that is configured to generate a single (“unified”) bitstream that is compatible with (i.e., decodable by) a first decoder configured to decode audio data encoded in accordance with a first encoding protocol (e.g., the multichannel Dolby Digital Plus, or DD+, protocol) and a second decoder configured to decode audio data encoded in accordance with a second encoding protocol (e.g., the stereo AAC, HE AAC v1, or HE AAC v2 protocol). The unified bitstream can include both encoded data (e.g., bursts of data) decodable by the first decoder (and ignored by the second decoder) and encoded data (e.g., other bursts of data) decodable by the second decoder (and ignored by the first decoder). In effect, the second encoding format is hidden within the unified bitstream when the bitstream is decoded by the first decoder, and the first encoding format is hidden within the unified bitstream when the bitstream is decoded by the second decoder. The format of the unified bitstream generated in accordance with the invention may eliminate the need for transcoding elements throughout an entire media chain and/or ecosystem. Other aspects of the invention are an encoding method performed by any embodiment of the inventive encoder, a decoding method performed by any embodiment of the inventive decoder, and a computer readable medium (e.g., disc) which stores code for implementing any embodiment of the inventive method.

    摘要翻译: 在一类实施例中,音频编码系统(通常是感知编码系统,其被配置为生成与第一解码器兼容的(即可解码的)单个(“统一”)比特流,第一解码器被配置为对 根据第一编码协议(例如,多频道杜比数字+或DD +协议)和被配置为对根据第二编码协议(例如立体声AAC,HE AAC v1或HE)编码的音频数据进行解码的第二解码器 统一比特流可以包括可由第一解码器解码(并由第二解码器忽略)的可编码数据(例如,数据突发)和由第二解码器解码的编码数据(例如,其他数据突发) 并且被第一解码器忽略),实际上,当第一解码器对比特流进行解码时,第二编码格式被隐藏在统一比特流内,并且当比特流中第一编码格式被隐藏在统一比特流内时 令牌由第二解码器解码。 根据本发明生成的统一比特流的格式可以消除在整个媒体链和/或生态系统中对代码转换元素的需要。 本发明的其他方面是由本发明编码器的任何实施例执行的编码方法,由本发明解码器的任何实施例执行的解码方法,以及存储用于实现本发明的任何实施例的代码的计算机可读介质(例如,盘) 方法。

    AUDIO ENCODING METHOD AND SYSTEM FOR GENERATING A UNIFIED BITSTREAM DECODABLE BY DECODERS IMPLEMENTING DIFFERENT DECODING PROTOCOLS
    2.
    发明申请
    AUDIO ENCODING METHOD AND SYSTEM FOR GENERATING A UNIFIED BITSTREAM DECODABLE BY DECODERS IMPLEMENTING DIFFERENT DECODING PROTOCOLS 有权
    音视频编码方法和系统,用于生成由解码器实现的不同解码协议解码的统一的双绞线

    公开(公告)号:US20140358554A1

    公开(公告)日:2014-12-04

    申请号:US14009503

    申请日:2012-04-05

    IPC分类号: G10L19/002

    CPC分类号: G10L19/002 G10L19/167

    摘要: In a class of embodiments, an audio encoding system (typically, a perceptual encoding system that is configured to generate a single (“unified”) bitstream that is compatible with (i.e., decodable by) a first decoder configured to decode audio data encoded in accordance with a first encoding protocol (e.g., the multichannel Dolby Digital Plus, or DD+, protocol) and a second decoder configured to decode audio data encoded in accordance with a second encoding protocol (e.g., the stereo AAC, HE AAC v1, or HE AAC v2 protocol). The unified bitstream can include both encoded data (e.g., bursts of data) decodable by the first decoder (and ignored by the second decoder) and encoded data (e.g., other bursts of data) decodable by the second decoder (and ignored by the first decoder). In effect, the second encoding format is hidden within the unified bitstream when the bitstream is decoded by the first decoder, and the first encoding format is hidden within the unified bitstream when the bitstream is decoded by the second decoder. The format of the unified bitstream generated in accordance with the invention may eliminate the need for transcoding elements throughout an entire media chain and/or ecosystem. Other aspects of the invention are an encoding method performed by any embodiment of the inventive encoder, a decoding method performed by any embodiment of the inventive decoder, and a computer readable medium (e.g., disc) which stores code for implementing any embodiment of the inventive method.

    摘要翻译: 在一类实施例中,音频编码系统(通常是感知编码系统,其被配置为生成与第一解码器兼容的(即可解码的)单个(“统一”)比特流,第一解码器被配置为对 根据第一编码协议(例如,多频道杜比数字+或DD +协议)和被配置为对根据第二编码协议(例如立体声AAC,HE AAC v1或HE)编码的音频数据进行解码的第二解码器 统一比特流可以包括可由第一解码器解码(并由第二解码器忽略)的可编码数据(例如,数据突发)和由第二解码器解码的编码数据(例如,其他数据突发) 并且被第一解码器忽略),实际上,当第一解码器对比特流进行解码时,第二编码格式被隐藏在统一比特流内,并且当比特流中第一编码格式被隐藏在统一比特流内时 令牌由第二解码器解码。 根据本发明生成的统一比特流的格式可以消除在整个媒体链和/或生态系统中对代码转换元素的需要。 本发明的其他方面是由本发明编码器的任何实施例执行的编码方法,由本发明解码器的任何实施例执行的解码方法,以及存储用于实现本发明的任何实施例的代码的计算机可读介质(例如,盘) 方法。

    Method and System for Encoding Audio Data with Adaptive Low Frequency Compensation
    3.
    发明申请
    Method and System for Encoding Audio Data with Adaptive Low Frequency Compensation 有权
    用自适应低频补偿编码音频数据的方法和系统

    公开(公告)号:US20130179175A1

    公开(公告)日:2013-07-11

    申请号:US13588890

    申请日:2012-08-17

    IPC分类号: G10L19/00

    摘要: A method for determining mantissa bit allocation of frequency domain audio data to be encoded, including by performing adaptive low frequency compensation on each frequency band of a set of low frequency bands of the data. The low frequency compensation includes steps of: performing tonality detection on the audio data to generate compensation control data indicative of whether each frequency band in the set has prominent tonal content; and performing low frequency compensation on each frequency band in the set having prominent tonal content, including by correcting a preliminary masking value for each frequency band having prominent tonal content, but not performing low frequency compensation on the audio data in any other frequency band in the set. Other aspects are audio encoding methods including such tonality detection and low frequency compensation steps, and a system configured to perform any embodiment of the inventive method.

    摘要翻译: 一种用于确定要编码的频域音频数据的尾数位分配的方法,包括通过对数据的一组低频带的每个频带执行自适应低频补偿。 低频补偿包括以下步骤:对音频数据执行音调检测,以产生指示该组中的每个频带是否具有突出的音调内容的补偿控制数据; 并且对具有突出音调内容的集合中的每个频带执行低频补偿,包括通过校正具有突出音调内容的每个频带的初步屏蔽值,但不对低频补偿中的任何其他频带中的音频数据执行低频补偿 组。 其他方面是包括这种音调检测和低频补偿步骤的音频编码方法,以及被配置为执行本发明方法的任何实施例的系统。

    Apparatus and method for determining a quantizer step size
    4.
    发明授权
    Apparatus and method for determining a quantizer step size 有权
    用于确定量化器步长的装置和方法

    公开(公告)号:US08756056B2

    公开(公告)日:2014-06-17

    申请号:US12496880

    申请日:2009-07-02

    IPC分类号: G10L19/032 G10L19/002

    CPC分类号: G10L19/032 G10L2019/0005

    摘要: For determining a quantizer step size for quantizing a signal including audio or video information, a first quantizer step size as well as an interference threshold are provided. Then, the actual interference introduced by the first quantizer step size is determined and compared with the interference threshold. Despite the fact that the comparison reveals that the actually introduced interference exceeds the threshold, a second, coarser quantizer step size is nevertheless used, which will then be used for quantization if it turns out that the interference introduced by the coarser, second quantizer step size falls below the threshold or falls below the interference introduced by the first quantizer step size. Thus, the quantization interference is reduced while the quantization is coarsened and, thus, the compression gain is increased.

    摘要翻译: 为了确定用于量化包括音频或视频信息的信号的量化器步长,提供第一量化器步长以及干扰阈值。 然后,确定由第一量化器步长引入的实际干扰并将其与干扰阈值进行比较。 尽管比较显示实际引入的干扰超过阈值,但是仍然使用第二较粗略的量化器步长,然后将其用于量化,如果证明由较粗的第二量化器步长引入的干扰 低于阈值或低于由第一量化器步长引入的干扰。 因此,量化干扰减小,而量化粗大,因此压缩增益增加。

    SYSTEM FOR COMBINING LOUDNESS MEASUREMENTS IN A SINGLE PLAYBACK MODE
    5.
    发明申请
    SYSTEM FOR COMBINING LOUDNESS MEASUREMENTS IN A SINGLE PLAYBACK MODE 有权
    用于在单个回放模式中组合舒适度测量的系统

    公开(公告)号:US20120328115A1

    公开(公告)日:2012-12-27

    申请号:US13581453

    申请日:2011-03-07

    IPC分类号: H03G7/00

    CPC分类号: H03G9/00 H03G9/005 H03G9/14

    摘要: The present document relates to processing of multimedia data, notably the encoding, the transmission, the decoding and the rendering of multimedia data, e.g. audio files or bitstreams. In particular, the present document relates to the implementation of loudness control in multimedia players. A method for providing loudness related data to a media player is described. The method comprises the steps of providing a first loudness related value associated with an audio signal; wherein the first loudness related value has been determined according to a first procedure; of converting the first loudness related value into a second loudness related value using a reversible relation; wherein the second loudness related value is associated with a second procedure for determining loudness related values; of storing the second loudness related value in metadata associated with the audio signal; and of providing the metadata to the media player.

    摘要翻译: 本文件涉及多媒体数据的处理,特别是多媒体数据的编码,传输,解码和呈现,例如, 音频文件或比特流。 特别地,本文件涉及多媒体播放器中的响度控制的实现。 描述了向媒体播放器提供响度相关数据的方法。 该方法包括以下步骤:提供与音频信号相关联的第一响度相关值; 其中所述第一响度相关值已经根据第一过程确定; 使用可逆关系将第一响度相关值转换为第二响度相关值; 其中所述第二响度相关值与用于确定响度相关值的第二过程相关联; 将第二响度相关值存储在与音频信号相关联的元数据中; 并向媒体播放器提供元数据。

    System and Method for Non-destructively Normalizing Loudness of Audio Signals Within Portable Devices
    6.
    发明申请
    System and Method for Non-destructively Normalizing Loudness of Audio Signals Within Portable Devices 有权
    便携式设备内音频信号响度的非破坏性系统和方法

    公开(公告)号:US20120310654A1

    公开(公告)日:2012-12-06

    申请号:US13576386

    申请日:2011-02-03

    IPC分类号: G10L19/00

    摘要: Many portable playback devices cannot decode and playback encoded audio content having wide bandwidth and wide dynamic range with consistent loudness and intelligibility unless the encoded audio content has been prepared specially for these devices. This problem can be overcome by including with the encoded content some metadata that specifies a suitable dynamic range compression profile by either absolute values or differential values relative to another known compression profile. A playback device may also adaptively apply gain and limiting to the playback audio. Implementations in encoders, in transcoders and in decoders are disclosed.

    摘要翻译: 许多便携式播放设备不能解码和播放具有宽带宽和宽动态范围的编码音频内容,具有一致的响度和清晰度,除非编码音频内容已经为这些设备专门准备。 通过使用编码的内容包含一些通过相对于另一已知压缩简档的绝对值或差分值来指定合适的动态范围压缩简档的元数据来克服该问题。 播放装置还可以自适应地对播放音频应用增益和限制。 公开了在编码器,代码转换器和解码器中的实现。

    METHOD AND ENCODER FOR PROCESSING A DIGITAL STEREO AUDIO SIGNAL
    7.
    发明申请
    METHOD AND ENCODER FOR PROCESSING A DIGITAL STEREO AUDIO SIGNAL 有权
    用于处理数字立体声音频信号的方法和编码器

    公开(公告)号:US20140072120A1

    公开(公告)日:2014-03-13

    申请号:US14113362

    申请日:2012-05-07

    IPC分类号: H04S1/00

    摘要: The invention discloses a method and an encoder for processing a digital audio stereo signal. A digital audio encoder for coding such audio signal comprises a predictive Temporal Noise Shaping (TNS) filter, a Mid-/Side (M/S) coding unit, a control unit for determining a first prediction gain related to the unmodified L/R signal processed by the TNS filter and for determining a second prediction gain related to the M/S-coded L/R signal processed by the TNS filter, wherein the control unit is adapted to disable TNS-filtering—i.e. to bypass the TNS filter—for a current signal frame, if the first and second prediction gains differ by more than a pre-determined mismatch range. Preferably, the first and second prediction gains are determined from signal energy ratios calculated for each channel of the stereo signal including the signal energies of both the TNS-processed (unmodified) L- respectively (unmodified) R-signal and the TNS-processed M/S coded L- respectively M/S coded R-signal divided by the respective signal energies before TNS processing. Furthermore, the control unit is preferably adapted to overrule the disabling of the TNS filter, if the input signal is a near-mono audio signal exhibiting only low energy either in its M- or S-band. In that case, operation of the TNS filter on the stereo audio signal is maintained.

    摘要翻译: 本发明公开了一种用于处理数字音频立体声信号的方法和编码器。 用于对这种音频信号进行编码的数字音频编码器包括预测时间噪声整形(TNS)滤波器,中/侧(M / S)编码单元,用于确定与未修改的L / R信号相关的第一预测增益的控制单元 由TNS滤波器处理并确定与由TNS滤波器处理的M / S编码的L / R信号相关的第二预测增益,其中该控制单元用于禁用TNS滤波 如果第一和第二预测增益相差超过预定的不匹配范围,则绕过TNS滤波器以获得当前信号帧。 优选地,第一和第二预测增益是根据对包括TNS处理(未修改)L信号和TNS处理的M信号的两个信号能量的立体声信号的每个信道计算的信号能量比确定的 / S编码的L-分别M / S编码的R信号除以TNS处理之前的各个信号能量。 此外,如果输入信号是在其M波段或S波段中仅表现出低能量的近乎单声道的音频信号,则控制单元优选地适用于推翻TNS滤波器的禁用。 在这种情况下,维持TNS滤波器对立体声音频信号的操作。

    Apparatus and method for encoding an information signal
    8.
    发明授权
    Apparatus and method for encoding an information signal 有权
    用于编码信息信号的装置和方法

    公开(公告)号:US08655652B2

    公开(公告)日:2014-02-18

    申请号:US12446164

    申请日:2007-09-25

    申请人: Michael Schug

    发明人: Michael Schug

    CPC分类号: G10L19/032

    摘要: An apparatus for encoding an information signal having discrete values includes a quantizer having a quantizer border, wherein the quantizer is adapted so that a discrete value above the quantization border is quantized to a quantization index, which is different from a quantization index obtained by quantizing a discrete value below the quantization border, a controller for modifying the quantization border, wherein the quantizer having a first quantization border setting is adapted to generate a first set of quantization indices for the discrete values, and wherein the quantizer having a second modified quantization border setting is adapted to generate a second set of quantization indices, and an output interface for outputting an encoded information signal which is either based on the first set of quantization indices or the second set of quantization indices dependent on a decision function.

    摘要翻译: 用于对具有离散值的信息信号进行编码的装置包括具有量化器边界的量化器,其中量化器适于使得高于量化边界的离散值被量化为量化指标,其不同于通过量化 离散值低于量化边界,用于修改量化边界的控制器,其中具有第一量化边界设置的量化器适于生成用于离散值的第一组量化索引,并且其中量化器具有第二修改量化边界设置 适于产生第二组量化索引,以及输出接口,用于输出基于第一组量化索引的编码信息信号或者取决于决策函数的第二组量化索引。

    AUDIO ENCODER AND DECODER
    9.
    发明申请
    AUDIO ENCODER AND DECODER 有权
    音频编码器和解码器

    公开(公告)号:US20100286991A1

    公开(公告)日:2010-11-11

    申请号:US12811421

    申请日:2008-12-30

    IPC分类号: G10L21/00

    摘要: The present invention teaches a new audio coding system that can code both general audio and speech signals well at low bit rates. A proposed audio coding system comprises linear prediction unit for filtering an input signal based on an adaptive filter; a transformation unit for transforming a frame of the filtered input signal into a transform domain; and a quantization unit for quantizing the transform domain signal. The quantization unit decides, based on input signal characteristics, to encode the transform domain signal with a model-based quantizer or a non-model-based quantizer. Preferably, the decision is based on the frame size applied by the transformation unit.

    摘要翻译: 本发明教导了一种新的音频编码系统,其可以以低比特率良好地对一般音频和语音信号进行编码。 所提出的音频编码系统包括用于基于自适应滤波器对输入信号进行滤波的线性预测单元; 变换单元,用于将经滤波的输入信号的帧变换为变换域; 以及用于量化变换域信号的量化单元。 量化单元基于输入信号特性来决定用基于模型的量化器或非基于模型的量化器对变换域信号进行编码。 优选地,该决定基于由变换单元应用的帧大小。

    APPARATUS AND METHOD FOR PROCESSING A MULTI-CHANNEL SIGNAL
    10.
    发明申请
    APPARATUS AND METHOD FOR PROCESSING A MULTI-CHANNEL SIGNAL 有权
    用于处理多通道信号的装置和方法

    公开(公告)号:US20070033056A1

    公开(公告)日:2007-02-08

    申请号:US11464315

    申请日:2006-08-14

    IPC分类号: G10L21/00 G10L21/04

    CPC分类号: G10L19/03 G10L19/008

    摘要: An apparatus for processing a multi-channel signal includes a means for determining a similarity between a first one of two channels and a second one of the two channels. Furthermore, a means for performing a prediction filtering of the spectral coefficients is provided, which is formed to perform a prediction filtering with only a single prediction filter for both channels in case of high similarity between the first and the second channel, and to perform a prediction filtering with two separate prediction filters in case of a dissimilarity between the first and the second channel. With this, an introduction of stereo artifacts and a deterioration of the coding gain in stereo coding techniques are avoided.

    摘要翻译: 用于处理多信道信号的装置包括用于确定两个信道中的第一个信道和两个信道中的第二信道之间的相似性的装置。 此外,提供了一种用于执行频谱系数的预测滤波的装置,其被形成为在第一和第二信道之间具有高相似性的情况下仅对两个信道执行预测滤波,并且执行 在第一和第二信道之间具有不相似性的情况下,使用两个单独的预测滤波器进行预测滤波。 由此,避免立体声伪影的引入和立体声编码技术中的编码增益的恶化。