Method and unit for performing dynamic range control

    公开(公告)号:US12191834B2

    公开(公告)日:2025-01-07

    申请号:US17921279

    申请日:2021-05-17

    Abstract: A dynamic range control unit (210) configured to apply dynamic range control, referred to as DRC, to an audio signal (211). The DRC unit (210) is configured to downsample a subband signal (212) derived from the audio signal (211), to provide a downsampled subband signal (321), to determine a DRC gain (329) based on the downsampled subband signal (321), and to apply the DRC gain (329) to the subband signal (212), to provide a compressed subband signal (213) of a compressed audio signal (214).

    Using metadata to aggregate signal processing operations

    公开(公告)号:US11545166B2

    公开(公告)日:2023-01-03

    申请号:US16917931

    申请日:2020-07-01

    Abstract: A technique including receiving and decoding a coded bitstream encoded with audio content including first audio objects corresponding to a first media content type of two consecutive media content types and second audio objects corresponding to a second media content type of the two consecutive media content types, and audio metadata corresponding to the audio content. The audio metadata including first and second audio object gains, for the first and second audio objects, generated in part based on a first fading curve of the first media content type and a second fading curve of the second media content type, respectively. The technique further includes applying the first and second audio object gains to the first and second audio objects, and rendering a sound field represented by the first audio object with the applied first audio object gain and the second audio object with the applied second audio object gain.

    Methods and systems for efficient recovery of high frequency audio content

    公开(公告)号:US09984695B2

    公开(公告)日:2018-05-29

    申请号:US15494195

    申请日:2017-04-21

    Abstract: The present document relates to the technical field of audio coding, decoding and processing. It specifically relates to methods of recovering high frequency content of an audio signal from low frequency content of the same audio signal in an efficient manner. A method for determining a first banded tonality value for a first frequency subband of an audio signal is described. The first banded tonality value is used for approximating a high frequency component of the audio signal based on a low frequency component of the audio signal. The method comprises determining a set of transform coefficients in a corresponding set of frequency bins based on a block of samples of the audio signal; determining a set of bin tonality values for the set of frequency bins using the set of transform coefficients, respectively; and combining a first subset of two or more of the set of bin tonality values for two or more corresponding adjacent frequency bins of the set of frequency bins lying within the first frequency subband, thereby yielding the first banded tonality value for the first frequency subband.

    Audio encoder and decoder for interleaved waveform coding
    5.
    发明授权
    Audio encoder and decoder for interleaved waveform coding 有权
    用于交织波形编码的音频编码器和解码器

    公开(公告)号:US09514761B2

    公开(公告)日:2016-12-06

    申请号:US14781891

    申请日:2014-04-04

    Abstract: There is provided methods and apparatuses for decoding and encoding of audio signals. In particular, a method for decoding includes receiving a waveform-coded signal having a spectral content corresponding to a subset of the frequency range above a cross-over frequency. The waveform-coded signal is interleaved with a parametric high frequency reconstruction of the audio signal above the cross-over frequency. In this way an improved reconstruction of the high frequency bands of the audio signal is achieved.

    Abstract translation: 提供了用于对音频信号进行解码和编码的方法和装置。 特别地,一种解码方法包括:接收波形编码信号,该波形编码信号具有对应于高于交叉频率的频率范围子集的频谱内容。 波形编码信号与高于交叉频率的音频信号的参数高频重构进行交织。 以这种方式,实现了音频信号的高频带的改进的重建。

    Hybrid encoding of higher frequency and downmixed low frequency content of multichannel audio
    6.
    发明授权
    Hybrid encoding of higher frequency and downmixed low frequency content of multichannel audio 有权
    混合编码高频和低混频多频道内容的多声道音频

    公开(公告)号:US08804971B1

    公开(公告)日:2014-08-12

    申请号:US14010826

    申请日:2013-08-27

    CPC classification number: G10L19/008 G10L19/02

    Abstract: A method for encoding a multichannel audio input signal, including steps of generating a downmix of low frequency components of a subset of channels of the input signal, waveform coding each channel of the downmix, thereby generating waveform coded, downmixed data, performing parametric encoding on at least some higher frequency components of each channel of the input signal, thereby generating parametrically coded data, and generating an encoded audio signal (e.g., an E-AC-3 encoded signal) indicative of the waveform coded, downmixed data and the parametrically coded data. Other aspects are methods for decoding such an encoded signal, and systems configured to perform any embodiment of the inventive method.

    Abstract translation: 一种用于编码多声道音频输入信号的方法,包括以下步骤:产生输入信号的通道子集的低频分量的下混合,对下混频的每个通道进行波形编码,从而产生波形编码的下混合数据,对 输入信号的每个通道的至少一些较高频率分量,由此产生参数编码数据,并产生指示波形编码的下混合数据和参数编码的编码音频信号(例如,E-AC-3编码信号) 数据。 其他方面是用于对这样的编码信号进行解码的方法,以及被配置为执行本发明方法的任何实施例的系统。

    Integrated reconstruction and rendering of audio signals

    公开(公告)号:US11264040B2

    公开(公告)日:2022-03-01

    申请号:US17114192

    申请日:2020-12-07

    Abstract: A method for rendering an audio output based on an audio data stream including M audio signals, side information including a series of reconstruction instances of a reconstruction matrix C and first timing data, the side information allowing reconstruction of N audio objects from the M audio signals, and object metadata defining spatial relationships between the N audio objects. The method includes generating a synchronized rendering matrix based on the object metadata, the first timing data, and information relating to a current playback system configuration, the synchronized rendering matrix having a rendering instance for each reconstruction instance, multiplying each reconstruction instance with a corresponding rendering instance to form a corresponding instance of an integrated rendering matrix, and applying the integrated rendering matrix to the audio signals in order to render an audio output.

    Integrated reconstruction and rendering of audio signals

    公开(公告)号:US10891962B2

    公开(公告)日:2021-01-12

    申请号:US16486493

    申请日:2018-03-06

    Abstract: A method for rendering an audio output based on an audio data stream including M audio signals, side information including a series of reconstruction instances of a reconstruction matrix C and first timing data, the side information allowing reconstruction of N audio objects from the M audio signals, and object metadata defining spatial relationships between the N audio objects. The method includes generating a synchronized rendering matrix based on the object metadata, the first timing data, and information relating to a current playback system configuration, the synchronized rendering matrix having a rendering instance for each reconstruction instance, multiplying each reconstruction instance with a corresponding rendering instance to form a corresponding instance of an integrated rendering matrix, and applying the integrated rendering matrix to the audio signals in order to render an audio output.

    USING METADATA TO AGGREGATE SIGNAL PROCESSING OPERATIONS

    公开(公告)号:US20210005211A1

    公开(公告)日:2021-01-07

    申请号:US16917931

    申请日:2020-07-01

    Abstract: A technique including receiving and decoding a coded bitstream encoded with audio content including first audio objects corresponding to a first media content type of two consecutive media content types and second audio objects corresponding to a second media content type of the two consecutive media content types, and audio metadata corresponding to the audio content. The audio metadata including first and second audio object gains, for the first and second audio objects, generated in part based on a first fading curve of the first media content type and a second fading curve of the second media content type, respectively. The technique further includes applying the first and second audio object gains to the first and second audio objects, and rendering a sound field represented by the first audio object with the applied first audio object gain and the second audio object with the applied second audio object gain.

    Audio decoder and decoding method using efficient downmixing
    10.
    发明授权
    Audio decoder and decoding method using efficient downmixing 有权
    音频解码器和解码方法采用高效降混

    公开(公告)号:US09311921B2

    公开(公告)日:2016-04-12

    申请号:US14517800

    申请日:2014-10-18

    CPC classification number: G10L19/008 G10L19/02 G10L19/022 H04S3/008

    Abstract: A method, an apparatus, a computer readable storage medium configured with instructions for carrying out a method, and logic encoded in one or more computer-readable tangible medium to carry out actions. The method is to decode audio data that includes N.n channels to M.m decoded audio channels, including unpacking metadata and unpacking and decoding frequency domain exponent and mantissa data; determining transform coefficients from the unpacked and decoded frequency domain exponent and mantissa data; inverse transforming the frequency domain data; and in the case M

    Abstract translation: 配置有用于执行方法的指令的方法,装置,计算机可读存储介质,以及编码在一个或多个计算机可读有形介质中以执行动作的逻辑。 该方法是将包括N.n个信道的音频数据解码为M.m个解码的音频信道,包括解包元数据和解码和解码频域指数和尾数数据; 从解压缩和解码的频域指数和尾数数据确定变换系数; 逆变换频域数据; 在M

Patent Agency Ranking