Integration of high frequency reconstruction techniques with reduced post-processing delay

    公开(公告)号:US12243543B2

    公开(公告)日:2025-03-04

    申请号:US18417902

    申请日:2024-01-19

    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

    Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element

    公开(公告)号:US11842743B2

    公开(公告)日:2023-12-12

    申请号:US17831234

    申请日:2022-06-02

    CPC classification number: G10L19/167 G10L19/035 G10L19/24 G10L21/038

    Abstract: Embodiments relate to audio processing unit(s) and methods for decoding an encoded audio bitstream, that includes a fill element with an identifier indicating a start of the fill element and fill data which includes a flag identifying whether to perform a base form of spectral band replication or an enhanced form of spectral band replication, wherein the base form of spectral band replication includes spectral patching, the enhanced form of spectral band replication includes harmonic transposition, one value of the flag indicates that said enhanced form of spectral band replication should be performed on the audio content, and another indicates that said base form of spectral band replication but not said harmonic transposition should be performed on the audio content, wherein the fill data further includes a parameter indicating whether pre-flattening is to be performed after spectral patching for avoiding spectral discontinuities.

    Integration of high frequency reconstruction techniques with reduced post-processing delay

    公开(公告)号:US11830509B2

    公开(公告)日:2023-11-28

    申请号:US18178405

    申请日:2023-03-03

    CPC classification number: G10L19/18 G10L21/038

    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

    Integration of high frequency reconstruction techniques with reduced post-processing delay

    公开(公告)号:US11562759B2

    公开(公告)日:2023-01-24

    申请号:US17050664

    申请日:2019-04-25

    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

    DECODING AUDIO BITSTREAMS WITH ENHANCED SPECTRAL BAND REPLICATION METADATA IN AT LEAST ONE FILL ELEMENT

    公开(公告)号:US20220293116A1

    公开(公告)日:2022-09-15

    申请号:US17831234

    申请日:2022-06-02

    Abstract: Embodiments relate to an audio processing unit that includes a bitstream payload deformatter and a decoding subsystem. The decoding subsystem is coupled to the bitstream payload deformatter and configured to decode at least a portion of a block of an encoded audio bitstream. The block includes a fill element with an identifier indicating a start of the fill element and fill data after the identifier. The fill data includes at least one flag identifying whether a base form of spectral band replication or an enhanced form of spectral band replication is to be performed on audio content of the block. The identifier is a three bit unsigned integer transmitted most significant bit first and having a value of 0x6.

    Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element

    公开(公告)号:US10553232B2

    公开(公告)日:2020-02-04

    申请号:US16040243

    申请日:2018-07-19

    Abstract: Embodiments relate to an audio processing unit that includes a bitstream payload deformatter and a decoding subsystem. The decoding subsystem is coupled to the bitstream payload deformatter and configured to decode at least a portion of a block of an encoded audio bitstream. The block includes a fill element with an identifier indicating a start of the fill element and fill data after the identifier. The fill data includes at least one flag identifying whether a base form of spectral band replication or an enhanced form of spectral band replication is to be performed on audio content of the block. The identifier is a three bit unsigned integer transmitted most significant bit first and having a value of 0x6.

Patent Agency Ranking