EFFICIENT COMBINED HARMONIC TRANSPOSITION
    83.
    Invention publication

    Publication No.: US20230245637A1

    Publication date: 2023-08-03

    Application No.: US18194414

    Application date: 2023-03-31

    Abstract: The present document relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), and to digital effect processors, e.g. so-called exciters, where generation of harmonic distortion adds brightness to the processed signal. In particular, a system configured to generate a high frequency component of a signal from a low frequency component of the signal is described. The system may comprise an analysis filter bank (501) configured to provide a set of analysis subband signals from the low frequency component of the signal; wherein the set of analysis subband signals comprises at least two analysis subband signals; wherein the analysis filter bank (501) has a frequency resolution of Δf. The system further comprises a nonlinear processing unit (502) configured to determine a set of synthesis subband signals from the set of analysis subband signals using a transposition order P; wherein the set of synthesis subband signals comprises a portion of the set of analysis subband signals phase shifted by an amount derived from the transposition order P; and a synthesis filter bank (504) configured to generate the high frequency component of the signal from the set of synthesis subband signals; wherein the synthesis filter bank (504) has a frequency resolution of FΔf; with F being a resolution factor, with F≥1; wherein the transposition order P is different from the resolution factor F.
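The abstract does not give a formula for the phase modification or for how synthesis subbands are paired with analysis subbands. The following is a minimal sketch under two assumptions that are not stated in the listing: the phase of each complex subband sample is multiplied by P (a common choice in harmonic transposers), and subband n of a filter bank with spacing d is centred at n·d. The function names are hypothetical.

```python
import cmath


def transpose_sample(x: complex, order: int) -> complex:
    """Keep the magnitude of x and multiply its phase by the transposition order P.

    Assumption: phase multiplication by P; the abstract only says the phase
    shift is "derived from the transposition order P".
    """
    magnitude, phase = cmath.polar(x)
    return cmath.rect(magnitude, order * phase)


def source_subband(synth_index: int, order: int, resolution_factor: float) -> int:
    """Pick the analysis subband that feeds synthesis subband `synth_index`.

    Assumption: synthesis subband n is centred at n * F * delta_f, so the
    source frequency n * F * delta_f / P falls in analysis subband n * F / P.
    """
    return round(synth_index * resolution_factor / order)
```

With P = 3 and F = 2, synthesis subband 6 would draw on analysis subband 4 under these assumptions, which illustrates why P and F need not be equal.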

    SUBBAND BLOCK BASED HARMONIC TRANSPOSITION
    84.
    Invention publication

    Publication No.: US20230238017A1

    Publication date: 2023-07-27

    Application No.: US18192982

    Application date: 2023-03-30

    Inventor: Lars VILLEMOES

    Abstract: The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where generation of harmonic distortion adds brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank configured to provide an analysis subband signal from the input signal; wherein the analysis subband signal comprises a plurality of complex-valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S. The subband processing unit performs a block-based nonlinear processing wherein the magnitude of samples of the synthesis subband signal is determined from the magnitude of corresponding samples of the analysis subband signal and a predetermined sample of the analysis subband signal. In addition, the system comprises a synthesis filterbank configured to generate the time stretched and/or frequency transposed signal from the synthesis subband signal.
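The abstract says each output magnitude depends on the corresponding input sample and on one predetermined sample of the block, but it does not say how the two are combined. The sketch below assumes a geometric weighting controlled by a hypothetical parameter `rho`, and assumes the predetermined sample is the block's centre sample; both are illustrative choices, not taken from the listing.

```python
def block_magnitudes(block, rho=0.5, center=None):
    """Return output magnitudes for one block of complex analysis samples.

    Assumptions (not from the abstract): the predetermined sample is the
    centre of the block, and magnitudes are combined as
    |x_center| ** rho * |x_k| ** (1 - rho).
    """
    if center is None:
        center = len(block) // 2
    reference = abs(block[center])
    return [(reference ** rho) * (abs(x) ** (1.0 - rho)) for x in block]
```

Setting `rho=0` in this sketch reduces to using only the corresponding sample's magnitude, and `rho=1` gives every output sample the magnitude of the reference sample.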

    METHOD AND APPARATUS FOR DETERMINING PARAMETERS OF A GENERATIVE NEURAL NETWORK

    Publication No.: US20230229892A1

    Publication date: 2023-07-20

    Application No.: US17927929

    Application date: 2021-05-31

    CPC classification number: G06N3/0455 G10L19/26 G10L25/30 G10L25/69 G06N3/082

    Abstract: Described herein is a method of determining parameters for a generative neural network for processing an audio signal, wherein the generative neural network includes an encoder stage mapping to a coded feature space and a decoder stage, each stage including a plurality of convolutional layers with one or more weight coefficients, the method comprising a plurality of cycles with sequential processes of: pruning the weight coefficients of either or both stages based on pruning control information, the pruning control information determining the number of weight coefficients that are pruned for respective convolutional layers; training the pruned generative neural network based on a set of training data; determining a loss for the trained and pruned generative neural network based on a loss function; and determining updated pruning control information based on the determined loss and a target loss. Further described are corresponding apparatus, programs, and computer-readable storage media.
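The cycle described in the abstract (prune, train, measure loss, update the pruning control information) can be sketched as a plain loop. Everything specific here is an assumption for illustration: the pruning control information is modelled as one pruning fraction per layer, pruning is by smallest magnitude, the update rule is a fixed step towards or away from more pruning, and `train` and `loss_fn` are caller-supplied placeholders.

```python
def prune_layer(weights, fraction):
    """Zero the smallest-magnitude `fraction` of a layer's weights (assumed criterion)."""
    n_prune = int(len(weights) * fraction)
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:n_prune]:
        pruned[i] = 0.0
    return pruned


def update_control(fractions, loss, target_loss, step=0.1):
    """Prune more while the loss is within target, less once it exceeds it (assumed rule)."""
    delta = step if loss <= target_loss else -step
    return [min(1.0, max(0.0, f + delta)) for f in fractions]


def run_cycles(layers, fractions, train, loss_fn, target_loss, n_cycles):
    """Repeat prune -> train -> evaluate -> update control, as in the abstract."""
    for _ in range(n_cycles):
        layers = [prune_layer(w, f) for w, f in zip(layers, fractions)]
        layers = train(layers)          # hypothetical training step
        loss = loss_fn(layers)          # hypothetical loss function
        fractions = update_control(fractions, loss, target_loss)
    return layers, fractions
```

A real training step would also need to keep pruned weights at zero, which this sketch leaves to the `train` callback.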

    METHOD AND UNIT FOR PERFORMING DYNAMIC RANGE CONTROL

    Publication No.: US20230198488A1

    Publication date: 2023-06-22

    Application No.: US17921279

    Application date: 2021-05-17

    CPC classification number: H03G9/025 H03G7/007

    Abstract: The present document describes a dynamic range control unit (210) configured to apply dynamic range control, referred to as DRC, to an audio signal (211). The DRC unit (210) is configured to downsample a subband signal (212) derived from the audio signal (211), to provide a downsampled subband signal (321), to determine a DRC gain (329) based on the downsampled subband signal (321), and to apply the DRC gain (329) to the subband signal (212), to provide a compressed subband signal (213) of a compressed audio signal (214).
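The three steps in the abstract (downsample the subband signal, derive a gain from the downsampled signal, apply the gain to the full-rate subband signal) can be sketched as follows. The details are assumptions for illustration only: downsampling is plain decimation of the sample magnitudes, the gain comes from a simple static compressor with hypothetical `threshold` and `ratio` parameters working on linear levels, and each gain is held for `factor` full-rate samples.

```python
def drc_gains(levels, threshold=0.5, ratio=4.0):
    """Static compressor gain per level (assumed curve, linear domain)."""
    gains = []
    for level in levels:
        if level <= threshold:
            gains.append(1.0)
        else:
            target = threshold + (level - threshold) / ratio
            gains.append(target / level)
    return gains


def apply_drc(subband, factor=2, threshold=0.5, ratio=4.0):
    """Compute gains on a decimated copy and apply them at the full rate."""
    downsampled = [abs(x) for x in subband[::factor]]   # plain decimation, no filtering
    gains = drc_gains(downsampled, threshold, ratio)
    # Hold each gain for `factor` consecutive full-rate samples.
    return [x * gains[i // factor] for i, x in enumerate(subband)]
```

Computing the gain at the lower rate means the gain logic runs once per `factor` samples, while the signal itself is never downsampled on its way to the output.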

    INTEGRATION OF HIGH FREQUENCY AUDIO RECONSTRUCTION TECHNIQUES

    Publication No.: US20230197103A1

    Publication date: 2023-06-22

    Application No.: US18113406

    Application date: 2023-02-23

    CPC classification number: G10L21/0388 G10L19/02 G10L19/008 H04S3/008

    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
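To illustrate the choice the flag makes, the sketch below treats the lowband as a list of partial frequencies and regenerates highband partials in one of two ways. It is a conceptual toy, not the decoder described in the abstract: the function name, the `order` and `shift_hz` parameters, and the boolean flag representation are all hypothetical.

```python
def regenerate_partials(lowband_hz, use_harmonic_transposition, order=2, shift_hz=0.0):
    """Map lowband partial frequencies to highband partial frequencies.

    Harmonic transposition scales each frequency by `order`;
    spectral translation adds a constant offset `shift_hz`.
    """
    if use_harmonic_transposition:
        return [order * f for f in lowband_hz]
    return [f + shift_hz for f in lowband_hz]
```

For partials at 100, 200 and 300 Hz, transposition with `order=2` yields 200, 400 and 600 Hz, which stay on the original harmonic series, whereas translation by 250 Hz yields 350, 450 and 550 Hz, which do not.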

    Backward-compatible integration of harmonic transposer for high frequency reconstruction of audio signals

    Publication No.: US11676616B2

    Publication date: 2023-06-13

    Application No.: US17963743

    Application date: 2022-10-11

    CPC classification number: G10L19/26 G10L19/008 G10L19/24

    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.
