HARMONIC TRANSPOSITION IN AN AUDIO CODING METHOD AND SYSTEM

    公开(公告)号:US20230027660A1

    公开(公告)日:2023-01-26

    申请号:US17954179

    申请日:2022-09-27

    摘要: The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.

    Harmonic transposition in an audio coding method and system

    公开(公告)号:US11562755B2

    公开(公告)日:2023-01-24

    申请号:US17409592

    申请日:2021-08-23

    摘要: The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.

    CROSS PRODUCT ENHANCED SUBBAND BLOCK BASED HARMONIC TRANSPOSITION

    公开(公告)号:US20220293113A1

    公开(公告)日:2022-09-15

    申请号:US17829733

    申请日:2022-06-01

    发明人: Lars Villemoes

    摘要: The invention provides an efficient implementation of cross-product enhanced high-frequency reconstruction (HFR), wherein a new component at frequency QΩ+rΩ0 is generated on the basis of existing components at Ω and Ω+Ω0. The invention provides a block-based harmonic transposition, wherein a time block of complex subband samples is processed with a common phase modification. Superposition of several modified samples has the net effect of limiting undesirable intermodulation products, thereby enabling a coarser frequency resolution and/or lower degree of oversampling to be used. In one embodiment, the invention further includes a window function suitable for use with block-based cross-product enhanced HFR. A hardware embodiment of the invention may include an analysis filter bank, a subband processing unit configurable by control data and a synthesis filter bank.

    METHOD AND APPARATUS IN AUDIO PROCESSING

    公开(公告)号:US20220270626A1

    公开(公告)日:2022-08-25

    申请号:US17450015

    申请日:2021-10-05

    IPC分类号: G10L21/003 G10L21/04

    摘要: Aspects of the disclosure provide methods and apparatuses for audio processing. In some examples, an apparatus of audio coding includes processing circuitry. The processing circuitry decodes, from a coded bitstream, information indicative of an adjusted speech signal and a loudness adjustment to the adjusted speech signal. The adjusted speech signal is indicated in an association with multiple speech signals in a scene of an immersive media application. The processing circuitry determines a plurality of loudness adjustments to sound signals including the multiple speech signals in the scene based the plurality of loudness adjustment to the adjusted speech signal, and generates the sound signals in the scene based on the loudness adjustments to the sound signals.

    Data Driven Radio Enhancement
    6.
    发明申请

    公开(公告)号:US20210151069A1

    公开(公告)日:2021-05-20

    申请号:US17158873

    申请日:2021-01-26

    申请人: BabbleLabs LLC

    摘要: Systems and methods are disclosed for data driven radio enhancement. For example, methods may include demodulating a radio signal to obtain a demodulated audio signal; determining a window of audio samples based on the demodulated audio signal; applying an audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the audio enhancement network includes a machine learning network that has been trained using demodulated audio signals derived from radio signals; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.

    Optimized scale factor for frequency band extension in an audio frequency signal decoder

    公开(公告)号:US10943594B2

    公开(公告)日:2021-03-09

    申请号:US16546898

    申请日:2019-08-21

    摘要: A method and device are provided for determining an optimized scale factor to be applied to an excitation signal or a filter during a process for frequency band extension of an audio frequency signal. The band extension process includes decoding or extracting, in a first frequency band, an excitation signal and parameters of the first frequency band including coefficients of a linear prediction filter, generating an excitation signal extending over at least one second frequency band, filtering using a linear prediction filter for the second frequency band. The determination method includes determining an additional linear prediction filter, of a lower order than that of the linear prediction filter of the first frequency band, the coefficients of the additional filter being obtained from the parameters decoded or extracted from the first frequency and calculating the optimized scale factor as a function of at least the coefficients of the additional filter.

    Audio processing method and audio processing device for expanding or compressing audio signals

    公开(公告)号:US10891966B2

    公开(公告)日:2021-01-12

    申请号:US16135818

    申请日:2018-09-19

    发明人: Akira Maezawa

    摘要: An audio processing device includes a feature extraction unit and signal generating unit. The feature extraction unit is configured to extract a feature quantity of a first audio signal for each of a plurality of periods. The signal generating unit is configured to for generate a second audio signal by time axis expanding/compressing either a section of the first audio signal in which the feature quantity is steadily maintained for a period time, or a section of the first audio signal in which a fluctuation of the feature quantity is repeated and excluding from the time axis expanding/compressing a section of the first audio signal in which a fluctuation of the feature quantity is not similar to that of other sections of the first audio signal.

    STEREO UNFOLD WITH PSYCHOACOUSTIC GROUPING PHENOMENON

    公开(公告)号:US20200304929A1

    公开(公告)日:2020-09-24

    申请号:US16605009

    申请日:2018-03-23

    发明人: Bernt BÖHMER

    摘要: The Stereo Unfold Technology solves the inherent problems in the stereo reproduction by utilizing modern DSP technology to extract information from the Left (L) and Right (R) stereo channels to create a number of new channels that feeds into processing algorithms. The Stereo Unfold Technology operates by sending the ordinary stereo information in the customary way towards the listener to establish the perceived location of performers in the sound field with great accuracy and then projects delayed and frequency shaped extracted signals forward as well as in other directions to provide additional psychoacoustically based clues to the ear and brain. The additional clues generate the sensation of increased detail and transparency as well as establishing the three dimensional properties of the sound sources and the acoustic environment in which they are performing. The Stereo Unfold Technology manages to create a real believable three-dimensional soundstage populated with three-dimensional sound sources generating sound in a continuous real sounding acoustic environment.