-
公开(公告)号:US20240271217A1
公开(公告)日:2024-08-15
申请号:US18637814
申请日:2024-04-17
发明人: Lars VILLEMOES , Per EKSTRAND
IPC分类号: C12Q1/6883 , G10L19/02 , G10L19/022 , G10L19/26 , G10L21/038
CPC分类号: C12Q1/6883 , G10L19/0204 , G10L19/022 , G10L19/265 , G10L21/038 , C12Q2600/118 , C12Q2600/156
摘要: The present invention relates to coding of audio signals, and in particular to high frequency reconstruction methods including a frequency domain harmonic transposer. A system and method for generating a high frequency component of a signal from a low frequency component of the signal is described. The system comprises an analysis filter bank (501) comprising an analysis transformation unit (601) having a frequency resolution of Δf; and an analysis window (611) having a duration of DA; the analysis filter bank (501) being configured to provide a set of analysis subband signals from the low frequency component of the signal; a nonlinear processing unit (502, 650) configured to determine a set of synthesis subband signals based on a portion of the set of analysis subband signals, wherein the portion of the set of analysis subband signals is phase shifted by a transposition order T; and a synthesis filter bank (504) comprising a synthesis transformation unit (602) having a frequency resolution of QΔf; and a synthesis window (612) having a duration of DS; the synthesis filter bank (504) being configured to generate the high frequency component of the signal from the set of synthesis subband signals; wherein Q is a frequency resolution factor with Q≥1 and smaller than the transposition order T; and wherein the value of the product of the frequency resolution Δf and the duration DA of the analysis filter bank is selected based on the frequency resolution factor Q.
-
公开(公告)号:US12057133B2
公开(公告)日:2024-08-06
申请号:US18312853
申请日:2023-05-05
IPC分类号: H03M13/00 , G10L19/022 , G10L19/035 , G10L21/0324 , H03M13/07 , H03M13/15 , H04B17/309 , H04L1/00
CPC分类号: G10L19/035 , G10L19/022 , G10L21/0324 , H03M13/07 , H03M13/1515 , H04B17/309 , H04L1/0009 , H04L1/0032 , H04L1/0042 , H04L1/0045 , H04L1/0046 , H04L1/0084
摘要: A channel encoder for encoding a frame includes a multi-mode redundancy encoder for redundancy encoding the frame in accordance with a certain coding mode from a set of different coding modes, wherein the coding modes are different from each other with respect to an amount of redundancy added to the frame, wherein the multi-mode redundancy encoder is configured to output a coded frame including at least one code word; and a colorator for applying a coloration sequence to the at least one code word; wherein the coloration sequence is such that at least one bit of the code word is changed by the application of the at least one of coloration sequence, wherein the specific coloration sequence is selected in accordance with the certain coding mode.
-
公开(公告)号:US20240249736A1
公开(公告)日:2024-07-25
申请号:US18628632
申请日:2024-04-05
发明人: Markus Multrus , Bernhard Grill , Guillaume Fuchs , Stefan Geyersberger , Nikolaus Rettelbach , Virgilio Bacigalupo
IPC分类号: G10L19/02 , G10L19/022 , H03M7/30
CPC分类号: G10L19/02 , G10L19/022 , H03M7/30
摘要: An audio encoder for encoding segments of coefficients, the segments of coefficients representing different time or frequency resolutions of a sampled audio signal, the audio encoder including a processor for deriving a coding context for a currently encoded coefficient of a current segment based on a previously encoded coefficient of a previous segment, the previously encoded coefficient representing a different time or frequency resolution than the currently encoded coefficient. The audio encoder further includes an entropy encoder for entropy encoding the current coefficient based on the coding context to obtain an encoded audio stream.
-
公开(公告)号:US20240249733A1
公开(公告)日:2024-07-25
申请号:US18628500
申请日:2024-04-05
发明人: Markus Multrus , Bernhard Grill , Guillaume Fuchs , Stefan Geyersberger , Nikolaus Rettelbach , Virgilio Bacigalupo
IPC分类号: G10L19/02 , G10L19/022 , H03M7/30
CPC分类号: G10L19/02 , G10L19/022 , H03M7/30
摘要: An audio encoder for encoding segments of coefficients, the segments of coefficients representing different time or frequency resolutions of a sampled audio signal, the audio encoder including a processor for deriving a coding context for a currently encoded coefficient of a current segment based on a previously encoded coefficient of a previous segment, the previously encoded coefficient representing a different time or frequency resolution than the currently encoded coefficient. The audio encoder further includes an entropy encoder for entropy encoding the current coefficient based on the coding context to obtain an encoded audio stream.
-
公开(公告)号:US20240212698A1
公开(公告)日:2024-06-27
申请号:US18426726
申请日:2024-01-30
申请人: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE , KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
发明人: Seungkwon BEACK , Tae Jin LEE , Min Je KIM , Kyeongok KANG , Dae Young JANG , Jeongil SEO , Jin Woo HONG , Chieteuk AHN , Ho Chong PARK , Young-cheol PARK
IPC分类号: G10L19/22 , G10L19/022 , G10L19/06 , G10L19/18
CPC分类号: G10L19/22 , G10L19/022 , G10L19/06 , G10L19/18
摘要: A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.
-
6.
公开(公告)号:US12009002B2
公开(公告)日:2024-06-11
申请号:US17400422
申请日:2021-08-12
IPC分类号: G10L19/00 , G10L19/022 , G10L19/035 , G10L21/0324 , H03M13/07 , H03M13/15 , H04B17/309 , H04L1/00
CPC分类号: G10L19/035 , G10L19/022 , G10L21/0324 , H03M13/07 , H03M13/1515 , H04B17/309 , H04L1/0009 , H04L1/0032 , H04L1/0042 , H04L1/0045 , H04L1/0046 , H04L1/0084
摘要: An audio transmitter processor for generating an error protected frame using encoded audio data of an audio frame, the encoded audio data for the audio frame having a first amount of information units and a second amount of information units, has: a frame builder for building a codeword frame having a codeword raster, wherein the frame builder is configured to determine a border between a first amount of information units and a second amount of information units so that a starting information unit of the second amount of information units coincides with a codeword border; and an error protection coder to obtain a plurality of processed codewords representing the error protected frame.
-
公开(公告)号:USRE49999E1
公开(公告)日:2024-06-04
申请号:US17845607
申请日:2022-06-21
发明人: Markus Schnell , Manfred Lutzky , Markus Lohwasser , Markus Schmidt , Marc Gayer , Michael Mellar , Bernd Edler , Markus Multrus , Gerald Schuller , Ralf Geiger , Bernhard Grill
IPC分类号: G10L19/00 , G10L19/02 , G10L19/022 , G10L25/45 , H03H17/02 , G10L21/038
CPC分类号: G10L19/0204 , G10L19/022 , G10L25/45 , H03H17/0266 , G10L21/038
摘要: An embodiment of an apparatus for generating audio subband values in audio subband channels includes an analysis windower for windowing a frame of time-domain audio input samples being in a time sequence extending from an early sample to a later sample using an analysis window function including a sequence of window coefficients to obtain windowed samples. The analysis window function includes a first number of window coefficients derived from a larger window function including a sequence of a larger second number of window coefficients, wherein the window coefficients of the window function are derived by an interpolation of window coefficients of the larger window function. The apparatus further includes a calculator for calculating the audio subband values using the windowed samples.
-
公开(公告)号:US20240177721A1
公开(公告)日:2024-05-30
申请号:US18423083
申请日:2024-01-25
发明人: Bingyin XIA , Jiawei LI , Zhe WANG
IPC分类号: G10L19/022 , G10L25/30
CPC分类号: G10L19/022 , G10L25/30
摘要: Embodiments of this application disclose an audio signal encoding and decoding method, including: obtaining, based on spectra of M blocks of a current frame of a to-be-encoded audio signal, M transient state identifiers of the M blocks, where the M blocks include a first block, and a transient state identifier of the first block indicates that the first block is a transient state block, or indicates that the first block is a non-transient state block; obtaining group information of the M blocks based on the M transient state identifiers of the M blocks; performing grouping and arranging on the spectra of the M blocks based on the group information of the M blocks, to obtain a to-be-encoded spectrum of the current frame; encoding the to-be-encoded spectrum by using an encoding neural network to obtain a spectrum encoding result; and writing the spectrum encoding result into a bitstream.
-
公开(公告)号:US20240169998A1
公开(公告)日:2024-05-23
申请号:US18423990
申请日:2024-01-26
发明人: Xianbo Meng , Bingyin Xia , Zhe Wang
IPC分类号: G10L19/008 , G10L19/022
CPC分类号: G10L19/008 , G10L19/022
摘要: In a multi-channel signal encoding method, a current frame includes a first sound channel and a second sound channel. First group information of M blocks of the first sound channel and second group information of M blocks of the second sound channel are obtained. When the first group information and the second group information meet a preset condition, first adjusted group information and second adjusted group information are obtained based on the first group information and the second group information. Then, a first to-be-encoded spectrum is obtained based on the first adjusted group information and the spectrums of the M blocks of the first sound channel. Similarly, a second to-be-encoded spectrum may be obtained. Finally, the first to-be-encoded spectrum and the second to-be-encoded spectrum are encoded by using an encoding neural network to obtain a spectrum encoding result. The spectrum encoding result may be carried by a bitstream.
-
公开(公告)号:US20240161756A1
公开(公告)日:2024-05-16
申请号:US18419794
申请日:2024-01-23
发明人: Zexin Liu , Xingtao Zhang , Haiting Li , Lei Miao
IPC分类号: G10L19/008 , G10L19/022
CPC分类号: G10L19/008 , G10L19/022 , H04S3/00
摘要: A multi-channel signal encoding method includes obtaining a multi-channel signal of a current frame; determining an initial multi-channel parameter of the current frame; determining a difference parameter based on the initial multi-channel parameter of the current frame and multi-channel parameters of previous K frames of the current frame, where the difference parameter represents a difference between the initial multi-channel parameter of the current frame and the multi-channel parameters of the previous K frames, and K is an integer greater than or equal to one; determining a multi-channel parameter of the current frame based on the difference parameter and a characteristic parameter of the current frame; and encoding the multi-channel signal based on the multi-channel parameter of the current frame.
-
-
-
-
-
-
-
-
-