-
公开(公告)号:US20250061902A1
公开(公告)日:2025-02-20
申请号:US18940536
申请日:2024-11-07
Inventor: Bernhard GRILL , Roch LEFEBVRE , Bruno BESSETTE , Jimmy LAPIERRE , Philippe GOURNAY , Redwan SALAMI , Stefan BAYER , Guillaume FUCHS , Stefan GEYERSBERGER , Ralf GEIGER , Johannes HILPERT , Ulrich KRAEMER , Jérémie LECOMTE , Markus MULTRUS , Max NEUENDORF , Harald POPP , Nikolaus RETTELBACH
IPC: G10L19/008 , G10L19/00 , G10L19/02 , G10L19/16 , G10L19/18
Abstract: An audio encoder has a first information sink oriented encoding branch such as a spectral domain encoding branch, a second information source or SNR oriented encoding branch such as an LPC-domain encoding branch, and a switch for switching between the first and second encoding branches, the second encoding branch having a converter into a specific domain different from the spectral domain such as an LPC analysis stage generating an excitation signal, and the second encoding branch having a specific domain coding branch such as LPC domain processing branch, and a specific spectral domain coding branch such as LPC spectral domain processing branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder, and a third domain decoder as well as two cascaded switches for switching between the decoders.
-
公开(公告)号:US20250054504A1
公开(公告)日:2025-02-13
申请号:US18817567
申请日:2024-08-28
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhe Wang
Abstract: A method for processing an audio signal includes receiving a bitstream corresponding to the audio signal; obtaining a silence insertion descriptor (SID) type of a current frame of the audio signal by decoding the bitstream; obtaining a low-band parameter of the current frame by decoding the bitstream; obtaining a low-band signal of the current frame based on the low-band parameter; obtaining, based on the SID type of the current frame, a high-band parameter of the current frame; obtaining a high-band signal of the current frame based on the high-band parameter; and obtaining a synthesis signal of the current frame based on the low-band signal and the high-band signal.
-
公开(公告)号:US12159636B2
公开(公告)日:2024-12-03
申请号:US17145047
申请日:2021-01-08
Inventor: Frederik Nagel , Max Neuendorf , Nikolaus Rettelbach , Jérémie Lecomte , Markus Multrus , Bernhard Grill , Sascha Disch
IPC: G10L19/008 , G10L19/02 , G10L19/18 , G10L21/038
Abstract: An apparatus for generating a representation of a bandwidth-extended signal on the basis of an input signal representation includes a phase vocoder configured to obtain values of a spectral domain representation of a first patch of the bandwidth-extended signal on the basis of the input signal representation. The apparatus also includes a value copier configured to copy a set of values of the spectral domain representation of the first patch, which values are provided by the phase vocoder, to obtain a set of values of a spectral domain representation of a second patch, wherein the second patch is associated with higher frequencies than the first patch. The apparatus is configured to obtain the representation of the bandwidth-extended signal using the values of the spectral domain representation of the first patch and the values of the spectral domain representation of the second patch.
-
4.
公开(公告)号:US20240347069A1
公开(公告)日:2024-10-17
申请号:US18751078
申请日:2024-06-21
Inventor: Stefan BRUHN , Juan Felix TORRES
IPC: G10L19/16 , G10L19/008 , G10L19/18
CPC classification number: G10L19/167 , G10L19/008 , G10L19/18
Abstract: The present document describes a method for generating a bitstream, wherein the bitstream comprises a sequence of superframes for a sequence of frames of an immersive audio signal. The method comprises, repeatedly for the sequence of superframes, inserting coded audio data for one or more frames of one or more downmix channel signals derived from the immersive audio signal, into data fields of a superframe; and inserting metadata for reconstructing one or more frames of the immersive audio signal from the coded audio data, into a metadata field of the superframe.
-
公开(公告)号:US12100406B2
公开(公告)日:2024-09-24
申请号:US18344445
申请日:2023-06-29
Applicant: Huawei Technologies Co., Ltd.
Inventor: Zhe Wang
CPC classification number: G10L19/012 , G10L19/0204 , G10L19/22 , G10L19/265 , G10L25/21 , G10L25/78 , G10L19/18
Abstract: A method for processing an audio signal includes receiving a bitstream corresponding to the audio signal; obtaining a silence insertion descriptor (SID) type of a current frame of the audio signal by decoding the bitstream; obtaining a low-band parameter of the current frame by decoding the bitstream; obtaining a low-band signal of the current frame based on the low-band parameter; obtaining, based on the SID type of the current frame, a high-band parameter of the current frame; obtaining a high-band signal of the current frame based on the high-band parameter; and obtaining a synthesis signal of the current frame based on the low-band signal and the high-band signal.
-
6.
公开(公告)号:US12020718B2
公开(公告)日:2024-06-25
申请号:US17251940
申请日:2019-07-02
Inventor: Stefan Bruhn , Juan Felix Torres
IPC: G10L19/16 , G10L19/008 , G10L19/18 , H04S3/00
CPC classification number: G10L19/167 , G10L19/008 , G10L19/18
Abstract: The present document describes a method (500) for generating a bitstream (101), wherein the bitstream (101) comprises a sequence of superframes (400) for a sequence of frames of an immersive audio signal (111). The method (500) comprises, repeatedly for the sequence of superframes (400), inserting (501) coded audio data (206) for one or more frames of one or more downmix channel signals (203) derived from the immersive audio signal (111), into data fields (411, 421, 412, 422) of a superframe (400); and inserting (502) metadata (202, 205) for reconstructing one or more frames of the immersive audio signal (111) from the coded audio data (206), into a metadata field (403) of the superframe (400).
-
公开(公告)号:US11915712B2
公开(公告)日:2024-02-27
申请号:US17453139
申请日:2021-11-01
Inventor: Sascha Disch , Martin Dietz , Markus Multrus , Guillaume Fuchs , Emmanuel Ravelli , Matthias Neusinger , Markus Schnell , Benjamin Schubert , Bernhard Grill
IPC: G10L19/00 , G10L19/02 , G10L19/022 , G10L19/028 , G10L19/04 , G10L19/083 , G10L19/18 , G10L19/24 , G10L19/26 , G10L21/038
CPC classification number: G10L19/0208 , G10L19/022 , G10L19/18 , G10L19/24 , G10L2019/0001 , G10L19/02 , G10L19/028 , G10L19/04 , G10L19/083 , G10L19/26 , G10L21/038
Abstract: An audio encoder for encoding an audio signal includes: a first encoding processor for encoding a first audio signal portion in a frequency domain, wherein the first encoding processor includes: a time frequency converter for converting the first audio signal portion into a frequency domain representation having spectral lines up to a maximum frequency of the first audio signal portion; a spectral encoder for encoding the frequency domain representation; a second encoding processor for encoding a second different audio signal portion in the time domain; a cross-processor for calculating, from the encoded spectral representation of the first audio signal portion, initialization data of the second encoding processor, so that the second encoding processing is initialized to encode the second audio signal portion immediately following the first audio signal portion in time in the audio signal; a controller configured for analyzing the audio signal and for determining, which portion of the audio signal is the first audio signal portion encoded in the frequency domain and which portion of the audio signal is the second audio signal portion encoded in the time domain; and an encoded signal former for forming an encoded audio signal including a first encoded signal portion for the first audio signal portion and a second encoded signal portion for the second audio signal portion.
-
8.
公开(公告)号:US11823694B2
公开(公告)日:2023-11-21
申请号:US18178396
申请日:2023-03-03
Applicant: Dolby International AB
Inventor: Kristofer Kjoerling , Lars Villemoes , Heiko Purnhagen , Per Ekstrand
IPC: G10L19/18 , G10L21/038
CPC classification number: G10L19/18 , G10L21/038
Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
-
公开(公告)号:US11715477B1
公开(公告)日:2023-08-01
申请号:US17716805
申请日:2022-04-08
Applicant: Digital Voice Systems, Inc.
Inventor: Daniel W. Griffin , John C. Hardwick
IPC: G10L19/087 , G10L19/038 , G10L25/21 , G10L19/00 , G10L19/18
CPC classification number: G10L19/038 , G10L25/21 , G10L19/087 , G10L19/18 , G10L2019/0002
Abstract: Quantizing speech model parameters includes, for each of multiple vectors of quantized excitation strength parameters, determining first and second errors between first and second elements of a vector of excitation strength parameters and, respectively, first and second elements of the vector of quantized excitation strength parameters, and determining a first energy and a second energy associated with, respectively, the first and second errors. First and second weights for, respectively, the first error and the second error, are determined and are used to produce first and second weighted errors, which are combined to produce a total error. The total errors of each of the multiple vectors of quantized excitation strength parameters are compared and the vector of quantized excitation strength parameters that produces the smallest total error is selected to represent the vector of excitation strength parameters.
-
10.
公开(公告)号:US11708741B2
公开(公告)日:2023-07-25
申请号:US17201689
申请日:2021-03-15
Inventor: Jeffrey Riedmiller , Karl J. Roeden , Kristofer Kjoerling , Heiko Purnhagen , Vinay Melkote , Leif Sehlstrom
IPC: G10L19/24 , G10L19/008 , G10L19/16 , G10L19/18 , E21B33/138 , E21B41/00 , E21B21/00
CPC classification number: E21B33/138 , E21B21/003 , E21B41/00 , G10L19/008 , G10L19/167 , G10L19/18 , G10L19/24
Abstract: On the basis of a bitstream (P), an n-channel audio signal (X) is reconstructed by deriving an m-channel core signal (Y) and multichannel coding parameters (α) from the bitstream, where 1≤m
-
-
-
-
-
-
-
-
-