-
Publication No.: US20230009374A1
Publication date: 2023-01-12
Application No.: US17933567
Filing date: 2022-09-20
Inventors: Bernhard GRILL , Roch LEFEBVRE , Bruno BESSETTE , Jimmy LAPIERRE , Philippe GOURNAY , Redwan SALAMI , Stefan BAYER , Guillaume FUCHS , Stefan GEYERSBERGER , Ralf GEIGER , Johannes HILPERT , Ulrich KRAEMER , Jérémie LECOMTE , Markus MULTRUS , Max NEUENDORF , Harald POPP , Nikolaus RETTELBACH
IPC classification: G10L19/008 , G10L19/16 , G10L19/18
Abstract: An audio encoder has a first information sink oriented encoding branch such as a spectral domain encoding branch, a second information source or SNR oriented encoding branch such as an LPC-domain encoding branch, and a switch for switching between the first and second encoding branches, the second encoding branch having a converter into a specific domain different from the spectral domain such as an LPC analysis stage generating an excitation signal, and the second encoding branch having a specific domain coding branch such as an LPC domain processing branch, and a specific spectral domain coding branch such as an LPC spectral domain processing branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder, and a third domain decoder as well as two cascaded switches for switching between the decoders.
-
Publication No.: US11410668B2
Publication date: 2022-08-09
Application No.: US16290587
Filing date: 2019-03-01
Inventors: Sascha Disch , Martin Dietz , Markus Multrus , Guillaume Fuchs , Emmanuel Ravelli , Matthias Neusinger , Markus Schnell , Benjamin Schubert , Bernhard Grill
IPC classification: G10L19/02 , G10L19/022 , G10L19/18 , G10L19/24 , G10L19/028 , G10L19/04 , G10L19/083 , G10L19/26 , G10L19/038 , G10L19/00 , G10L21/038
Abstract: An audio encoder for encoding an audio signal includes: a first encoding processor for encoding a first audio signal portion in a frequency domain, wherein the first encoding processor includes: a time frequency converter for converting the first audio signal portion into a frequency domain representation having spectral lines up to a maximum frequency of the first audio signal portion; a spectral encoder for encoding the frequency domain representation; a second encoding processor for encoding a second different audio signal portion in the time domain; a cross-processor for calculating, from the encoded spectral representation of the first audio signal portion, initialization data of the second encoding processor, so that the second encoding processor is initialized to encode the second audio signal portion immediately following the first audio signal portion in time in the audio signal; a controller configured for analyzing the audio signal and for determining which portion of the audio signal is the first audio signal portion encoded in the frequency domain and which portion of the audio signal is the second audio signal portion encoded in the time domain; and an encoded signal former for forming an encoded audio signal including a first encoded signal portion for the first audio signal portion and a second encoded signal portion for the second audio signal portion.
-
Publication No.: US11232804B2
Publication date: 2022-01-25
Application No.: US16628235
Filing date: 2018-07-03
Inventors: Arijit Biswas , Michael Schug , Harald Mundt
IPC classification: G10L19/025 , G10L19/032 , G10L19/18 , H03M7/30 , H04B1/66 , H04H20/88
Abstract: The present disclosure relates to methods and apparatus for audio coding. A method of encoding a portion of an audio signal comprises determining whether the portion of the audio signal is likely to contain dense transient events, and if it is determined that the portion of the audio signal is likely to contain dense transient events, quantizing the portion of the audio signal using a quantization mode that applies a substantially constant signal-to-noise ratio over frequency for the portion of the audio signal. The present disclosure further relates to a method of detecting dense transient events in a portion of an audio signal.
-
Publication No.: US20210287689A1
Publication date: 2021-09-16
Application No.: US17336132
Filing date: 2021-06-01
Inventors: Sascha DISCH , Martin DIETZ , Markus MULTRUS , Guillaume FUCHS , Emmanuel RAVELLI , Matthias NEUSINGER , Markus SCHNELL , Benjamin SCHUBERT , Bernhard GRILL
IPC classification: G10L19/18 , G10L19/028 , G10L19/032 , G10L19/06 , G10L19/26
Abstract: An audio encoder for encoding an audio signal has: a first encoding processor for encoding a first audio signal portion in a frequency domain, having: a time frequency converter for converting the first audio signal portion into a frequency domain representation; an analyzer for analyzing the frequency domain representation to determine first spectral portions to be encoded with a first spectral resolution and second regions to be encoded with a second resolution; and a spectral encoder for encoding the first spectral portions with the first spectral resolution and encoding the second portions with the second resolution; a second encoding processor for encoding a second different audio signal portion in the time domain; a controller for analyzing and determining which portion of the audio signal is the first audio signal portion encoded in the frequency domain and which portion is the second audio signal portion encoded in the time domain; and an encoded signal former for forming an encoded audio signal having a first encoded signal portion for the first audio signal portion and a second encoded signal portion for the second portion.
-
Publication No.: US20210134303A1
Publication date: 2021-05-06
Application No.: US17145047
Filing date: 2021-01-08
Inventors: Frederik NAGEL , Max NEUENDORF , Nikolaus RETTELBACH , Jérémie LECOMTE , Markus MULTRUS , Bernhard GRILL , Sascha DISCH
IPC classification: G10L19/008 , G10L19/18 , G10L21/038
Abstract: An apparatus for generating a representation of a bandwidth-extended signal on the basis of an input signal representation includes a phase vocoder configured to obtain values of a spectral domain representation of a first patch of the bandwidth-extended signal on the basis of the input signal representation. The apparatus also includes a value copier configured to copy a set of values of the spectral domain representation of the first patch, which values are provided by the phase vocoder, to obtain a set of values of a spectral domain representation of a second patch, wherein the second patch is associated with higher frequencies than the first patch. The apparatus is configured to obtain the representation of the bandwidth-extended signal using the values of the spectral domain representation of the first patch and the values of the spectral domain representation of the second patch.
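The value-copying step described in this abstract can be illustrated with a minimal sketch. It assumes the first patch has already been produced (the phase vocoder itself is not modelled) and that spectra are plain lists of bin values ordered from low to high frequency. The function name `assemble_bwe_spectrum` and the `copy_count` parameter are hypothetical and do not come from the patent.

```python
def assemble_bwe_spectrum(low_band, first_patch, copy_count=None):
    """Build a bandwidth-extended spectrum from a low band and a first patch.

    Illustrative only: the second patch is obtained by copying values of the
    first patch and placing them at higher bin indices than the first patch.
    """
    if copy_count is None:
        copy_count = len(first_patch)
    if not 0 <= copy_count <= len(first_patch):
        raise ValueError("copy_count must be between 0 and len(first_patch)")
    # Copy (not recompute) the values of the first patch.
    second_patch = list(first_patch[:copy_count])
    # Bins are ordered low -> high, so the copy ends up above the first patch.
    return list(low_band) + list(first_patch) + second_patch


extended = assemble_bwe_spectrum([1.0, 2.0], [3.0, 4.0])
```

In this sketch only the first patch would need the costlier phase-vocoder computation; the second patch reuses its values.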
-
Publication No.: US20210020187A1
Publication date: 2021-01-21
Application No.: US17030104
Filing date: 2020-09-23
Inventors: Ki-hyun CHOO , Eun-mi OH , Seon-ho HWANG
IPC classification: G10L19/12 , G10L21/038 , G10L19/18 , G10L19/012 , G10L19/16
Abstract: Disclosed are a method and an apparatus for high frequency decoding for bandwidth extension. The method for high frequency decoding for bandwidth extension comprises the steps of: decoding an excitation class; transforming a decoded low frequency spectrum on the basis of the excitation class; and generating a high frequency excitation spectrum on the basis of the transformed low frequency spectrum. The method and apparatus for high frequency decoding for bandwidth extension according to an embodiment can transform a restored low frequency spectrum and generate a high frequency excitation spectrum, thereby improving the restored sound quality without an excessive increase in complexity.
-
Publication No.: US20200381001A1
Publication date: 2020-12-03
Application No.: US16996671
Filing date: 2020-08-18
Inventors: Stefan DOEHLA , Guillaume FUCHS , Bernhard GRILL , Markus MULTRUS , Grzegorz PIETRZYK , Emmanuel RAVELLI , Markus SCHNELL
Abstract: Audio decoder device for decoding a bitstream, the audio decoder device including: a predictive decoder for producing a decoded audio frame from the bitstream, wherein the predictive decoder includes a parameter decoder for producing one or more audio parameters for the decoded audio frame from the bitstream and wherein the predictive decoder includes a synthesis filter device for producing the decoded audio frame by synthesizing the one or more audio parameters for the decoded audio frame; a memory device including one or more memories, wherein each of the memories is configured to store a memory state for the decoded audio frame, wherein the memory state for the decoded audio frame of the one or more memories is used by the synthesis filter device for synthesizing the one or more audio parameters for the decoded audio frame; and a memory state resampling device configured to determine the memory state for synthesizing the one or more audio parameters for the decoded audio frame, which has a sampling rate, for one or more of the memories by resampling a preceding memory state for synthesizing one or more audio parameters for a preceding decoded audio frame, which has a preceding sampling rate being different from the sampling rate of the decoded audio frame, for one or more of the memories and to store the memory state for synthesizing of the one or more audio parameters for the decoded audio frame for one or more of the memories into the respective memory.
-
Publication No.: US20200335116A1
Publication date: 2020-10-22
Application No.: US16915904
Filing date: 2020-06-29
Abstract: A codec allowing for switching between different coding modes is improved by, responsive to a switching instance, performing temporal smoothing and/or blending at a respective transition.
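As a rough illustration of blending at a transition, the sketch below applies a linear crossfade between the output of the mode being left and the output of the mode being entered. The abstract does not say which smoothing or blending scheme is used, so the linear weights, the function name `crossfade`, and the equal-length overlap regions are all assumptions.

```python
def crossfade(prev_tail, next_head):
    """Linearly blend two equal-length sample lists across a mode switch.

    prev_tail: samples produced by the coding mode being left.
    next_head: samples for the same time span from the mode being entered.
    Illustrative only; the blending scheme is an assumption.
    """
    if len(prev_tail) != len(next_head):
        raise ValueError("overlap regions must have equal length")
    n = len(prev_tail)
    blended = []
    for i in range(n):
        # Weight of the new mode rises steadily across the overlap.
        w = (i + 1) / (n + 1)
        blended.append((1.0 - w) * prev_tail[i] + w * next_head[i])
    return blended


smoothed = crossfade([1.0, 1.0, 1.0], [0.0, 0.0, 0.0])
```

The two weights always sum to one, so a signal that both modes reproduce identically passes through the transition unchanged.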
-
Publication No.: US20200066285A1
Publication date: 2020-02-27
Application No.: US16673237
Filing date: 2019-11-04
Inventors: Ki-hyun CHOO
IPC classification: G10L19/00 , G10L19/18 , G10L19/002 , G10L19/22
Abstract: The lossless coding method includes selecting one of a first coding method and a second coding method, based on a range in which a quantization index of energy is represented, and coding the quantization index by using the selected coding method. The lossless decoding method includes determining a coding method of a differential quantization index of energy included in a bitstream and decoding the differential quantization index by using one of a first decoding method and a second decoding method based on a range in which a quantization index of energy is represented, in response to the determined coding method.
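The range-based selection can be sketched as follows. The abstract does not define the two coding methods or the range, so the threshold `small_range=(-4, 3)`, the labels `"first"` and `"second"`, and the function names are hypothetical. Only the decision rule, choosing a method by the range in which the differential indices fall, follows the text.

```python
def differential(indices):
    """Differences between consecutive energy quantization indices."""
    return [b - a for a, b in zip(indices, indices[1:])]


def select_method(indices, small_range=(-4, 3)):
    """Pick a coding method from the range the differential indices occupy.

    Returns "first" when every differential index lies inside small_range
    (inclusive bounds), otherwise "second". Both the range and the labels
    are placeholders, not values from the patent.
    """
    lo, hi = small_range
    diffs = differential(indices)
    if all(lo <= d <= hi for d in diffs):
        return "first"
    return "second"


choice = select_method([10, 11, 9, 12])
```

A decoder following the abstract would read the signalled coding method from the bitstream and apply the matching decoding method, rather than recomputing this decision.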
-
Publication No.: US10522163B2
Publication date: 2019-12-31
Application No.: US16514533
Filing date: 2019-07-17
Inventors: Jeffrey Riedmiller , Karl J. Roeden , Kristofer Kjoerling , Heiko Purnhagen , Vinay Melkote , Leif Sehlstrom
IPC classification: G10L19/24 , G10L19/008 , G10L19/16 , G10L19/18
Abstract: On the basis of a bitstream (P), an n-channel audio signal (X) is reconstructed by deriving an m-channel core signal (Y) and multichannel coding parameters (a) from the bitstream, where 1≤m