-
Publication No.: US12223971B2
Publication Date: 2025-02-11
Application No.: US17874975
Application Date: 2022-07-27
Applicant: NTT DOCOMO, INC.
Inventor: Kei Kikuiri , Atsushi Yamaguchi
IPC: G10L19/00 , G10L19/02 , G10L19/028 , G10L19/032 , G10L19/12 , G10L19/16 , G10L19/26 , G10L19/24 , G10L21/038
Abstract: The purpose of the present invention is to reduce distortion of a frequency band component encoded with a small number of bits in a time domain and improve quality. An audio decoding device (10) decodes an encoded audio signal and outputs the audio signal. A decoding unit (10a) decodes an encoded sequence containing an encoded audio signal and obtains a decoded signal. A selective temporal envelope shaping unit (10b) shapes a temporal envelope of a decoded signal in the frequency band on the basis of decoding related information concerning decoding of the encoded sequence.
-
Publication No.: US20250046322A1
Publication Date: 2025-02-06
Application No.: US18925144
Application Date: 2024-10-24
Applicant: Telefonaktiebolaget LM Ericsson (publ)
Inventor: Erik Norvell , Volodya Grancharov
IPC: G10L19/038 , G10L19/02 , G10L19/032 , G10L19/083 , G10L21/0232
Abstract: A gain adjustment apparatus for use in decoding of audio that has been encoded with separate gain and shape representations includes an accuracy meter configured to estimate an accuracy measure of the shape representation, and to determine a gain correction based on the estimated accuracy measure. An envelope adjuster further included in the apparatus is configured to adjust the gain representation based on the determined gain correction.
-
Publication No.: US12205604B2
Publication Date: 2025-01-21
Application No.: US18449085
Application Date: 2023-08-14
Inventor: Sascha Disch , Ralf Geiger , Andreas Niedermeier , Matthias Neusinger , Konstantin Schmidt , Stephan Wilde , Benjamin Schubert , Christian Neukam
IPC: G10L19/028 , G10L15/20 , G10L19/02 , G10L19/032 , G10L21/038 , G10L25/21
Abstract: An apparatus for generating an enhanced signal from an input signal, wherein the enhanced signal has spectral values for an enhancement spectral region, the spectral values for the enhancement spectral regions not being contained in the input signal, includes a mapper for mapping a source spectral region of the input signal to a target region in the enhancement spectral region, the source spectral region including a noise-filling region; and a noise filler configured for generating first noise values for the noise-filling region in the source spectral region of the input signal and for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from the first noise values or for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from first noise values in the source region.
-
Publication No.: US12198709B2
Publication Date: 2025-01-14
Application No.: US17375465
Application Date: 2021-07-14
Inventor: Fabian Küch , Oliver Thiergart , Guillaume Fuchs , Stefan Döhla , Alexandre Bouthéon , Jürgen Herre , Stefan Bayer
IPC: G10L19/032 , G10L19/26 , H04S1/00 , H04S7/00
Abstract: An apparatus for encoding a spatial audio representation representing an audio scene to obtain an encoded audio signal includes: a transport representation generator for generating a transport representation from the spatial audio representation, and for generating transport metadata related to the generation of the transport representation or indicating one or more directional properties of the transport representation; and an output interface for generating the encoded audio signal, the encoded audio signal including information on the transport representation, and information on the transport metadata.
-
Publication No.: US12148436B2
Publication Date: 2024-11-19
Application No.: US18362453
Application Date: 2023-07-31
Applicant: Huawei Technologies Co., Ltd.
Inventor: Eyal Shlomot , Jonathan Alastair Gibbs , Haiting Li
IPC: G10L19/008 , G10L19/038 , G10L19/06 , G10L19/032 , G10L19/07
Abstract: A stereo signal encoding method includes performing spectrum broadening on a quantized line spectral frequency (LSF) parameter of a primary channel signal in a current frame in a stereo signal to obtain a spectrum-broadened LSF parameter of the primary channel signal, determining a prediction residual of an LSF parameter of a secondary channel signal in the current frame based on an original LSF parameter of the secondary channel signal and the spectrum-broadened LSF parameter of the primary channel signal, and performing a quantization on the prediction residual of the LSF parameter of the secondary channel signal.
-
Publication No.: US12142284B2
Publication Date: 2024-11-12
Application No.: US18220677
Application Date: 2023-07-11
Inventor: Sascha Disch , Frederik Nagel , Ralf Geiger , Balaji Nagendran Thoshkahna , Konstantin Schmidt , Stefan Bayer , Christian Neukam , Bernd Edler , Christian Helmrich
IPC: G10L19/00 , G10L19/008 , G10L19/02 , G10L19/022 , G10L19/025 , G10L19/03 , G10L19/032 , G10L19/06 , G10L21/0388 , G10L25/06 , G10L25/18 , G10L25/21 , H04S1/00
Abstract: An apparatus for generating a decoded two-channel signal, comprising: a parametric decoder for providing parametric data for a second set of second spectral portions and a two-channel identification identifying for a second spectral portion of the second set of second spectral portions either a first two-channel representation for the second spectral portion of the second set of second spectral portions or a second two-channel representation for the second spectral portion of the second set of second spectral portions, the second two-channel representation being different from the first two-channel representation; and a frequency regenerator for regenerating the second spectral portion of the second set of second spectral portions depending on a first spectral portion of a first set of first spectral portions, the parametric data for the second spectral portion of the second set of second spectral portions and the two-channel identification for the second spectral portion of the second set of second spectral portions to acquire a regenerated second spectral portion of the second set of second spectral portions.
-
Publication No.: US12112765B2
Publication Date: 2024-10-08
Application No.: US16802397
Application Date: 2020-02-26
Inventor: Bernd Edler , Christian Helmrich , Max Neuendorf , Benjamin Schubert
IPC: G10L19/032 , G10L19/02 , H04N19/547 , H04N19/635
CPC classification number: G10L19/032 , G10L19/02 , H04N19/547 , H04N19/635
Abstract: An encoder for encoding an audio signal is configured to encode the audio signal in a transform domain or filter-bank domain, is configured to determine spectral coefficients of the audio signal for a current frame and at least one previous frame, and is configured to selectively apply predictive encoding to a plurality of individual spectral coefficients or groups of spectral coefficients which are separated by at least one spectral coefficient.
-
Publication No.: US20240331710A1
Publication Date: 2024-10-03
Application No.: US18435725
Application Date: 2024-02-07
Applicant: The ADT Security Corporation
Inventor: Andrew P. WEIR
IPC: G10L19/032 , G10L19/022
CPC classification number: G10L19/032 , G10L19/022
Abstract: A system comprising an audio compression device is provided. The audio compression device receives a plurality of pulse code modulated (PCM) samples, performs a quantization of the plurality of the PCM samples, and determines a plurality of time windows for the plurality of quantized PCM samples. The audio compression device further determines a first number of delta bits and a first sample count for a first time window of a plurality of time windows, where the first time window includes a first group of time-domain consecutive samples according to the first sample count. The audio compression device encodes the sample into the first number of delta bits based on a difference from a previous sample to generate a first stream of delta bits, and encodes the first sample count and the first number of delta bits in a corresponding first header for the first time window.
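Note: The per-window delta coding described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation. The function names, the dictionary header layout, the two's-complement width rule, and the `prev` seed sample are all assumptions made for illustration; the abstract does not specify how the number of delta bits or the window boundaries are chosen.

```python
def delta_bits_needed(samples, prev):
    """Smallest signed (two's-complement) width that holds every
    sample-to-sample difference in the window. Assumed rule."""
    width = 1
    for s in samples:
        d = s - prev
        # A signed value d fits in `width` bits when
        # -2^(width-1) <= d < 2^(width-1).
        while not (-(1 << (width - 1)) <= d < (1 << (width - 1))):
            width += 1
        prev = s
    return width


def encode_window(samples, prev):
    """Return (header, deltas) for one time window.

    The header carries the sample count and the delta bit width,
    mirroring the two fields named in the abstract."""
    width = delta_bits_needed(samples, prev)
    deltas = []
    for s in samples:
        deltas.append(s - prev)
        prev = s
    header = {"sample_count": len(samples), "delta_bits": width}
    return header, deltas


def decode_window(header, deltas, prev):
    """Rebuild the window by accumulating deltas onto the previous sample."""
    out = []
    for d in deltas[: header["sample_count"]]:
        prev += d
        out.append(prev)
    return out
```

In this sketch a window whose samples change slowly gets a small `delta_bits` value, which is where the compression would come from; actual bit packing of the deltas is omitted.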
-
Publication No.: US20240296854A1
Publication Date: 2024-09-05
Application No.: US18647394
Application Date: 2024-04-26
Inventor: Qingbo HUANG , Yuyong KANG , Wei XIAO , Meng WANG , Yupeng SHI
IPC: G10L19/02 , G10L19/032
CPC classification number: G10L19/0204 , G10L19/032
Abstract: An audio processing method includes: filtering an audio signal to obtain a low-frequency signal and a high-frequency signal; encoding the low-frequency signal to obtain a bitstream of the low-frequency signal; performing frequency domain transform on the low-frequency signal and the high-frequency signal respectively, to obtain a low-frequency spectrum and a high-frequency spectrum; performing spectral envelope extraction on the low-frequency spectrum and the high-frequency spectrum to obtain spectral envelope information, and performing spectral flatness extraction on the high-frequency spectrum to obtain spectral flatness information; and performing quantization encoding on the spectral flatness information and the spectral envelope information to obtain a bandwidth extension bitstream, and combining the bandwidth extension bitstream and the bitstream of low-frequency signal into an encoded bitstream.
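Note: The abstract does not define its spectral flatness measure. A common definition, assumed here, is the ratio of the geometric mean to the arithmetic mean of the power spectrum, which is close to 1 for a noise-like spectrum and close to 0 for a tonal one. The function name and the `eps` guard are illustrative choices, not taken from the patent.

```python
import math


def spectral_flatness(power, eps=1e-12):
    """Geometric mean over arithmetic mean of a power spectrum.

    `power` is a non-empty sequence of non-negative bin powers.
    `eps` avoids log(0) and division by zero (assumed guard value)."""
    n = len(power)
    # Geometric mean computed in the log domain for numerical stability.
    log_mean = sum(math.log(p + eps) for p in power) / n
    arith_mean = sum(power) / n + eps
    return math.exp(log_mean) / arith_mean
```

A decoder-side bandwidth extension could use such a value to decide how much noise to mix into the regenerated high band, but that use is an assumption and is not stated in the abstract.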
-
Publication No.: US12075224B2
Publication Date: 2024-08-27
Application No.: US18311854
Application Date: 2023-05-03
Applicant: DOLBY INTERNATIONAL AB
Inventor: Heiko Purnhagen , Lars Villemoes , Jonas Engdegard , Jonas Roeden , Kristofer Kjoerling
IPC: G10L19/08 , G10L19/008 , G10L19/02 , G10L19/032 , G10L19/16 , G10L19/26 , H04R5/00 , H04S3/02 , H04S5/00
CPC classification number: H04R5/00 , G10L19/008 , G10L19/0204 , G10L19/032 , G10L19/167 , G10L19/26 , H04S3/02 , H04S5/00 , H04S2400/01 , H04S2400/03 , H04S2420/03
Abstract: A method and apparatus for reconstructing N audio channels from M audio channels is disclosed. The method includes receiving a bitstream containing an encoded audio signal representing the M audio channels and decoding the encoded audio signal to obtain a frequency domain representation of the M audio channels. The method further includes extracting a parameter from the bitstream and reconstructing at least one of the N audio channels using the parameter. The parameter represents an angle between two signals, at least one of which is included in the M audio channels.