-
公开(公告)号:US11610592B2
公开(公告)日:2023-03-21
申请号:US17204073
申请日:2021-03-17
发明人: Zexin Liu , Fengyan Qi , Lei Miao
IPC分类号: G10L19/002 , G10L19/028 , G10L19/02 , G10L19/005
摘要: An audio signal decoding device includes a non-transitory memory storage stores audio data in a form of a bitstream; and an audio decoder, by which a first spectral coefficient of a first sub-band of a current frame of an audio signal by decoding the bitstream is obtained; a first average quantity of allocated bits per spectral coefficient of the first sub-band is obtained; a first noise filling gain for the first sub-band is obtained when the first average quantity is less than a threshold; a second spectral coefficient is reconstructed according to the first noise filling gain; a frequency domain audio signal is obtained according to the first spectral coefficient and the second spectral coefficient; and a time domain audio signal is generated according to the frequency domain signal.
-
公开(公告)号:US11545160B2
公开(公告)日:2023-01-03
申请号:US16863439
申请日:2020-04-30
申请人: Axis AB
IPC分类号: G10L19/02 , G10L19/002 , G06N7/00 , H04L65/70 , H04L65/75
摘要: A method, a computer program product, an encoder and a monitoring device for encoding an audio signal with variable bitrate, wherein: an audio signal comprising a plurality of successive audio frames is received; and for each successive audio frame of the audio signal: the audio frame is represented in a frequency domain with respect to a plurality of frequency sub-bands; the audio frame is classified in each frequency sub-band as either background or foreground using a background model specific to the frequency sub-band; each successive audio frame of the audio signal is encoded, wherein a number of bits is allocated for each frequency sub-band of the audio frame, wherein the number of bits allocated for a frequency sub-band is higher if the audio frame is classified as foreground in the frequency sub-band than if the audio frame is classified as background in the frequency sub-band.
-
3.
公开(公告)号:US11501783B2
公开(公告)日:2022-11-15
申请号:US16795561
申请日:2020-02-19
IPC分类号: G10L19/00 , G10L19/005 , G10L19/06 , G10L19/002 , G10L19/012 , G10L19/083 , G10L19/09 , G10L19/12 , G10L19/07 , G10L19/22 , G10L19/02
摘要: An apparatus for decoding an encoded audio signal to obtain a reconstructed audio signal includes a receiving interface for receiving one or more frames comprising information on a plurality of audio signal samples of an audio signal spectrum of the encoded audio signal, and a processor for generating the reconstructed audio signal. The processor is configured to generate the reconstructed audio signal by fading a modified spectrum to a target spectrum, if a current frame is not received by the receiving interface or if the current frame is received by the receiving interface but is corrupted, wherein the modified spectrum includes a plurality of modified signal samples, wherein, for each of the modified signal samples of the modified spectrum, an absolute value of the modified signal sample is equal to an absolute value of one of the audio signal samples of the audio signal spectrum.
-
公开(公告)号:US20220172730A1
公开(公告)日:2022-06-02
申请号:US17672824
申请日:2022-02-16
IPC分类号: G10L19/002 , G10L19/02 , G10L19/035
摘要: An audio signal encoding method and device are provided. The method and device are used to encode an audio signal to obtain a bitstream representing the analog audio signal, in which a proper bit allocation for spectral coefficients can be performed.
-
公开(公告)号:US20220148607A1
公开(公告)日:2022-05-12
申请号:US17436390
申请日:2020-02-10
申请人: ORANGE
发明人: Stéphane Ragot , Pierre Mahe
IPC分类号: G10L19/032 , G10L19/06 , G10L19/008 , G10L19/002
摘要: A method and device for compressing audio signals forming, over time, a succession of sample frames, in each of N channels of an ambisonic representation of order higher than 0. The method includes: forming, based on the channels and for a current frame, a matrix of inter-channel covariance, and searching for eigenvectors of the covariance matrix with a view to obtaining a matrix of eigenvectors; testing the matrix of eigenvectors to verify that it represents a rotation in an N-dimensional space, and if not, correcting the matrix of eigenvectors until a rotation matrix is obtained, for the current frame; and applying the rotation matrix to the signals of the N channels before separate-channel encoding of the signals.
-
公开(公告)号:US20220103948A1
公开(公告)日:2022-03-31
申请号:US17471011
申请日:2021-09-09
申请人: Apple Inc.
发明人: Aarti Kumar , Shehryar Lasi , Baptiste P. Paquier , Brian Clark
摘要: A method performed by an audio source device. The method receives a first audio signal and a second, different audio signal and encodes the first audio signal and the second audio signal, wherein the first audio signal is encoded differently than the second audio signal. The method generates a first data packet that comprises the first encoded audio signal and a first volume level and a second data packet that comprises the second encoded audio signal and a second volume level, wherein the first volume level is lower than the second volume level and transmits, over a wireless connection, the first and second data packets as a dual audio stream to an audio output device.
-
公开(公告)号:US11264041B2
公开(公告)日:2022-03-01
申请号:US16737451
申请日:2020-01-08
IPC分类号: G10L19/028 , G10L19/02 , G10L19/038 , G10L19/002
摘要: An encoder for encoding frequency transform coefficients of a harmonic audio signal include the following elements: A peak locator configured to locate spectral peaks having magnitudes exceeding a predetermined frequency dependent threshold. A peak region encoder configured to encode peak regions including and surrounding the located peaks. A low-frequency set encoder configured to encode at least one low-frequency set of coefficients outside the peak regions and below a crossover frequency that depends on the number of bits used to encode the peak regions. A noise-floor gain encoder configured to encode a noise-floor gain of at least one high-frequency set of not yet encoded coefficients outside the peak regions.
-
公开(公告)号:US11146903B2
公开(公告)日:2021-10-12
申请号:US14289522
申请日:2014-05-28
发明人: Dipanjan Sen , Sang-Uk Ryu
IPC分类号: G10L21/04 , G10L21/00 , H04S5/00 , G10L19/008 , G06F17/16 , H04S7/00 , G10L19/06 , G10L25/18 , G10L19/002 , G10L19/038 , G10L19/02 , G10L19/16 , G10L19/20 , G10L19/00
摘要: In general, techniques are described for compressing decomposed representations of a sound field. A device comprising one or more processors may be configured to perform the techniques. The one or more processors may be configured to obtain a bitstream comprising a compressed version of a spatial component of a sound field, the spatial component generated by performing a vector based synthesis with respect to a plurality of spherical harmonic coefficients.
-
公开(公告)号:US11081121B2
公开(公告)日:2021-08-03
申请号:US16725678
申请日:2019-12-23
IPC分类号: G10L19/00 , G10L19/02 , G10L19/002
摘要: A signal processing method and device includes obtaining spectral coefficients of a current frame of an audio signal, in which N sub-bands of the current frame comprises at least one of the spectral coefficients. A total energy of M successive sub-bands of the N sub-bands, a total energy of K successive sub-bands of the N sub-bands, and an energy of a first sub-band are obtained to determine whether to modify original envelope values of the M sub-bands. When the original envelope values of the M sub-bands are modified, encoding bits are allocated to each of the N sub-bands according to the modified envelope values of the M sub-bands.
-
公开(公告)号:US20210166701A1
公开(公告)日:2021-06-03
申请号:US17104400
申请日:2020-11-25
发明人: Woo-taek LIM , Seung Kwon BEACK , Jongmo SUNG , Mi Suk LEE , Tae Jin LEE
IPC分类号: G10L19/002
摘要: An audio signal encoding/decoding device and method using a filter bank is disclosed. The audio signal encoding method includes generating a plurality of first audio signals by performing filtering on an input audio signal using an analysis filter bank, generating a plurality of second audio signals by performing downsampling on the first audio signals, and outputting a bitstream by encoding and quantizing the second audio signals.
-
-
-
-
-
-
-
-
-