-
公开(公告)号:US20180182413A1
公开(公告)日:2018-06-28
申请号:US15889748
申请日:2018-02-06
发明人: Yutaka KAMAMOTO , Takehiro Moriya , Noboru Harada
摘要: An autocorrelation calculating part calculates autocorrelation Ro(i) from an input signal. A predictive coefficient calculating part performs linear predictive analysis using modified autocorrelation R′o(i) obtained by multiplying the autocorrelation Ro(i) by a coefficient wo(i). Here, it is assumed that a case where, for at least part of each order i, the coefficient wo(i) corresponding to each order i monotonically increases as a value having negative correlation with a fundamental frequency of an input signal in a current frame or a past frame increases and a case where the coefficient wo(i) monotonically decreases as a value having positive correlation with a pitch gain in a current frame or a past frame increases, are included.
-
公开(公告)号:US09980074B2
公开(公告)日:2018-05-22
申请号:US14289477
申请日:2014-05-28
发明人: Dipanjan Sen , Sang-Uk Ryu
IPC分类号: G10L19/00 , G10L19/008 , G10L19/032 , H04S5/00 , G06F17/16 , G10L19/06 , G10L25/18 , H04S7/00 , G10L19/002 , G10L19/038 , G10L19/02 , G10L19/16 , G10L19/20
CPC分类号: H04S5/005 , G06F17/16 , G10L19/002 , G10L19/008 , G10L19/0204 , G10L19/038 , G10L19/06 , G10L19/167 , G10L19/20 , G10L25/18 , G10L2019/0001 , G10L2019/0005 , H04R2205/021 , H04S7/30 , H04S7/304 , H04S7/40 , H04S2400/01 , H04S2400/15 , H04S2420/01 , H04S2420/03 , H04S2420/11
摘要: In general, techniques are described for determining quantization step sizes for compression of spatial components of a sound field. A device comprising one or more processors may be configured to perform the techniques. In other words, the one or more processors may be configured to determine a quantization step size to be used when compressing a spatial component of a sound field, where the spatial component generated by performing a vector based synthesis with respect to a plurality of spherical harmonic coefficients.
-
公开(公告)号:US20180137867A1
公开(公告)日:2018-05-17
申请号:US15849645
申请日:2017-12-20
发明人: Heiko Purnhagen , Pontus Carlsson , Lars Villemoes
IPC分类号: G10L19/008 , G10L19/16 , G10L19/02 , H04S3/00 , G10L19/06
CPC分类号: G10L19/008 , G10L19/0212 , G10L19/06 , G10L19/167 , G10L25/12 , H04S3/008 , H04S2400/01
摘要: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of: (i) computing a second frequency-domain representation of a first input channel; and (ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency-domain representation of the second input channel and a complex prediction coefficient. The upmixing can be suspended responsive to control data.
-
公开(公告)号:US09947328B2
公开(公告)日:2018-04-17
申请号:US15702451
申请日:2017-09-12
发明人: Michael M. Truman , Mark S. Vinton
IPC分类号: G10L21/038 , G10L21/0388 , G10L19/02 , G10L19/028 , G10L19/26
CPC分类号: G10L19/0208 , G10L19/0017 , G10L19/002 , G10L19/012 , G10L19/02 , G10L19/0204 , G10L19/0212 , G10L19/028 , G10L19/03 , G10L19/06 , G10L19/16 , G10L19/167 , G10L19/173 , G10L19/26 , G10L19/265 , G10L21/00 , G10L21/038 , G10L21/0388
摘要: According to an aspect of the present invention, a method for reconstructing an audio signal having a baseband portion and a highband portion is disclosed. The method includes obtaining a decoded baseband audio signal by decoding an encoded audio signal and obtaining a plurality of subband signals by filtering the decoded baseband audio signal. The method further includes generating a high-frequency reconstructed signal by copying a number of consecutive subband signals of the plurality of subband signals and obtaining an envelope adjusted high-frequency signal. The method further includes generating a noise component based on a noise parameter. Finally, the method includes adjusting a phase of the high-frequency reconstructed signal and obtaining a time-domain reconstructed audio signal by combining the decoded baseband audio signal and the combined high-frequency signal to obtain a time-domain reconstructed audio signal.
-
25.
公开(公告)号:US20180102134A1
公开(公告)日:2018-04-12
申请号:US15834260
申请日:2017-12-07
发明人: Sascha DISCH , Frederik NAGEL , Ralf GEIGER , Balaji Nagendran THOSHKAHNA , Konstantin SCHMIDT , Stefan BAYER , Christian NEUKAM , Bernd EDLER , Christian HELMRICH
IPC分类号: G10L21/0388 , H04S1/00 , G10L25/06 , G10L19/06 , G10L19/032 , G10L19/03 , G10L19/025 , G10L19/022 , G10L19/02
CPC分类号: G10L19/008 , G10L19/02 , G10L19/0204 , G10L19/0208 , G10L19/0212 , G10L19/022 , G10L19/025 , G10L19/03 , G10L19/032 , G10L19/06 , G10L21/0388 , G10L25/06 , H04S1/007
摘要: An apparatus for decoding an encoded audio signal, includes a spectral domain audio decoder for generating a first decoded representation of a first set of first spectral portions, the decoded representation having a first spectral resolution; a parametric decoder for generating a second decoded representation of a second set of second spectral portions having a second spectral resolution being lower than the first spectral resolution; a frequency regenerator for regenerating every constructed second spectral portion having the first spectral resolution using a first spectral portion and spectral envelope information for the second spectral portion; and a spectrum time converter for converting the first decoded representation and the reconstructed second spectral portion into a time representation.
-
26.
公开(公告)号:US20180096694A1
公开(公告)日:2018-04-05
申请号:US15562689
申请日:2016-04-11
CPC分类号: G10L19/09 , G10L19/038 , G10L19/06 , G10L19/07 , G10L19/13 , G10L19/26 , G10L2019/0007 , G10L2019/0016
摘要: A linear predictive coding apparatus includes: a linear predictive analysis part performing linear predictive analysis using a pseudo correlation function signal sequence obtained by performing inverse Fourier transform regarding the η1-th power of absolute values of the frequency domain sample sequence corresponding to the time-series signal as a power spectrum to obtain coefficients transformable to linear predictive coefficients; an adaptation part adapting values η for a plurality of plural candidates for coefficients transformable to linear predictive coefficients stored in a code book in a code book storing part and the coefficients transformable to linear predictive coefficients obtained by the linear predictive analysis part; and a coding part obtaining a linear predictive coefficient code corresponding to the coefficients transformable to linear predictive coefficients, using the plurality of candidates for coefficients transformable to linear predictive coefficients and the coefficients transformable to linear predictive coefficients for which the values of η have been adapted.
-
公开(公告)号:US20180075854A1
公开(公告)日:2018-03-15
申请号:US15817218
申请日:2017-11-19
发明人: Stefan Bruhn
IPC分类号: G10L19/087 , G10L21/02 , G10L19/06 , G10L25/84 , G10L21/0308 , G10L19/26 , G10L19/012 , G10L21/0216
CPC分类号: G10L19/26 , G10L19/012 , G10L19/06 , G10L19/087 , G10L21/02 , G10L21/0308 , G10L25/84 , G10L2021/02168
摘要: In a method for coding of information for enhancing a background noise representation, voice activity of an input speech signal is determined. A noisiness parameter is determined for an inactive speech signal, wherein the noisiness parameter is based on a ratio of prediction gains of two Linear Predictive Coder (LPC) prediction filters with different orders. The noisiness parameter is quantized, and the quantized noisiness parameter is encoded for transmission.
-
公开(公告)号:US09911426B2
公开(公告)日:2018-03-06
申请号:US15478049
申请日:2017-04-03
IPC分类号: G10L19/16 , G10L19/002 , H03G9/00 , H03G9/02
CPC分类号: G10L19/167 , G10L19/002 , G10L19/06 , H03G9/005 , H03G9/025
摘要: Apparatus and methods for generating an encoded audio bitstream, including by including program loudness metadata and audio data in the bitstream, and optionally also program boundary metadata in at least one segment (e.g., frame) of the bitstream. Other aspects are apparatus and methods for decoding such a bitstream, e.g., including by performing adaptive loudness processing of the audio data of an audio program indicated by the bitstream, or authentication and/or validation of metadata and/or audio data of such an audio program. Another aspect is an audio processing unit (e.g., an encoder, decoder, or post-processor) configured (e.g., programmed) to perform any embodiment of the method or which includes a buffer memory which stores at least one frame of an audio bitstream generated in accordance with any embodiment of the method.
-
公开(公告)号:US09892736B2
公开(公告)日:2018-02-13
申请号:US14793297
申请日:2015-07-07
发明人: Heiko Purnhagen , Pontus Carlsson , Lars Villemoes
CPC分类号: G10L19/008 , G10L19/0212 , G10L19/06 , G10L19/167 , G10L25/12 , H04S3/008 , H04S2400/01
摘要: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of: (i) computing a second frequency-domain representation of a first input channel; and (ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency-domain representation of the second input channel and a complex prediction coefficient. The upmixing can be suspended responsive to control data.
-
公开(公告)号:US09886959B2
公开(公告)日:2018-02-06
申请号:US14050042
申请日:2013-10-09
发明人: Clyde Holmes
IPC分类号: G10L19/00 , G10L19/002 , G10L19/06 , G10L19/08 , G10L25/09
CPC分类号: G10L19/002 , G10L19/06 , G10L19/08 , G10L25/09
摘要: A voice encoder/decoder (vocoder) may provide receiving a voice sample and generating zero crossings of the voice sample in response to voice excitation in a first formant and creating a corresponding output signal. Additional operations may include dividing the output signal by two, and sampling the output signal at a predefined frequency such that a resulting combination uses half of a bit rate for an excitation and a remainder for short term spectrum analysis.
-
-
-
-
-
-
-
-
-