摘要:
To provide an enhanced true-to-life atmosphere enjoyed in multipoint connecting, and reduce a calculation load at a multipoint connection unit, as well.A stream synthesizing device includes an input unit which inputs at least two coded signals each including a first downmix acoustic signal and an extended signal, each of first downmix acoustic signals being obtained by coding an acoustic signal into which at least two sound signals are downmixed, and the extended signal being for obtaining the at least two sound signals out of the first downmix acoustic signal; a coded signal generating unit which generates: a second downmix acoustic signal and an extended signal based on each of coded signals inputted by the input unit, the second downmix acoustic signal being for obtaining each of the first downmix acoustic signals, and the generated extended signal being for obtaining each of the first downmix acoustic signals out of the second downmix acoustic signal; and generate a coded signal including the generated second downmix acoustic signal, the generated extended signal, and each of extended signals included in the corresponding inputted coded signal; and an output unit which outputs the generated coded signal.
摘要:
To provide an enhanced true-to-life atmosphere enjoyed in multipoint connecting, and reduce a calculation load at a multipoint connection unit, as well.A stream synthesizing device includes an input unit which inputs at least two coded signals each including a first downmix acoustic signal and an extended signal, each of first downmix acoustic signals being obtained by coding an acoustic signal into which at least two sound signals are downmixed, and the extended signal being for obtaining the at least two sound signals out of the first downmix acoustic signal; a coded signal generating unit which generates: a second downmix acoustic signal and an extended signal based on each of coded signals inputted by the input unit, the second downmix acoustic signal being for obtaining each of the first downmix acoustic signals, and the generated extended signal being for obtaining each of the first downmix acoustic signals out of the second downmix acoustic signal; and generate a coded signal including the generated second downmix acoustic signal, the generated extended signal, and each of extended signals included in the corresponding inputted coded signal; and an output unit which outputs the generated coded signal.
摘要:
A hybrid sound signal decoder decodes a bitstream including audio frames encoded by an audio encoding process using a low delay filter bank and speech frames encoded by a speech encoding process using linear prediction coefficients. When a current frame to be decoded is an ith frame which is an initial speech frame after switching from an audio frame to a speech frame, the hybrid sound signal decoder generates sub-frames which are a signal corresponding to an i−1th frame before being encoded, using a sub-frame which is a signal generated using a signal of the i−1th frame before being encoded, the signal of the i−1th frame being obtained by decoding the ith frame.
摘要:
Provided is an encoding device (1) including: a pitch contour analysis unit (101) which detects information, a dynamic time-warping unit (102) which generates, based on the information, pitch change ratios (Tw_ratio in FIG. 18) within a range (86) including a range (86a) of the pitch change ratios corresponding to absolute pitch differences of 42 cents or larger; a first lossless coding unit (103) which codes the generated pitch parameters (102x); a time-warping unit (104) which shifts a pitch of a signal according to the information; and a second encoding unit which codes a signal (104x) obtained by the shifting.
摘要:
The delay in a multi-channel audio coding apparatus and a multi-channel audio decoding apparatus is reduced. The audio coding apparatus includes: a downmix signal generating unit (410) that generates, in a time domain, a first downmix signal that is one of a 1-channel audio signal and a 2-channel audio signal from an input multi-channel audio signal; a downmix signal coding unit (404) that codes the first downmix signal; a first t-f converting unit (401) that converts the input multi-channel audio signal into a multi-channel audio signal in a frequency domain; and a spatial information calculating unit (409) that generates spatial information for generating a multi-channel audio signal from a downmix signal.
摘要:
An encoder of the present invention includes: G storage sections for storing G groups of data; a selection section for selecting one of H Huffman codebooks having codebook numbers for each of the groups of data; G encoding sections Huffman-encoding the G groups of data by using the selected Huffman codebook; and an encoding section for encoding the codebook number of each Huffman codebook selected. The selection section includes a calculation section for calculating a code length and a control section for selecting one of the Huffman codebooks. When the Huffman codebook selected is an unsigned codebook, a number of bits required for sign information has previously been added to the calculated code length.
摘要:
To provide a bandwidth extension method which allows reduction of computation amount in bandwidth extension and suppression of deterioration of quality in the bandwidth to be extended. In the bandwidth extension method: a low frequency bandwidth signal is transformed into a QMF domain to generate a first low frequency QMF spectrum; pitch-shifted signals are generated by applying different shifting factors on the low frequency bandwidth signal; a high frequency QMF spectrum is generated by time-stretching the pitch-shifted signals in the QMF domain; the high frequency QMF spectrum is modified; and the modified high frequency QMF spectrum is combined with the first low frequency QMF spectrum.
摘要:
To provide a bandwidth extension method which allows reduction of computation amount in bandwidth extension and suppression of deterioration of quality in the bandwidth to be extended. In the bandwidth extension method: a low frequency bandwidth signal is transformed into a QMF domain to generate a first low frequency QMF spectrum; pitch-shifted signals are generated by applying different shifting factors on the low frequency bandwidth signal; a high frequency QMF spectrum is generated by time-stretching the pitch-shifted signals in the QMF domain; the high frequency QMF spectrum is modified; and the modified high frequency QMF spectrum is combined with the first low frequency QMF spectrum.
摘要:
To provide an audio signal processing apparatus which can perform, with low operation amount, audio signal processing that is either time stretch and/or compression processing or frequency modulation processing. The audio signal processing apparatus is intended to transform an input audio signal sequence using a predetermined adjustment factor. The audio signal processing apparatus includes a filter bank (2601) which transforms the input audio signal sequence into Quadrature Mirror Filter (QMF) coefficients using a filter for Quadrature Mirror Filter analysis (a QMF analysis filter) and an adjusting unit (2602) which adjusts the QMF coefficients based on a predetermined adjustment factor.
摘要:
A temporal processing apparatus includes: a splitter splitting an audio signal, included in the sub-band domain, into diffuse signals indicating reverberating components and direct signals indicating non-reverberating components; a downmix unit generating a downmix signal by downmixing the direct signals; BPFs respectively generating a bandpass downmix signal and bandpass diffuse signals; normalization processing units respectively generating a normalized downmix signal and normalized diffuse signals; a scale computation processing unit computing, on a predetermined time slot basis, a scale factor indicating the magnitude of energy of the normalized downmix signal with respect to energy of the normalized diffuse signals; a calculating unit generating scale diffuse signals; a HPF generating high-pass diffuse signals; an adding unit generating addition signals; and a synthesis filter bank performing synthesis filter processing on the addition signals and transforming the addition signals into the time domains.