摘要:
The delay in a multi-channel audio coding apparatus and a multi-channel audio decoding apparatus is reduced. The audio coding apparatus includes: a downmix signal generating unit (410) that generates, in a time domain, a first downmix signal that is one of a 1-channel audio signal and a 2-channel audio signal from an input multi-channel audio signal; a downmix signal coding unit (404) that codes the first downmix signal; a first t-f converting unit (401) that converts the input multi-channel audio signal into a multi-channel audio signal in a frequency domain; and a spatial information calculating unit (409) that generates spatial information for generating a multi-channel audio signal from a downmix signal.
摘要:
To provide a bandwidth extension method which allows reduction of computation amount in bandwidth extension and suppression of deterioration of quality in the bandwidth to be extended. In the bandwidth extension method: a low frequency bandwidth signal is transformed into a QMF domain to generate a first low frequency QMF spectrum; pitch-shifted signals are generated by applying different shifting factors on the low frequency bandwidth signal; a high frequency QMF spectrum is generated by time-stretching the pitch-shifted signals in the QMF domain; the high frequency QMF spectrum is modified; and the modified high frequency QMF spectrum is combined with the first low frequency QMF spectrum.
摘要:
To provide a bandwidth extension method which allows reduction of computation amount in bandwidth extension and suppression of deterioration of quality in the bandwidth to be extended. In the bandwidth extension method: a low frequency bandwidth signal is transformed into a QMF domain to generate a first low frequency QMF spectrum; pitch-shifted signals are generated by applying different shifting factors on the low frequency bandwidth signal; a high frequency QMF spectrum is generated by time-stretching the pitch-shifted signals in the QMF domain; the high frequency QMF spectrum is modified; and the modified high frequency QMF spectrum is combined with the first low frequency QMF spectrum.
摘要:
To provide an audio signal processing apparatus which can perform, with low operation amount, audio signal processing that is either time stretch and/or compression processing or frequency modulation processing. The audio signal processing apparatus is intended to transform an input audio signal sequence using a predetermined adjustment factor. The audio signal processing apparatus includes a filter bank (2601) which transforms the input audio signal sequence into Quadrature Mirror Filter (QMF) coefficients using a filter for Quadrature Mirror Filter analysis (a QMF analysis filter) and an adjusting unit (2602) which adjusts the QMF coefficients based on a predetermined adjustment factor.
摘要:
To provide an enhanced true-to-life atmosphere enjoyed in multipoint connecting, and reduce a calculation load at a multipoint connection unit, as well.A stream synthesizing device includes an input unit which inputs at least two coded signals each including a first downmix acoustic signal and an extended signal, each of first downmix acoustic signals being obtained by coding an acoustic signal into which at least two sound signals are downmixed, and the extended signal being for obtaining the at least two sound signals out of the first downmix acoustic signal; a coded signal generating unit which generates: a second downmix acoustic signal and an extended signal based on each of coded signals inputted by the input unit, the second downmix acoustic signal being for obtaining each of the first downmix acoustic signals, and the generated extended signal being for obtaining each of the first downmix acoustic signals out of the second downmix acoustic signal; and generate a coded signal including the generated second downmix acoustic signal, the generated extended signal, and each of extended signals included in the corresponding inputted coded signal; and an output unit which outputs the generated coded signal.
摘要:
Provided is an encoding device (1) including: a pitch contour analysis unit (101) which detects information, a dynamic time-warping unit (102) which generates, based on the information, pitch change ratios (Tw_ratio in FIG. 18) within a range (86) including a range (86a) of the pitch change ratios corresponding to absolute pitch differences of 42 cents or larger; a first lossless coding unit (103) which codes the generated pitch parameters (102x); a time-warping unit (104) which shifts a pitch of a signal according to the information; and a second encoding unit which codes a signal (104x) obtained by the shifting.
摘要:
A coding device includes: a pitch contour detection unit which detects a pitch contour of an input audio signal; a dynamic time warping unit which determines the number of pitch nodes based on the pitch contour and generates a first time warping parameter including information indicating the determined number of pitch nodes, a pitch change position, and a pitch change ratio; a first encoder which codes the first time warping parameter; a time warping unit which corrects pitch, using the information obtained from the first time warping parameter, to approximate the pitches of the number of pitch nodes to a predetermined reference value; a second encoder which codes the input audio signal at the corrected pitch; and a multiplexer which multiplexes the coded time warping parameter and the coded audio signal to generate a bitstream.
摘要:
Provided are a new hybrid audio decoder and a new hybrid audio encoder having block switching for speech signals and audio signals. Currently, very low bitrate audio coding methods for speech and audio signal are proposed. These audio coding methods cause very long delay. Generally, in coding an audio signal, algorithm delay tends to be long to achieve higher frequency resolution. In coding a speech signal, the delay needs to be reduced because the speech signal is used for telecommunication. To balance fine coding quality for these two kinds of input signals with very low bitrate, this invention provides a combination of a low delay filter bank like AAC-ELD and a CELP coding method.
摘要:
The delay in a multi-channel audio coding apparatus and a multi-channel audio decoding apparatus is reduced. The audio coding apparatus includes: a downmix signal generating unit that generates, in a time domain, a first downmix signal that is one of a 1-channel audio signal and a 2-channel audio signal from an input multi-channel audio signal; a downmix signal coding unit that codes the first downmix signal; a first t-f converting unit that converts the input multi-channel audio signal into a multi-channel audio signal in a frequency domain; and a spatial information calculating unit that generates spatial information for generating a multi-channel audio signal from a downmix signal.
摘要:
A coding apparatus which suppresses an extreme increase in a bit rate, includes: a downmixing and coding unit (301) that downmixes audio signals that have been provided, to reduce the number of channels to be fewer than the number of the provided audio signals, and to code the downmix signals; an object parameter extracting unit (304) that extracts parameters indicating correlation between the audio signals; and a multiplexing circuit (309) that multiplexes the extracted parameters with the generated downmix coded signals. The object parameter extracting unit (304) includes: an object classifying unit (305) that classifies each of the provided audio signals into a predetermined one of types based on audio characteristics; and an object parameter extracting circuit (308) that extracts parameters using a temporal granularity and a frequency granularity each of which is determined for a corresponding one of the types.