Abstract:
An audio encoding device includes a time-frequency transformer that transforms signals of channels, a first spatial-information determiner that generates a frequency signal of a third channel, a second spatial-information determiner that generates a frequency signal of the third channel, a similarity calculator that calculates a similarity between the frequency signal of the at least one first channel and the frequency signal of the at least one second channel, a phase-difference calculator that calculates a phase difference between the frequency signal of the at least one first channel and the signal of the at least one second channel, a controller that controls determination of the first spatial information when the similarity and the phase difference satisfy a predetermined determination condition, a channel-signal encoder that encodes the frequency signal of the third channel, and a spatial-information encoder that encodes the first spatial information or the second spatial information.
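The similarity and phase-difference calculations can be illustrated with a minimal sketch. The abstract does not give formulas, so this assumes a common choice: the normalized magnitude of the complex cross-correlation as the similarity, and its argument as the phase difference. The function names, the thresholds, and the form of the determination condition are all hypothetical.

```python
import cmath
import math


def similarity_and_phase(first, second):
    """Return (similarity, phase_difference) for two complex spectra.

    Assumption: similarity is the normalized cross-correlation magnitude
    (0..1) and the phase difference is the cross-correlation argument.
    """
    if len(first) != len(second):
        raise ValueError("spectra must have the same length")
    # Complex cross-correlation between the two channels.
    cross = sum(a * b.conjugate() for a, b in zip(first, second))
    energy_first = sum(abs(a) ** 2 for a in first)
    energy_second = sum(abs(b) ** 2 for b in second)
    if energy_first == 0.0 or energy_second == 0.0:
        return 0.0, 0.0  # a silent channel has no meaningful similarity
    similarity = abs(cross) / math.sqrt(energy_first * energy_second)
    return similarity, cmath.phase(cross)


def condition_met(similarity, phase_difference,
                  min_similarity=0.9, min_phase=math.pi / 2):
    """Hypothetical determination condition: similar but phase-shifted."""
    return similarity >= min_similarity and abs(phase_difference) >= min_phase
```

For two channels that carry the same spectrum with opposite sign, this sketch reports a similarity close to 1 and a phase difference of magnitude close to pi, which the hypothetical condition above would flag.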
Abstract:
A decoded sound analysis unit (104) calculates, regarding the frequency-region stereo signals L(b) and R(b) decoded by the PS decoding unit (103), a second degree of similarity (109) and a second intensity difference (110) from the decoded sound signals. A spectrum correction unit (105) detects a distortion added by the parametric-stereo conversion by comparing the second degree of similarity (109) and the second intensity difference (110) calculated at the decoding side with the first degree of similarity (107) and the first intensity difference (108) calculated and transmitted from the encoding side, and corrects the spectrum of the frequency-region stereo decoded signals L(b) and R(b).
Abstract:
An audio information processing apparatus and method include dividing an audio signal, determining a time period having a power change ratio of an audio signal larger than a first threshold value as an attack candidate, searching the time period of the attack candidate and a time period immediately before the time period of the attack candidate for an attack starting point, correcting a power of an audio signal included in the time period, and determining whether a power change ratio of the audio signal included in the time period is larger than a second threshold value for attack detection which is larger than the first threshold value.
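The two-threshold idea can be sketched as follows. This is a simplified illustration under stated assumptions: the signal is divided into fixed-size blocks, the power change ratio is the mean power of a block divided by that of the previous block, and the starting-point search and power-correction steps are omitted because the abstract does not describe them. All names and parameters are hypothetical.

```python
def block_powers(samples, block_size):
    """Mean power of each consecutive, non-overlapping block."""
    powers = []
    for start in range(0, len(samples) - block_size + 1, block_size):
        block = samples[start:start + block_size]
        powers.append(sum(x * x for x in block) / block_size)
    return powers


def detect_attacks(samples, block_size, first_threshold, second_threshold,
                   floor=1e-12):
    """Return (candidate_blocks, attack_blocks) as lists of block indices.

    A block is a candidate when its power change ratio exceeds
    first_threshold, and an attack when it also exceeds second_threshold.
    """
    if second_threshold <= first_threshold:
        raise ValueError("second_threshold must exceed first_threshold")
    powers = block_powers(samples, block_size)
    candidates, attacks = [], []
    for i in range(1, len(powers)):
        # floor avoids division by zero after a silent block
        ratio = powers[i] / max(powers[i - 1], floor)
        if ratio > first_threshold:
            candidates.append(i)
            if ratio > second_threshold:
                attacks.append(i)
    return candidates, attacks
```

In this sketch the lower threshold only nominates blocks; the higher threshold makes the final decision, mirroring the candidate-then-detection order in the abstract.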
Abstract:
A decoding apparatus includes a unit decoding and inversely quantizing coded data to obtain frequency domain audio signal data, a unit computing from the coded data one of the number of scale bits composed of the number of bits corresponding to the scale value of the coded data and the number of spectrum bits composed of the number of bits corresponding to the spectrum value of the coded data, a unit estimating a quantization error of the frequency domain audio signal data based on one of the number of scale bits and the number of spectrum bits of the coded data, a unit computing a correction amount based on the estimated quantization error and correcting the frequency domain audio signal data obtained by the frequency domain data obtaining unit based on the computed correction amount, and a unit converting the corrected frequency domain audio signal data into the audio signal.
Abstract:
An audio coding device includes a time-to-frequency converter that performs time-to-frequency conversion on each frame of a signal in at least one channel included in an audio signal in a predetermined length of time in order to convert the signal in the at least one channel to a frequency signal, and a complexity calculator that calculates complexity of the frequency signal for each of the at least one channel. The audio coding device further includes a bit allocation controller that determines a number of bits to be allocated to each of the at least one channel so that more bits are allocated to each of the at least one channel as the complexity of each of the at least one channel increases, and increases the number of bits to be allocated as an estimation error in the number; and a coder that codes the frequency signal.
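A minimal sketch of complexity-driven bit allocation follows. It assumes a simple proportional rule, which is only one way to satisfy "more bits as complexity increases"; the abstract does not state the actual rule, and the estimation-error adjustment is left out because the abstract does not describe it. The function name and signature are hypothetical.

```python
def allocate_bits(complexities, total_bits):
    """Split total_bits across channels in proportion to their complexity.

    Assumption: a proportional split; channels with equal complexity
    receive (almost) equal shares, and all bits are handed out.
    """
    if total_bits < 0 or any(c < 0 for c in complexities):
        raise ValueError("complexities and total_bits must be non-negative")
    count = len(complexities)
    if count == 0:
        return []
    total_complexity = sum(complexities)
    if total_complexity == 0:
        shares = [total_bits / count] * count  # nothing to rank: split evenly
    else:
        shares = [total_bits * c / total_complexity for c in complexities]
    bits = [int(share) for share in shares]
    # Hand out the bits lost to truncation, largest fractional part first.
    leftover = total_bits - sum(bits)
    order = sorted(range(count), key=lambda i: shares[i] - bits[i],
                   reverse=True)
    for i in order[:leftover]:
        bits[i] += 1
    return bits
```

With complexities of 1.0 and 3.0 and a budget of 100 bits, this sketch gives the more complex channel three times as many bits as the simpler one.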
Abstract:
A downmixing device includes: a matrix conversion unit configured to perform a matrix operation for an input signal; a rotation correction unit configured to rotate an output signal of the matrix conversion unit; a spatial information extraction unit configured to extract spatial information from the output signal of the rotation correction unit; and an error calculation unit configured to calculate an error amount of the matrix operation result for the input signal by performing a matrix operation for the output signal of the rotation correction unit and the spatial information extracted by the spatial information extraction unit using a matrix that is inverse to the matrix used for the matrix operation by the matrix conversion unit.
Abstract:
An encoding device includes an estimation unit to estimate a decoded signal of a plurality of channels based on a down-mix signal obtained by down-mixing an input signal of the plurality of channels, similarity between the channels of the input signal, and an intensity difference between the channels of the input signal; an analysis unit to analyze a phase of the input signal and a phase of the decoded signal; a calculation unit to calculate phase information based on the phase of the input signal and the phase of the decoded signal; and a coding unit to encode the similarity between the channels of the input signal, the intensity difference between the channels of the input signal, and the phase information.
Abstract:
An audio decoding apparatus and method are provided. The audio decoding apparatus includes a spectrum converting part configured to divide the first frequency spectrum in each channel of the first audio signal in a time direction or in a frequency direction to calculate a first signal sequence having the same time resolution and the same frequency resolution in all the channels of the first audio signal, and a down-mixing part configured to perform weighted addition on the signals at the same time and within the same frequency band included in the first signal sequence in all the channels to calculate a second signal sequence having channels of a second number different from the first number of channels.
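The weighted addition performed by the down-mixing part can be sketched as a weight matrix applied bin by bin. This assumes every input channel has already been brought to the same time and frequency resolution, so that index k refers to the same time-frequency bin in each channel. The function name and the example weights are hypothetical.

```python
def downmix(channels, weights):
    """Weighted addition of aligned channels.

    channels: list of input-channel sequences, all of the same length.
    weights:  one row per output channel, one weight per input channel.
    Returns one output sequence per row of weights.
    """
    if not channels:
        raise ValueError("at least one input channel is required")
    length = len(channels[0])
    if any(len(ch) != length for ch in channels):
        raise ValueError("all input channels must share one resolution")
    if any(len(row) != len(channels) for row in weights):
        raise ValueError("each weight row needs one weight per input channel")
    return [
        [sum(w * ch[k] for w, ch in zip(row, channels)) for k in range(length)]
        for row in weights
    ]
```

For example, three input channels can be reduced to two by sending the first two straight through and adding half of the third to each output.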
Abstract:
An audio encoding apparatus that encodes audio signals of a plurality of channels includes an adaptive bit allocation control unit that adaptively controls a number of encoding bits assigned to the audio signal of each channel in accordance with perceptual entropy of the audio signal of each of the channels, a fixed bit allocation control unit that fixedly controls the number of encoding bits assigned to the audio signal of each of the channels in predetermined allocations, and a channel encoding unit that encodes the audio signal of each of the channels based on the number of adaptive allocation bits assigned by the adaptive bit allocation control unit and the number of fixed allocation bits assigned by the fixed bit allocation control unit.
Abstract:
In a voice packet communication system, a voice packet loss concealment device compensates for the deterioration of voice quality due to voice packet loss. In the device, a detecting section detects a loss of a voice packet and outputs information; an estimating section estimates the voice characteristics of the lost segment using a pre-loss voice packet received before the lost segment or a post-loss voice packet received after the lost segment; a pitch signal generating section generates a pitch signal having the voice characteristics; and a lost packet generating section outputs the pitch signal generated by the pitch signal generating section, with the voice characteristics estimated by the estimating section. This reduces abnormal noise, the sensation of muting, and the subjective deterioration of naturalness and continuity, further improving voice packet loss concealment.
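A rough sketch of pitch-based concealment from the pre-loss packet follows. The abstract does not say how the voice characteristics are estimated, so this assumes a plain autocorrelation pitch search and fills the lost segment by repeating the last pitch period; both are swapped-in textbook techniques, and the function names and lag bounds are hypothetical.

```python
import math


def estimate_pitch_lag(history, min_lag, max_lag):
    """Pick the lag in [min_lag, max_lag] with the largest autocorrelation.

    Assumption: unnormalized autocorrelation, which slightly favors
    shorter lags because they sum over more sample pairs.
    """
    if not 0 < min_lag <= max_lag < len(history):
        raise ValueError("need 0 < min_lag <= max_lag < len(history)")
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        score = sum(history[n] * history[n - lag]
                    for n in range(lag, len(history)))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def conceal_lost_packet(history, packet_size, min_lag, max_lag):
    """Fill a lost packet by repeating the last pitch period of history."""
    lag = estimate_pitch_lag(history, min_lag, max_lag)
    last_period = history[-lag:]
    return [last_period[i % lag] for i in range(packet_size)]
```

On a steady periodic signal this sketch reproduces the missing samples almost exactly; real speech would also need smoothing at the packet boundaries and gain attenuation over long losses, which are not shown here.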