摘要:
An audio decoding apparatus and method are provided. The audio decoding apparatus includes a spectrum converting part configured to divide the first frequency spectrum in each channel of the first audio signal in a time direction or in a frequency direction to calculate a first signal sequence having the same time resolution and the same frequency resolution in all the channels of the first audio signal, a down-mixing part configured to perform weighted addition on the signals at the same time and within the same frequency band included in the first signal sequence in all the channels to calculate a second signal sequence having channels of a second number different from the first number of channels.
摘要:
An audio decoding method includes: acquiring, from encoded audio data, a reception audio signal and first auxiliary decoded audio information; calculating coefficient information from the first auxiliary decoded audio information; generating a decoded output audio signal based on the coefficient information and the reception audio signal; decoding to result in a decoded audio signal based on the first auxiliary decoded audio signal and the reception audio signal; calculating, from the decoded audio signal, second auxiliary decoded audio information corresponding to the first auxiliary decoded audio information; detecting a distortion caused in a decoding operation of the decoded audio signal by comparing the second auxiliary decoded audio information with the first auxiliary decoded audio information; correcting the coefficient information in response to the detected distortion; and supplying the corrected coefficient information as the coefficient information when generating the decoded output audio signal.
摘要:
In a voice packet communication system, a voice packet loss concealment device compensates for the deterioration of voice quality due to voice packet loss. In the device, a detecting section detects a loss of a voice packet and outputting information; an estimating section estimates the voice characteristics of the lost segment using a pre-loss voice packet received before the lost segment or a post-loss voice packet received after the lost segment; a pitch signal generating section generates a pitch signal having the voice characteristics; and a lost packet generating section outputs the pitch signal generated by the pitch signal generating section, with the voice characteristics estimated by the estimating section, which allows abnormal noise and feeling of mute, subjective deterioration of naturalness and continuity to be improved, and the voice packet loss concealment to be further improved.
摘要:
An audio encoding apparatus comprising: a power calculation unit that calculates a power fluctuation ratio based on the input signal; a calculation unit that calculates a prediction gain fluctuation ratio based on the input signal; and a block length judging unit that selects one of encoding using a long block mode segmenting an input signal into frames each consisting of a predetermined number of samples and encoding each of the frames, and encoding using a short block mode segmenting each of the frames into short blocks and encoding each of the short blocks, based on the power fluctuation ratio and the prediction gain fluctuation ratio.
摘要:
A downmixing device includes: a matrix conversion unit configured to perform a matrix operation for an input signal; a rotation correction unit configured to rotate an output signal of the matrix conversion unit; a spatial information extraction unit configured to extract spatial information from the output signal of the rotation correction unit; and an error calculation unit configured to calculate an error amount of the matrix operation result for the input signal by performing a matrix operation for the output signal of the rotation correction unit and the spatial information extracted by the spatial information extraction unit using a matrix that is inverse to the matrix used for the matrix operation by the matrix conversion unit.
摘要:
An audio decoding method includes: acquiring, from encoded audio data, a reception audio signal and first auxiliary decoded audio information; calculating coefficient information from the first auxiliary decoded audio information; generating a decoded output audio signal based on the coefficient information and the reception audio signal; decoding to result in a decoded audio signal based on the first auxiliary decoded audio signal and the reception audio signal; calculating, from the decoded audio signal, second auxiliary decoded audio information corresponding to the first auxiliary decoded audio information; detecting a distortion caused in a decoding operation of the decoded audio signal by comparing the second auxiliary decoded audio information with the first auxiliary decoded audio information; correcting the coefficient information in response to the detected distortion; and supplying the corrected coefficient information as the coefficient information when generating the decoded output audio signal.
摘要:
An audio information processing apparatus and method include dividing an audio signal, determining a time period having a power change ratio of an audio signal larger than a first threshold value as an attack candidate, searching the time period of the attack candidate and a time period immediately before the time period of the attack candidate for an attack starting point, correcting a power of an audio signal included in the time period, and determining whether a power change ratio of the audio signal included in the time period is larger than a second threshold value for attack detection which is larger than the first threshold value.
摘要:
A disclosed encoding device converts an audio signal into frequency spectra, determines allowable error powers with respect to bands divided by the frequency of the audio signal by a predetermined with, detects a tonal frequency spectrum from the frequency spectra, and detects a band containing the frequency spectrum. Using the detection result and the allowable error powers, the encoding device performs correction such that allowable error powers determined by a power determining unit with respect to bands adjacent to the band detected by a detecting unit become smaller than the powers of the frequency spectra with respect to the adjacent bands, and quantizes each of frequency spectra having greater powers than the corrected allowable error powers.
摘要:
An audio coding device includes a time frequency transform unit that, with respect to each of a plurality of channels included in an audio signal, generates a time frequency signal indicating frequency components at each time by performing a time frequency transform on a signal of the channel; a transient detection unit that detects a transient with respect to each of the plurality of channels so as to obtain a transient detection time; a transient time correction unit that, when a difference in transient detection times between an early detection channel in which the transient detection time is earliest and a late detection channel that is a channel other than the early detection channel among the plurality of channels is within a range in which the transient; a grid determination unit that, with respect to each of the plurality of channels, and a coding unit that codes.
摘要:
An encoder includes, a degree-of-importance calculating unit that calculates a degree of importance of each of a first number of signals included in input signals; a signal converting unit that converts the first number of signals included in the input signals into a second number of signals; a degree-of-importance converting unit that converts a first number of degrees of importance, a number of which is equal to the first number of signals, calculated by the degree-of-importance calculating unit into a second number of degrees of importance, a number of which is equal to the second number of signals; a number-of-bits determining unit that determines a number of bits for use in quantizing each of the second number of signals obtained by the conversion performed by the signal converting; and a quantizing unit that quantizes each of the second number.