Abstract:
An encoding device includes: a frequency domain converter which converts an input audio signal into the frequency domain; a band selector which selects a quantization target band from a plurality of sub-bands obtained by dividing the frequency domain; and a shape quantizer which quantizes the shape of the frequency domain parameter of the quantization target band. When a predictive encoding presence/absence determiner determines that the number of sub-bands common to the current quantization target band and the quantization target band selected in the past is not smaller than a predetermined value, a gain quantizer performs predictive encoding on the gain of the frequency domain parameter of the quantization target band. When the number of common sub-bands is smaller than the predetermined value, the gain quantizer non-predictively encodes the gain of the frequency domain parameter of the quantization target band.
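The switching rule described above can be sketched as follows. This is only an illustration: the function name, the representation of a band as a list of sub-band indices, and the mode labels are assumptions, not taken from the abstract.

```python
def choose_gain_coding_mode(current_band, previous_band, threshold):
    """Pick the gain coding mode from the overlap between two band selections.

    current_band / previous_band: iterables of sub-band indices (assumed form).
    threshold: the "predetermined value" from the abstract.
    """
    # Count sub-bands selected both now and in the past.
    common = len(set(current_band) & set(previous_band))
    # "Not smaller than" the threshold means predictive coding is used.
    return "predictive" if common >= threshold else "non_predictive"
```

For example, with a threshold of 3, the selections `[2, 3, 4, 5]` and `[3, 4, 5, 6]` share three sub-bands, so the gain would be coded predictively.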
Abstract:
Provided is a decoding device which suppresses generation of an abnormal sound caused by a layer switch. The decoding device includes: a first layer decoding unit (202) which performs a decoding process on first layer encoded data so as to generate a first layer decoding signal; a second layer decoding unit (203) which performs a decoding process on second layer encoded data so as to generate a first layer decoding error signal; an adder (204) which adds the first layer decoding signal and the first layer decoding error signal so as to generate a second layer decoding signal; a switching unit (205) which switches between the first layer decoding signal and the second layer decoding signal for output according to layer information; and a post-filter (206) which selects a control parameter corresponding to the layer information, smooths the selected control parameter so as to generate a smoothed control parameter, and performs a filter process on the decoding signal from the switching unit (205) by using the smoothed control parameter.
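The abstract does not say how the control parameter is smoothed. One common choice, shown here purely as an assumed example, is a first-order recursive average that moves the parameter gradually toward the value associated with the current layer. The names `targets` and `alpha` are hypothetical.

```python
def smooth_control_parameter(previous, layer, targets, alpha=0.9):
    """Move the post-filter parameter gradually toward the current layer's value.

    previous: smoothed parameter from the previous frame.
    layer:    layer information for the current frame (key into targets).
    targets:  mapping from layer to its control parameter (assumed values).
    alpha:    smoothing factor in [0, 1); larger means slower transitions.
    """
    # First-order recursive smoothing (an assumption, not the patented method).
    return alpha * previous + (1.0 - alpha) * targets[layer]
```

Applied once per frame, this avoids an abrupt jump in the filter characteristic when the output switches between layers.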
Abstract:
Provided is a voice encoding device which can accurately encode a spectrum shape of a signal having a strong tonality such as a vowel. The device includes: a sub-band constituting unit (151) which divides a first layer error conversion coefficient to be encoded into M sub-bands so as to generate M sub-band conversion coefficients; a shape vector encoding unit (152) which encodes each of the M sub-band conversion coefficients so as to obtain M pieces of shape encoded information and calculates a target gain for each of the M sub-band conversion coefficients; a gain vector forming unit (153) which forms one gain vector by using the M target gains; a gain vector encoding unit (154) which encodes the gain vector so as to obtain gain encoded information; and a multiplexing unit (155) which multiplexes the shape encoded information with the gain encoded information.
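A minimal sketch of the shape-then-gain structure described above, assuming equal-width sub-bands and a small codebook of non-zero shape vectors. The search criterion (maximize squared correlation divided by shape energy) and the least-squares target gain are standard shape-gain quantization choices used here as assumptions; the abstract does not specify them.

```python
def encode_subbands(coeffs, num_subbands, shape_codebook):
    """Return (shape indices, target gains) for M equal-width sub-bands.

    coeffs:         error coefficients to encode (length divisible by M).
    shape_codebook: list of non-zero shape vectors, one sub-band long each.
    """
    band_len = len(coeffs) // num_subbands
    shape_indices, target_gains = [], []
    for m in range(num_subbands):
        band = coeffs[m * band_len:(m + 1) * band_len]
        best_idx, best_score, best_gain = 0, -1.0, 0.0
        for idx, shape in enumerate(shape_codebook):
            corr = sum(s * x for s, x in zip(shape, band))
            energy = sum(s * s for s in shape)
            score = corr * corr / energy  # larger means a better shape match
            if score > best_score:
                # corr / energy is the least-squares gain for this shape.
                best_idx, best_score, best_gain = idx, score, corr / energy
        shape_indices.append(best_idx)
        target_gains.append(best_gain)
    # The M target gains together form the single gain vector to be encoded.
    return shape_indices, target_gains
```

Quantizing the M gains jointly as one vector, rather than one by one, is the step the abstract assigns to the gain vector encoding unit; that step is not shown here.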
Abstract:
Disclosed is an encoding device which can accurately identify a band having a large error among all the bands with a small amount of calculation. The device includes: a first position identification unit (201) which uses a first layer error conversion coefficient, indicating an error of a decoding signal relative to an input signal, to search all the bands of the input signal in units of a relatively wide bandwidth for a band having a large error, and generates first position information indicating the identified band; a second position identification unit (202) which searches the band identified by the first position identification unit (201) in units of a relatively narrow bandwidth for a target frequency band having a large error, and generates second position information indicating the identified target frequency band; and an encoding unit (203) which encodes the first layer error conversion coefficient contained in the target frequency band so as to generate encoded information. The first position information, the second position information, and the encoded information are transmitted to a communication partner.
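The coarse-to-fine search can be illustrated as below. Using squared-coefficient energy as the error measure, non-overlapping wide bands, and a one-coefficient step for the narrow search are all assumptions made for this sketch.

```python
def find_target_band(error_coeffs, wide_width, narrow_width):
    """Two-stage search for the narrow band with the largest error energy.

    Returns (first_pos, second_pos): start index of the chosen wide band and
    start index of the chosen narrow band inside it (both absolute indices).
    """
    def energy(start, width):
        return sum(c * c for c in error_coeffs[start:start + width])

    # Stage 1: coarse search over non-overlapping wide bands.
    wide_starts = range(0, len(error_coeffs) - wide_width + 1, wide_width)
    first_pos = max(wide_starts, key=lambda s: energy(s, wide_width))

    # Stage 2: fine search restricted to the selected wide band.
    narrow_starts = range(first_pos, first_pos + wide_width - narrow_width + 1)
    second_pos = max(narrow_starts, key=lambda s: energy(s, narrow_width))
    return first_pos, second_pos
```

Because the fine search only covers one wide band, the number of energy evaluations is far smaller than sliding the narrow window over the whole spectrum.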
Abstract:
Disclosed are a decoding device and the like capable of flexibly calculating high-band spectrum data with high accuracy in accordance with an encoding band selected by an upper-node layer of the encoding side. In this device: a first layer decoding unit (202) decodes first layer encoded information to generate a first layer decoded signal; a second layer decoding unit (204) decodes second layer encoded information to generate a second layer decoded signal; a spectrum decoding unit (205) performs a band extension process by using the second layer decoded signal and the first layer decoded signal up-sampled in an up-sampling unit (203) so as to generate an all-band decoded signal; and a switch (206) outputs the first layer decoded signal or the all-band decoded signal according to the control information generated in a control unit (201).
Abstract:
Disclosed is a scalable encoding device that realizes scalable encoding of a stereo sound signal by CELP encoding and improves the encoding efficiency. In this device, an adder and a multiplier obtain an average of a first channel signal CH1 and a second channel signal CH2 as a monaural signal M. A CELP encoder for a monaural signal subjects the monaural signal M to CELP encoding, outputs the obtained encoded parameter externally, and outputs a synthesized monaural signal M′, synthesized by using the encoded parameter, to a first channel signal encoder. By using the synthesized monaural signal M′ and the second channel signal CH2, the first channel signal encoder subjects the first channel signal CH1 to CELP encoding so as to minimize the sum of the encoding distortion of the first channel signal CH1 and the encoding distortion of the second channel signal CH2.
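The downmix and the summed-distortion criterion can be sketched as follows. The abstract states that M is the average of CH1 and CH2. The reconstruction of the second channel as 2M′ − CH1′ is an assumption that follows from inverting that average, and plain squared error stands in for whatever weighted distortion measure the CELP encoder actually uses.

```python
def downmix(ch1, ch2):
    """Monaural signal M as the sample-wise average of the two channels."""
    return [0.5 * (a + b) for a, b in zip(ch1, ch2)]


def summed_distortion(ch1, ch2, mono_synth, ch1_candidate):
    """Total squared error over both channels for one candidate CH1'.

    mono_synth:    synthesized monaural signal M'.
    ch1_candidate: candidate synthesized first channel CH1'.
    """
    # Assumption: the second channel is recovered as CH2' = 2*M' - CH1'.
    ch2_candidate = [2.0 * m - c for m, c in zip(mono_synth, ch1_candidate)]
    d1 = sum((a - b) ** 2 for a, b in zip(ch1, ch1_candidate))
    d2 = sum((a - b) ** 2 for a, b in zip(ch2, ch2_candidate))
    return d1 + d2
```

An encoder following this idea would evaluate `summed_distortion` for each candidate CH1′ and keep the one with the smallest total, so an error in CH1′ is not allowed to grow unchecked in the derived CH2′.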
Abstract:
Disclosed is a scalable decoding apparatus capable of providing high-quality decoded audio signals with less degradation of the high frequency spectrum, even when the high frequency spectrum is generated from a low frequency spectrum during decoding. In the apparatus, an amplitude adjusting part (1211) uses different adjustment coefficients in accordance with the characteristic of first layer spectrum information to adjust the amplitude of a first layer decoded signal spectrum, and then outputs the amplitude-adjusted first layer decoded signal spectrum to a pseudo-spectrum generating part (1012). Using the amplitude-adjusted first layer decoded signal spectrum received from the amplitude adjusting part (1211), the pseudo-spectrum generating part (1012) generates a pseudo-spectrum of high frequencies and outputs it to a scaling part (1013). The scaling part (1013) scales the spectrum received from the pseudo-spectrum generating part (1012) and then outputs it to an adder (B).
Abstract:
A speech encoding method, apparatus and program, wherein: an input speech signal is divided into a plurality of frames each having a predetermined length; each of the frames is subdivided into a plurality of subframes; a predictive pitch period of a subframe in a to-be-encoded current frame is obtained by using pitch periods of at least two frames among the current frame and past and future frames with respect to the current frame; a pitch period of a subframe in the current frame is obtained by using the predictive pitch period; a relative pitch pattern codebook storing a plurality of relative pitch patterns representing fluctuations in pitch periods of a plurality of subframes is prepared; and a change in pitch period of plural subframes is expressed with one relative pitch pattern selected from the relative pitch pattern codebook.
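The codebook selection step can be illustrated as below. It assumes each relative pitch pattern is a list of per-subframe offsets added to the predictive pitch period, and that the best pattern is the one with the smallest squared error against the measured subframe pitch periods. Neither detail is stated in the abstract.

```python
def select_relative_pitch_pattern(subframe_pitches, predicted_pitch, codebook):
    """Return the index of the pattern that best tracks the subframe pitches.

    subframe_pitches: measured pitch period per subframe in the current frame.
    predicted_pitch:  predictive pitch period for the frame.
    codebook:         list of patterns, each a list of per-subframe offsets
                      relative to predicted_pitch (assumed representation).
    """
    def error(pattern):
        return sum((p - (predicted_pitch + d)) ** 2
                   for p, d in zip(subframe_pitches, pattern))

    return min(range(len(codebook)), key=lambda i: error(codebook[i]))
```

With this representation, a single codebook index describes the pitch movement across all subframes, instead of one pitch value being sent per subframe.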
Abstract:
A method for encoding speech, wherein: an input speech signal is separated by a component separator into a first component mainly constituted by speech and a second component mainly constituted by a background noise at each predetermined unit of time; a bit allocation selector selects bit allocation for each component based on the first and second components from among a plurality of predetermined candidates for bit allocation; a speech encoder and a noise encoder encode the first and second components from the component separator based on the bit allocation according to predetermined different methods for encoding; and a multiplexer multiplexes encoded data of the first and second components and information on the bit allocation and outputs them as transmitted encoded data.
Abstract:
Disclosed is a voice coding device capable of preventing overall quality degradation even when the bit rate for coding is lowered. The voice coding device codes a wide band signal in a first layer, and codes an extended band signal, whose frequency band is located at higher frequencies than the wide band signal, in an extended band layer. An adaptive band selection unit (301) selects a frequency band to be excluded from a coding object in the extended band layer or a frequency band whose energy is to be attenuated in the extended band layer. A band-limited signal generation unit (302) excludes, within the frequency band of an input signal, the frequency band selected by the adaptive band selection unit (301) from the coding object, or attenuates the energy of the frequency band selected by the adaptive band selection unit (301).
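The band-limited signal generation step can be sketched as follows, assuming the signal is available as a list of spectral coefficients and the selected band is a half-open index range. Both representations, and the single scale factor used for attenuation, are assumptions for illustration.

```python
def band_limit(spectrum, band, attenuation=0.0):
    """Exclude or attenuate one frequency band of a spectrum.

    spectrum:    list of spectral coefficients.
    band:        (start, stop) bin indices of the selected band, stop exclusive.
    attenuation: amplitude scale applied inside the band; 0.0 removes the band
                 from the coding object, values in (0, 1) attenuate its energy.
    """
    start, stop = band
    return [x * attenuation if start <= k < stop else x
            for k, x in enumerate(spectrum)]
```

For instance, `band_limit(spec, (1, 3))` zeroes bins 1 and 2, while `band_limit(spec, (1, 3), 0.5)` halves their amplitude and leaves the other bins untouched.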