摘要:
Disclosed is a spectral smoothing device with a structure whereby smoothing is performed after a nonlinear conversion has been performed for a spectrum calculated from an audio signal, and with which the amount of processing calculation is significantly reduced while maintaining excellent audio quality. With this spectral smoothing device, a sub band division unit (102) divides an input spectrum into multiple sub bands; a representative value calculation unit (103) calculates a representative value for each sub band using an arithmetic mean and a geometric mean; with respect to each representative value, a nonlinear conversion unit (104) performs a nonlinear conversion the characteristic of which is further emphasized as the value increases; and a smoothing unit (105) that smoothes the representative value which has undergone the nonlinear conversion for each sub band, at the frequency domain.
摘要:
Disclosed is a spectral smoothing device with a structure whereby smoothing is performed after a nonlinear conversion has been performed for a spectrum calculated from an audio signal, and with which the amount of processing calculation is significantly reduced while maintaining excellent audio quality. With this spectral smoothing device, a sub band division unit (102) divides an input spectrum into multiple sub bands; a representative value calculation unit (103) calculates a representative value for each sub band using an arithmetic mean and a geometric mean; with respect to each representative value, a nonlinear conversion unit (104) performs a nonlinear conversion the characteristic of which is further emphasized as the value increases; and a smoothing unit (105) that smoothes the representative value which has undergone the nonlinear conversion for each sub band, at the frequency domain.
摘要:
Disclosed is an encoding device which can accurately specify a band having a large error among all the bands by using a small calculation amount. A first position identifier uses a first layer error conversion coefficient indicating an error of a decoding signal for an input signal so as to search for a band having a large error in a relatively wide bandwidth in all the bands of the input signal and generates first position information indicating the identified band. A second position identifier searches for a target frequency band having a large error in a relatively narrow bandwidth in the band identified by the first position identifier and generates second position information indicating the identified target frequency band. An encoder encodes a first layer decoding error conversion coefficient contained in the target frequency band.
摘要:
Provided is an encoding device which can reduce the encoding distortion as compared to the conventional technique and can obtain a preferable sound quality for auditory sense. In the encoding device, a shape quantization unit (111) quantizes the shape of an input spectrum with a small number of pulse positions and polarities. The shape quantization unit (111) sets a pulse amplitude width to be searched later upon search of the pulse position to a value not greater than the pulse amplitude width which has been searched previously. A gain quantization unit (112) calculates a gain of a pulse searched by the shape quantization unit (111) for each of bands.
摘要:
Provided is an encoding device which can obtain a sound quality preferable for auditory sense even if the number of information bits is small. The encoding device includes a shape quantization unit (111) having: a section search unit (121) which searches for a pulse for each of bands into which a predetermined search section is divided; and a whole search unit (122) which performs search for a pulse over the entire search section. The shape of an input spectrum is quantized by a small number of pulse positions and polarities. A gain quantization unit (112) calculates a gain of the pulse searched by the shape quantization unit (111) and quantizes the gain for each of the bands.
摘要:
Provided is a voice encoding device which can accurately encode a spectrum shape of a signal having a strong tonality such as a vowel. The device includes: a sub-band constituting unit (151) which divides a first layer error conversion coefficient to be encoded into M sub-bands so as to generate M sub-band conversion coefficients; a shape vector encoding unit (152) which performs encoding on each of the M sub-band conversion coefficient so as to obtain M shape encoded information and calculates a target gain of each of the M sub-band conversion coefficients; a gain vector forming unit (153) which forms one gain vector by using M target gains; a gain vector encoding unit (154) which encodes the gain vector so as to obtain gain encoded information; and a multiplexing section unit (155) which multiplexes the shape encoded information with the gain encoded information.
摘要:
Disclosed is an encoding device which can accurately specify a band having a large error among all the bands by using a small calculation amount. The device includes: a first position identification unit (201) which uses a first layer error conversion coefficient indicating an error of decoding signal for an input signal so as to search for a band having a large error in a relatively wide bandwidth in all the bands of the input signal and generates first position information indicating the identified band; a second position identification unit (202) which searches for a target frequency band having a large error in a relatively narrow bandwidth in the band identified by the first position identification unit (201) and generates second position information indicating the identified target frequency band; and an encoding unit (203) which encodes a first layer decoding error conversion coefficient contained in the target frequency band. The first position information, the second position information, and the encoding unit are transmitted to a communication partner.
摘要:
A coding device is provided with features in which optimum coding in a higher layer is flexibly carried out based on a coding result of a lower layer and a quality audio signal in limited circumstances is served to users. In this coding device, a basic layer coding unit codes an input signal to generate a basic layer information source code and outputs a linear prediction coefficient (LPC) and a quantum LPC, which are parameters calculated at coding, to an expanded layer control unit. A basic layer decoding unit decodes the basic layer information source code. An adding unit reverses a polarity of a basic layer decoded signal, adds the same to the input signal, and calculates a difference signal. The expanded layer control unit generates expanded layer mode information indicative of a coding mode in an expanded layer based on the LPC and the quantum LPC. An expanded layer coding unit codes the difference signal obtained from the adding unit under control of the expanded layer control unit.
摘要:
A coding device is provided with features in which optimum coding in a higher layer is flexibly carried out based on a coding result of a lower layer and a quality audio signal in limited circumstances is served to users. In this coding device, a basic layer coding unit codes an input signal to generate a basic layer information source code and outputs a linear prediction coefficient (LPC) and a quantum LPC, which are parameters calculated at coding, to an expanded layer control unit. A basic layer decoding unit decodes the basic layer information source code. An adding unit reverses a polarity of a basic layer decoded signal, adds the same to the input signal, and calculates a difference signal. The expanded layer control unit generates expanded layer mode information indicative of a coding mode in an expanded layer based on the LPC and the quantum LPC. An expanded layer coding unit codes the difference signal obtained from the adding unit under control of the expanded layer control unit.
摘要:
A voice encoding device accurately encodes a spectrum shape of a signal having a strong tonality such as a vowel. The device includes: a sub-band divider which divides a first layer error conversion coefficient to be encoded into M sub-bands so as to generate M sub-band conversion coefficients; a shape vector encoder which performs encoding on each of the M sub-band conversion coefficients so as to obtain M shape encoded information and calculates a target gain of each of the M sub-band conversion coefficients; a gain vector former which forms one gain vector by using M target gains; a gain vector encoder which encodes the gain vector so as to obtain gain encoded information; and a multiplexer which multiplexes the shape encoded information with the gain encoded information.