摘要:
To alleviate degradation of sound quality which may be caused by pre-echoes and bit starvation. An acoustic analyzer analyzes an audio signal to calculate perceptual entropy indicating how many bits are required for quantization. A coded bit count monitor monitors the number of coded bits produced from the audio signal and calculates the number of available bits for the current frame. Based on the combination of the perceptual entropy and the number of available bits, a frame division number determiner determines a division number N for dividing a frame of the audio signal into N blocks. An orthogonal transform processor divides a frame by the determined division number and subjects each divided block of the audio signal to an orthogonal transform process, thereby obtaining orthogonal transform coefficients. A quantizer quantizes the orthogonal transform coefficients on a divided block basis.
摘要:
An audio decoding method includes: acquiring, from encoded audio data, a reception audio signal and first auxiliary decoded audio information; calculating coefficient information from the first auxiliary decoded audio information; generating a decoded output audio signal based on the coefficient information and the reception audio signal; decoding to result in a decoded audio signal based on the first auxiliary decoded audio signal and the reception audio signal; calculating, from the decoded audio signal, second auxiliary decoded audio information corresponding to the first auxiliary decoded audio information; detecting a distortion caused in a decoding operation of the decoded audio signal by comparing the second auxiliary decoded audio information with the first auxiliary decoded audio information; correcting the coefficient information in response to the detected distortion; and supplying the corrected coefficient information as the coefficient information when generating the decoded output audio signal.
摘要:
When creating SBR data in a the low-resolution mode, an encoding device divides a high-frequency component of input audio data being encoded by SBR method into a high-frequency band and a low-frequency band, and calculates an average high-frequency power value that indicates the average value of the power in the high-frequency band of the audio data, as well as an average low-frequency power value that indicates the average value of the power in the low-frequency band of the audio data. The encoding device then compares the average high-frequency power value and the average low-frequency power value, selecting the smaller of the two. The encoding device then corrects the power of the high-frequency component of the signal being encoded by the SBR method so that it equals the selected average power value.
摘要:
An audio information processing apparatus and method include dividing an audio signal, determining a time period having a power change ratio of an audio signal larger than a first threshold value as an attack candidate, searching the time period of the attack candidate and a time period immediately before the time period of the attack candidate for an attack starting point, correcting a power of an audio signal included in the time period, and determining whether a power change ratio of the audio signal included in the time period is larger than a second threshold value for attack detection which is larger than the first threshold value.
摘要:
An encoding apparatus compresses a stereo signal using a sum signal and a difference signal of a left component signal and a right component signal of the stereo signal. The encoding apparatus includes a calculating unit that calculates complexity of the sum signal and complexity of the difference signal; a setting unit that sets, based on the complexity, an allocation rate of bits to be allocated in quantizing the sum signal and the difference signal; and a quantizing unit that quantizes the sum signal and the difference signal based on the allocation rate.
摘要:
According to an aspect of an embodiment, a method for regenerating an audio signal including a low frequency component and a high frequency component by decoding a coded data including a first coded data and a second coded data, the method comprising the steps of: generating the low frequency component; generating the high frequency component; determining whether the low frequency component has transient characteristics or not; generating a low frequency correction component by removing a stationary component when the audio signal has the transient characteristics; generating a corrected high frequency component by correcting the high-frequency component on the basis of the duration of the low frequency correction component when the audio signal has the transient characteristics; and regenerating the audio signal by synthesizing the low frequency component with the corrected high-frequency component.
摘要:
When creating SBR data in a the low-resolution mode, an encoding device divides a high-frequency component of input audio data being encoded by SBR method into a high-frequency band and a low-frequency band, and calculates an average high-frequency power value that indicates the average value of the power in the high-frequency band of the audio data, as well as an average low-frequency power value that indicates the average value of the power in the low-frequency band of the audio data. The encoding device then compares the average high-frequency power value and the average low-frequency power value, selecting the smaller of the two. The encoding device then corrects the power of the high-frequency component of the signal being encoded by the SBR method so that it equals the selected average power value.
摘要:
To alleviate degradation of sound quality which may be caused by pre-echoes and bit starvation. An acoustic analyzer analyzes an audio signal to calculate perceptual entropy indicating how many bits are required for quantization. A coded bit count monitor monitors the number of coded bits produced from the audio signal and calculates the number of available bits for the current frame. Based on the combination of the perceptual entropy and the number of available bits, a frame division number determiner determines a division number N for dividing a frame of the audio signal into N blocks. An orthogonal transform processor divides a frame by the determined division number and subjects each divided block of the audio signal to an orthogonal transform process, thereby obtaining orthogonal transform coefficients. A quantizer quantizes the orthogonal transform coefficients on a divided block basis.
摘要:
A decoded sound analysis unit (104) calculates, regarding the frequency-region stereo signals L(b) and R(b) decoded by the PS decoding unit (103), a second degree of similarity (109) and a second intensity difference (110) from the decoded sound signals. A spectrum correction unit (105) detects a distortion added by the parametric-stereo conversion by comparing the second degree of similarity (109) and the second intensity difference (110) calculated at the decoding side with the first degree of similarity (107) and the first intensity difference (108) calculated and transmitted from the encoding side, and corrects the spectrum of the frequency-region stereo decoded signals L(b) and R(b).
摘要:
An audio information processing apparatus and method include dividing an audio signal, determining a time period having a power change ratio of an audio signal larger than a first threshold value as an attack candidate, searching the time period of the attack candidate and a time period immediately before the time period of the attack candidate for an attack starting point, correcting a power of an audio signal included in the time period, and determining whether a power change ratio of the audio signal included in the time period is larger than a second threshold value for attack detection which is larger than the first threshold value.