摘要:
To provide a bandwidth extension method which allows reduction of computation amount in bandwidth extension and suppression of deterioration of quality in the bandwidth to be extended. In the bandwidth extension method: a low frequency bandwidth signal is transformed into a QMF domain to generate a first low frequency QMF spectrum; pitch-shifted signals are generated by applying different shifting factors on the low frequency bandwidth signal; a high frequency QMF spectrum is generated by time-stretching the pitch-shifted signals in the QMF domain; the high frequency QMF spectrum is modified; and the modified high frequency QMF spectrum is combined with the first low frequency QMF spectrum.
摘要:
To provide a bandwidth extension method which allows reduction of computation amount in bandwidth extension and suppression of deterioration of quality in the bandwidth to be extended. In the bandwidth extension method: a low frequency bandwidth signal is transformed into a QMF domain to generate a first low frequency QMF spectrum; pitch-shifted signals are generated by applying different shifting factors on the low frequency bandwidth signal; a high frequency QMF spectrum is generated by time-stretching the pitch-shifted signals in the QMF domain; the high frequency QMF spectrum is modified; and the modified high frequency QMF spectrum is combined with the first low frequency QMF spectrum.
摘要:
To provide an audio signal processing apparatus which can perform, with low operation amount, audio signal processing that is either time stretch and/or compression processing or frequency modulation processing. The audio signal processing apparatus is intended to transform an input audio signal sequence using a predetermined adjustment factor. The audio signal processing apparatus includes a filter bank (2601) which transforms the input audio signal sequence into Quadrature Mirror Filter (QMF) coefficients using a filter for Quadrature Mirror Filter analysis (a QMF analysis filter) and an adjusting unit (2602) which adjusts the QMF coefficients based on a predetermined adjustment factor.
摘要:
Provided is an encoding device (1) including: a pitch contour analysis unit (101) which detects information, a dynamic time-warping unit (102) which generates, based on the information, pitch change ratios (Tw_ratio in FIG. 18) within a range (86) including a range (86a) of the pitch change ratios corresponding to absolute pitch differences of 42 cents or larger; a first lossless coding unit (103) which codes the generated pitch parameters (102x); a time-warping unit (104) which shifts a pitch of a signal according to the information; and a second encoding unit which codes a signal (104x) obtained by the shifting.
摘要:
Provided is an encoding device (1) including: a pitch contour analysis unit (101) which detects information, a dynamic time-warping unit (102) which generates, based on the information, pitch change ratios (Tw_ratio in FIG. 18) within a range (86) including a range (86a) of the pitch change ratios corresponding to absolute pitch differences of 42 cents or larger; a first lossless coding unit (103) which codes the generated pitch parameters (102x); a time-warping unit (104) which shifts a pitch of a signal according to the information; and a second encoding unit which codes a signal (104x) obtained by the shifting.
摘要:
Provided are a new hybrid audio decoder and a new hybrid audio encoder having block switching for speech signals and audio signals. Currently, very low bitrate audio coding methods for speech and audio signal are proposed. These audio coding methods cause very long delay. Generally, in coding an audio signal, algorithm delay tends to be long to achieve higher frequency resolution. In coding a speech signal, the delay needs to be reduced because the speech signal is used for telecommunication. To balance fine coding quality for these two kinds of input signals with very low bitrate, this invention provides a combination of a low delay filter bank like AAC-ELD and a CELP coding method.
摘要:
A new hybrid audio decoder and a new hybrid audio encoder having block switching for speech signals and audio signals are provided. Currently, very low bitrate audio coding methods for speech and audio signals are proposed. These audio coding methods cause very long delays. Generally, in coding an audio signal, an algorithm delay tends to be long to achieve higher frequency resolution. In coding a speech signal, the delay needs to be reduced because the speech signal is used for telecommunication. To balance fine coding quality for speech and audio input signals with very low bitrate, a combination of a low delay filter bank like AAC-ELD and a CELP coding method is provided.
摘要:
To provide an audio signal processing apparatus which can perform, with low operation amount, audio signal processing that is either time stretch and/or compression processing or frequency modulation processing. The audio signal processing apparatus is intended to transform an input audio signal sequence using a predetermined adjustment factor. The audio signal processing apparatus includes a filter bank (2601) which transforms the input audio signal sequence into Quadrature Mirror Filter (QMF) coefficients using a filter for Quadrature Mirror Filter analysis (a QMF analysis filter) and an adjusting unit (2602) which adjusts the QMF coefficients based on a predetermined adjustment factor.
摘要:
A coding device includes: a pitch contour detection unit which detects a pitch contour of an input audio signal; a dynamic time warping unit which determines the number of pitch nodes based on the pitch contour and generates a first time warping parameter including information indicating the determined number of pitch nodes, a pitch change position, and a pitch change ratio; a first encoder which codes the first time warping parameter; a time warping unit which corrects pitch, using the information obtained from the first time warping parameter, to approximate the pitches of the number of pitch nodes to a predetermined reference value; a second encoder which codes the input audio signal at the corrected pitch; and a multiplexer which multiplexes the coded time warping parameter and the coded audio signal to generate a bitstream.
摘要:
A coding device includes: a pitch contour detection unit which detects a pitch contour of an input audio signal; a dynamic time warping unit which determines the number of pitch nodes based on the pitch contour and generates a first time warping parameter including information indicating the determined number of pitch nodes, a pitch change position, and a pitch change ratio; a first encoder which codes the first time warping parameter; a time warping unit which corrects pitch, using the information obtained from the first time warping parameter, to approximate the pitches of the number of pitch nodes to a predetermined reference value; a second encoder which codes the input audio signal at the corrected pitch; and a multiplexer which multiplexes the coded time warping parameter and the coded audio signal to generate a bitstream.