Abstract:
A method and apparatus to determine an encoding mode of an audio signal, and a method and apparatus to encode an audio signal according to the encoding mode. In the encoding mode determination method, a mode determination threshold for the current frame that is subject to encoding mode determination is adaptively adjusted according to a long-term feature of the audio signal for a frame (the current frame) that is subject to encoding mode determination, thereby improving the hit rate of encoding mode determination and signal classification, suppressing frequent oscillation of an encoding mode in frame units, improving noise tolerance, and improving smoothness of a reconstructed audio signal.
Abstract:
A high-speed search method in a speech encoder using an order character of LSP (Line Spectrum Pair) parameters in an LSP parameter quantizer using SVQ (Split Vector Quantization) used in a low-speed transmission speech encoder, includes the steps of rearranging a codebook according to an element value of a reference row for determining a range of code vectors to be searched; and determining a search range by using an order character between a given target vector and an arranged code vector to obtain an optimal code vector. The method gives effects of reducing computational complexity required to search the codebook without signal distortion in quantizing the LSP parameters of the speech encoder using SVQ, and reducing computational complexity without loss of tone quality in G.729 fixed codebook search by performing candidate selection and search on the basis of the correlation value size of the pulse position index.
Abstract:
An apparatus to process an audio signal includes an encoder to determine a variable window size of sub bands of a frame of an audio signal, to transform the sub bands according to the variable window size from a first domain to a second domain, to quantize the transformed sub bands, and to multiplex the quantized sub bands and information on the variable window size corresponding the respective sub bands.
Abstract:
A method and apparatus to perform bandwidth extension encoding and decoding encodes and/or decodes a high frequency signal using an excitation signal for a low frequency signal encoded in a time domain or a frequency domain or using an excitation spectrum for the low frequency signal. Accordingly, although an audio signal is encoded or decoded using a small number of bits, the quality of sound corresponding to a signal in a high frequency band does not degrade. Therefore, a coding efficiency of the audio signal can be maximized.
Abstract:
An apparatus and a method for computing a Speech Absence Probability (SAP), and an apparatus and a method for removing noise by using the SAP computing device and method are provided. The provided SAP computing device for computing the SAP indicating probability that speech is absent in a mth frame, from a first through Ncth posteriori (Nc means the total number of channels) Signal to Noise Ratios (SNR) calculated with regard to the mth frame of a speech signal and a first through Ncth predicted SNRs predicted with regard to the mth frame, includes: a first through Ncth likelihood ratio generators for generating a first through Ncth likelihood ratios from the first through Ncth posterior SNRs and the first through Ncth predicted SNRs, and outputting them; a first multiplying unit for multiplying the first through Ncth likelihood ratios by a predetermined a priori probability, and outputting the multiplication results; an adding unit for adding each of the multiplication results received from the first multiplying unit to a predetermined value, and outputting the added results; a second multiplying unit for multiplying the added results received from the adding unit and outputting the multiplication result; and a inverse number calculator for calculating inverse number of the multiplication result received from the second multiplying unit and outputting the calculated inverse number as the SAP. Therefore, since the accuracy of the calculated SAP is high, noise can be efficiently removed from the speech signal that may have noise and an enhanced speech signal with an enhanced quality can be provided.
Abstract:
A coding apparatus including a base layer, a speech quality enhancement layer, and a multiplexer. The base layer filters an input speech signal using linear prediction coding and generates an excitation signal corresponding to the filtered speech signal through a fixed codebook search and an adaptive codebook search. The speech quality enhancement layer searches a fixed codebook using parameters obtained through the fixed codebook search in the base layer, or searches the fixed codebook using a target signal, which is obtained by removing a contribution of a fixed codebook of the base layer and a signal which is obtained by synthesizing and filtering a previous fixed codebook of the speech quality enhancement layer from a target signal for the fixed codebook search of the base layer. The multiplexer multiplexes signals generated by the base layer and the at least one speech quality enhancement layer.
Abstract:
A speech compression apparatus including: a first band-transform unit transforming a wideband speech signal to a narrowband low-band speech signal; a narrowband speech compressor compressing the narrowband low-band speech signal and outputting a result of the compressing as a low-band speech packet; a decompression unit decompressing the low-band speech packet and obtaining a decompressed wideband low-band speech signal; an error detection unit detecting an error signal that corresponds to a difference between the wideband speech signal and the decompressed wideband low-band speech signal; and a high-band speech compression unit compressing the error signal and a high-band speech signal of the wideband speech signal and outputting the result of the compressing as a high-band speech packet.
Abstract:
A fast search method for LSP (Linear Spectrum Pair) quantization is provided. The fast search method in accordance with an embodiment of the present invention includes the following steps. A first step is obtaining a target vector and a code vector. The target vector and the code vector are converted for ordering property. A second step is generating a code book having the ordering property for sub-matrices by utilizing the target vector and the code vector. A third step is selecting a particular line for determining a search scope in the code books and sorting the code book in descending order with respect to component values of the particular line. A fourth step is determining the search scope by utilizing the ordering property of the target vector and the sorted code vectors. The fifth step is obtaining an error standard by utilizing the target vector and the code vector, and obtaining an optimal code vector by utilizing the error standard within the determined search scope.
Abstract:
An adaptive time/frequency-based encoding mode determination apparatus including a time domain feature extraction unit to generate a time domain feature by analysis of a time domain signal of an input audio signal, a frequency domain feature extraction unit to generate a frequency domain feature corresponding to each frequency band generated by division of a frequency domain corresponding to a frame of the input audio signal into a plurality of frequency domains, by analysis of a frequency domain signal of the input audio signal, and a mode determination unit to determine any one of a time-based encoding mode and a frequency-based encoding mode, with respect to the each frequency band, by use of the time domain feature and the frequency domain feature.
Abstract:
Provided are a method and apparatus for encoding and decoding an audio signal. According to the present application, a signal of a high frequency band above a preset frequency band is adaptively encoded or decoded in the time domain or in the frequency domain by using a signal of a low frequency band below the preset frequency band. As such, the sound quality of a high frequency signal is not deteriorate even when an audio signal is encoded or decoded by using a small number of bits and thus coding efficiency may be maximized.