AUDIO ENCODING SYSTEM
    1.
    发明申请
    AUDIO ENCODING SYSTEM 审中-公开
    音频编码系统

    公开(公告)号:WO2008022564A1

    公开(公告)日:2008-02-28

    申请号:PCT/CN2007/002489

    申请日:2007-08-17

    Inventor: YOU, Yuli

    CPC classification number: G10L19/025 G10L19/008 G10L19/038

    Abstract: Provided are, among other things, systems, methods and techniques for encoding an audio signal, in which is obtained a sampled audio signal which has been divided into frames. The location of a transient within one of the frames is identified, and transform data samples are generated by performing multi-resolution filter bank analysis on the frame data, including filtering at different resolutions for different portions of the frame that includes the transient. Quantization data are generated by quantizing the transform data samples using variable numbers of bits based on a psychoacoustical model, and the quantization data are grouped into variable-length segments based on magnitudes of the quantization data. A code book is assigned to each of the variable-length segments, and the quantization data in each of the variable-length segments are encoded using the code book assigned to such variable-length segment.

    Abstract translation: 除其他之外,提供了用于编码音频信号的系统,方法和技术,其中获得已被划分为帧的采样音频信号。 识别一帧内的瞬态位置,并通过对帧数据执行多分辨率滤波器组分析来生成变换数据样本,包括对包括瞬态的帧的不同部分的不同分辨率进行滤波。 通过使用基于心理声学模型的可变位数量化变换数据采样来生成量化数据,并且基于量化数据的量值将量化数据分组成可变长度段。 代码本被分配给每个可变长度段,并且使用分配给这种可变长度段的代码簿来对每个可变长度段中的量化数据进行编码。

    VARIABLE-RESOLUTION PROCESSING OF FRAME-BASED DATA
    2.
    发明申请
    VARIABLE-RESOLUTION PROCESSING OF FRAME-BASED DATA 审中-公开
    基于框架数据的可变分辨率处理

    公开(公告)号:WO2008022566A1

    公开(公告)日:2008-02-28

    申请号:PCT/CN2007/002491

    申请日:2007-08-17

    Inventor: YOU, Yuli

    CPC classification number: G10L19/00 G10L19/008 G10L19/025 G10L19/038

    Abstract: Provide are systems, methods and techniques for processing frame-based data. A frame of data, an indication that a transient occurs within the frame, and a location of the transient within the frame are obtained. Based on the indication f the transient, a block size is set for the frame, thereby effectively defining a plurality of equal-sized blocks with the frame. In addition, different window functions are selected for efferent ones of the plurality of equal-sized blocks based on the location of the transient, and the framed of data is processed by applying the selected window functions.

    Abstract translation: 提供用于处理基于帧的数据的系统,方法和技术。 获得一帧数据,在帧内发生瞬态的指示,以及该帧内瞬态的位置。 基于瞬态的指示,为帧设置块大小,从而有效地定义与帧相等的多个块。 此外,基于瞬态的位置,针对多个等大小块中的传出选择不同的窗口函数,并且通过应用所选择的窗函数来处理数据帧。

    APPARATUS AND METHODS FOR MULTICHANNEL DIGITAL AUDIO CODING
    3.
    发明申请
    APPARATUS AND METHODS FOR MULTICHANNEL DIGITAL AUDIO CODING 审中-公开
    多通道数字音频编码的装置和方法

    公开(公告)号:WO2006030289A1

    公开(公告)日:2006-03-23

    申请号:PCT/IB2005/002724

    申请日:2005-09-14

    Inventor: YOU, Yuli

    CPC classification number: G10L19/025 G10L19/008 G10L19/032

    Abstract: A low bit rate digital audio coding system includes an encoder which assigns codebooks to groups of quantization indexes based on their local properties resulting in codebook application ranges that are independent of block quantization boundaries. The invention also incorporates a resolution filter bank, or a tri-mode resolution filter bank, which is selectively switchable between high and low frequency resolution modes or high, low and intermediate modes such as when detecting transient in a frame. The result is a multichannel audio signal having a significantly lower bit rate for efficient transmission or storage. The decoder is essentially an inverse of the structure and methods of the encoder, and results in a reproduced audio signal that cannot be audibly distinguished from the original signal.

    Abstract translation: 低比特率数字音频编码系统包括:编码器,其基于其本地属性将码本分配给量化索引组,导致独立于块量化边界的码本应用范围。 本发明还包括分辨率滤波器组或三模式分辨率滤波器组,其可以在高频和低频分辨率模式或高,低和中间模式之间选择性地切换,例如当检测到帧中的瞬态时。 结果是具有用于有效传输或存储的显着较低比特率的多声道音频信号。 解码器基本上是编码器的结构和方法的逆,并且导致不能与原始信号可听区分的再现音频信号。

    AUDIO SIGNAL TRANSIENT DETECTION
    4.
    发明申请
    AUDIO SIGNAL TRANSIENT DETECTION 审中-公开
    音频信号瞬态检测

    公开(公告)号:WO2009144564A3

    公开(公告)日:2010-01-14

    申请号:PCT/IB2009005737

    申请日:2009-05-27

    Inventor: YOU YULI

    Abstract: Provided are, among other things, systems, methods and techniques for detecting whether a transient exists within an audio signal. According to one representative embodiment, a segment of a digital audio signal is divided into blocks, and a norm value is calculated for each of a number of the blocks, resulting in a set of norm values for such blocks, each such norm value representing a measure of signal strength within a corresponding block. A maximum norm value is then identified across such blocks, and a test criterion is applied to the norm values. If the test criterion is not satisfied, a first signal indicating that the segment does not include any transient is output, and if the test criterion is satisfied, a second signal indicating that the segment includes a transient is output. According to this embodiment, the test criterion involves a comparison of the maximum norm value to a different second maximum norm value, subject to a specified constraint, within the segment.

    Abstract translation: 除其他之外,提供用于检测音频信号内是否存在瞬态的系统,方法和技术。 根据一个代表性实施例,数字音频信号的一段被划分为多个块,并为多个块中的每个块计算范数值,得到这样的块的一组范数值,每个这样的标准值表示一个 测量相应块内的信号强度。 然后在这些块之间识别最大范数值,并将测试标准应用于范数值。 如果测试标准不满足,则输出指示该段不包括任何瞬态的第一信号,并且如果满足测试标准,则输出指示该段包括瞬态的第二信号。 根据本实施例,测试标准包括在段内对最大范数值与不同的第二最大范数值进行比较,受限于规定的约束。

    AUDIO DECODING
    5.
    发明申请
    AUDIO DECODING 审中-公开
    音频解码

    公开(公告)号:WO2008022565A1

    公开(公告)日:2008-02-28

    申请号:PCT/CN2007/002490

    申请日:2007-08-17

    Inventor: YOU, Yuli

    Abstract: Provided are, among other things, systems, methods and techniques for decoding an audio signal from a frame-based bit stream. Each frame includes processing information pertaining to the frame and entropy-encoded quantization indexes representing audio data within the frame. The processing information includes: (i) code book indexes, (ii) code book application information specifying ranges of entropy-encoded quantization indexes to which the code books are to be applied, and (iii) window information. The entropy-encoded quantization indexes are decoded by applying the identified code books to the corresponding ranges of entropy-encoded quantization indexes. Subband samples are then generated by dequantizing the decoded quantization indexes, and a sequence of different window functions that were applied within a single frame of the audio data is identified based on the window information. Time-domain audio data are obtained by inverse-transforming the subband samples and using the plural different window functions indicated by the window information.

    Abstract translation: 尤其提供了用于从基于帧的比特流解码音频信号的系统,方法和技术。 每帧包括与该帧有关的处理信息和表示该帧内的音频数据的熵编码的量化索引。 处理信息包括:(i)码本索引,(ii)指定要应用码本的熵编码量化索引的范围的码本应用信息,以及(iii)窗口信息。 熵编码的量化索引通过将所识别的码本应用于熵编码的量化索引的对应范围来解码。 然后通过对解码的量化索引进行去量化来生成子带样本,并且基于窗口信息来识别应用于音频数据的单个帧内的不同窗口函数的序列。 时域音频数据通过逆变换子带样本并使用由窗口信息指示的多个不同的窗函数来获得。

Patent Agency Ranking