Bit allocating, audio encoding and decoding
    12.
    Invention application
    Legal status: in force

    Publication number: US20170061971A1

    Publication date: 2017-03-02

    Application number: US15330779

    Filing date: 2016-11-07

    CPC classification number: G10L19/002 G10L19/0204 G10L19/028 G10L19/032

    Abstract: A bit allocating method is provided that includes determining the allocated number of bits in decimal point units based on each frequency band so that a Signal-to-Noise Ratio (SNR) of a spectrum existing in a predetermined frequency band is maximized within a range of the allowable number of bits for a given frame; and adjusting the allocated number of bits based on each frequency band.
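
    The two steps named in the abstract (a fractional per-band allocation that maximizes SNR under a bit budget, followed by an adjustment) can be illustrated with a minimal sketch. It is not taken from the patent: it assumes equal-width bands, the textbook high-resolution model in which distortion falls by 6 dB per bit (reverse water-filling), and, as one plausible form of "adjusting", rounding to whole bits by largest remainder. The function names `allocate_fractional` and `adjust_to_integers` are hypothetical.

```python
import math

def allocate_fractional(energies, budget):
    """Fractional bits per band via reverse water-filling (assumed model).

    energies: per-band spectral energies (variances); budget: total bits.
    Bands whose trial allocation is negative are dropped and the budget is
    redistributed over the remaining bands.
    """
    active = [i for i, e in enumerate(energies) if e > 0]
    bits = [0.0] * len(energies)
    while active:
        mean_log = sum(math.log2(energies[i]) for i in active) / len(active)
        share = budget / len(active)
        trial = {i: share + 0.5 * (math.log2(energies[i]) - mean_log)
                 for i in active}
        if all(b >= 0 for b in trial.values()):
            for i, b in trial.items():
                bits[i] = b
            break
        active = [i for i in active if trial[i] >= 0]
    return bits

def adjust_to_integers(bits, budget):
    """Assumed adjustment step: floor each band, then hand the leftover
    bits to the bands with the largest fractional remainders."""
    floors = [math.floor(b) for b in bits]
    leftover = budget - sum(floors)
    order = sorted(range(len(bits)),
                   key=lambda i: bits[i] - floors[i], reverse=True)
    for i in order[:leftover]:
        floors[i] += 1
    return floors

fractional = allocate_fractional([16.0, 4.0, 1.0, 1.0], budget=8)
whole = adjust_to_integers(fractional, budget=8)
```

    In this sketch the fractional stage always spends the full budget, so the adjustment stage only has to redistribute the rounding remainder; the patent's actual adjustment rule may differ.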

    Method, medium, and apparatus encoding and/or decoding multichannel audio signals
    13.
    Granted patent
    Legal status: in force

    Publication number: US09570082B2

    Publication date: 2017-02-14

    Application number: US14629839

    Filing date: 2015-02-24

    CPC classification number: G10L19/008 G10L19/00

    Abstract: A method, medium, and apparatus for encoding and/or decoding a multichannel audio signal. The method includes: detecting the type of spatial extension data included in an encoding result of an audio signal; if the spatial extension data is data indicating a core audio object type related to a technique of encoding core audio data, detecting the core audio object type; decoding the core audio data by using a decoding technique according to the detected core audio object type; if the spatial extension data is residual coding data, decoding the residual coding data by using the decoding technique according to the core audio object type; and up-mixing the decoded core audio data by using the decoded residual coding data. According to the method, the core audio data and residual coding data may be decoded by using an identical decoding technique, thereby reducing complexity at the decoding end.

    Method and apparatus for encoding and decoding audio/speech signal
    14.
    Granted patent
    Legal status: in force

    Publication number: US09418666B2

    Publication date: 2016-08-16

    Application number: US14132224

    Filing date: 2013-12-18

    CPC classification number: G10L19/00 G10L19/025

    Abstract: Provided is a method of encoding an audio/speech signal, the method including determining a variable length of a frame, that is, a processing unit of an input signal in accordance with a position of an attack in the input signal; transforming each frame of the input signal to a frequency domain and dividing the frame into a plurality of sub frequency bands; and, if a signal of a sub frequency band is determined to be encoded in the frequency domain, encoding the signal of the sub frequency band in the frequency domain, and if the signal of the sub frequency band is determined to be encoded in a time domain, inverse transforming the signal of the sub frequency band to the time domain and encoding the inverse transformed signal in the time domain. According to the present invention, the audio/speech signal may be efficiently encoded by controlling time resolution and frequency resolution.
