SPEECH/AUDIO ENCODING APPARATUS AND METHOD THEREOF
    1.
    发明申请
    SPEECH/AUDIO ENCODING APPARATUS AND METHOD THEREOF 审中-公开
    语音/音频编码装置及其方法

    公开(公告)号:US20170076728A1

    公开(公告)日:2017-03-16

    申请号:US15358184

    申请日:2016-11-22

    Abstract: A speech/audio encoding device for selectively allocating bits for higher precision encoding. The speech/audio encoding device receives a time-domain speech/audio input signal, transforms the speech/audio input signal into a frequency domain, and quantizes an energy envelope corresponding to an energy level for a frequency spectrum of the speech/audio input signal. The speech/audio encoding device further groups quantized energy envelopes into a plurality of groups, determines a perceptual significant group including one or more significant bands and a local-peak frequency, and allocates bits to a plurality of subbands corresponding to the grouped quantized energy envelopes, in which each of the subbands is obtained by splitting the frequency spectrum of the speech/audio input signal. The speech/audio encoding device encodes the frequency spectrum using the bits allocated to the subbands.

    Abstract translation: 用于选择性地分配比特以用于更高精度编码的语音/音频编码装置。 语音/音频编码装置接收时域语音/音频输入信号,将语音/音频输入信号变换成频域,并量化对应于语音/音频输入信号的频谱的能级的能量包络 。 语音/音频编码装置进一步将量化的能量包络分组成多个组,确定包括一个或多个有效频带和局部峰值频率的感知有效组,并将比特分配给对应于分组的量化能量包络的多个子带 ,其中通过分割语音/音频输入信号的频谱来获得每个子带。 语音/音频编码设备使用分配给子带的比特对频谱进行编码。

    ENCODING APPARATUS AND ENCODING METHOD
    2.
    发明申请
    ENCODING APPARATUS AND ENCODING METHOD 有权
    编码装置和编码方法

    公开(公告)号:US20160203825A1

    公开(公告)日:2016-07-14

    申请号:US15079524

    申请日:2016-03-24

    Abstract: A threshold amplitude is calculated for each subband obtained by splitting an extension band. For each subband, an amplitude of transform coefficients is compared with the threshold amplitude to extract a transform coefficient having an amplitude larger than the threshold amplitude as a representative transform coefficient. When a number of the extracted representative transform coefficients is less than a predetermined number, the threshold amplitude is updated in accordance with an amount by which the number of the representative transform coefficients is less than the predetermined number. A transform coefficient is extracted again using the updated threshold amplitude. For each of the subbands, a value of correlation is calculated between the representative transform coefficient and a normalized core encoded low-band transform coefficient. A subband having a largest value of correlation is selected when the number of the extracted representative transform coefficients reaches the predetermined number.

    Abstract translation: 对通过分割扩展频带获得的每个子带计算阈值振幅。 对于每个子带,将变换系数的幅度与阈值幅度进行比较,以提取具有大于阈值振幅的幅度的变换系数作为代表变换系数。 当提取的代表性变换系数的数量小于预定数量时,根据代表变换系数的数量小于预定数量的量来更新阈值幅度。 使用更新的阈值振幅再次提取变换系数。 对于每个子带,在代表变换系数和归一化的核心编码低频带变换系数之间计算相关值。 当提取的代表变换系数的数量达到预定数量时,选择具有最大相关值的子带。

    ENCODING APPARATUS, DECODING APPARATUS, AND METHODS

    公开(公告)号:US20190198035A1

    公开(公告)日:2019-06-27

    申请号:US16290321

    申请日:2019-03-01

    CPC classification number: G10L19/265 G10L19/0204 G10L21/0388

    Abstract: A coding apparatus includes a processor and a memory that stores instructions, which when executed causes the processor to perform operations, including encoding a first band of an input audio signal to be a first spectrum and dividing the first spectrum into a plurality of sub-bands. The operations also include searching a largest amplitude value of the divided first spectrum in each of the plurality of sub-bands, and normalizing the divided first spectrum in each of the plurality of sub-bands. The operations further include emphasizing a harmonic structure in the normalized first spectrum, and searching a best band that has a largest correlation value between each divided band of a second band spectrum and the emphasized first spectrum in which the harmonic structure is emphasized, and encoding the second band spectrum using lag information identifying the best band and transmitting the lag information to a decoder side.

    AUDIO/SPEECH ENCODING APPARATUS AND METHOD, AND AUDIO/SPEECH DECODING APPARATUS AND METHOD

    公开(公告)号:US20190122682A1

    公开(公告)日:2019-04-25

    申请号:US16225851

    申请日:2018-12-19

    Abstract: An audio/speech encoding method is provided that includes transforming a time domain input signal to a frequency spectrum, and dividing the frequency spectrum to a plural of bands. The method also includes calculating a level of energies for each band, quantizing the energies for the each band, and calculating differential indices. The method additionally includes modifying a range of the differential indices for the Nth band when N is an integer of 2 or more, and replacing the differential index with the modified differential index, and not modifying a range of the differential indices for the Nth band when N is an integer of 1. The method further includes encoding the differential indices using a Huffman table selected based on a minimum value and a maximum value of the differential indices, and transmitting the encoded differential indices and a flag signal for indicating the selected Huffman table.

    VOICE AUDIO ENCODING DEVICE, VOICE AUDIO DECODING DEVICE, VOICE AUDIO ENCODING METHOD, AND VOICE AUDIO DECODING METHOD
    6.
    发明申请
    VOICE AUDIO ENCODING DEVICE, VOICE AUDIO DECODING DEVICE, VOICE AUDIO ENCODING METHOD, AND VOICE AUDIO DECODING METHOD 有权
    声音音频编码设备,语音音频解码设备,语音音频编码方法和语音音频解码方法

    公开(公告)号:US20150317991A1

    公开(公告)日:2015-11-05

    申请号:US14650093

    申请日:2013-11-26

    CPC classification number: G10L19/0204 G10L19/035

    Abstract: Provided are a voice audio encoding device, voice audio decoding device, voice audio encoding method, and voice audio decoding method that efficiently perform bit distribution and improve sound quality. Dominant frequency band identification unit identifies a dominant frequency band having a norm factor value that is the maximum value within the spectrum of an input voice audio signal. Dominant group determination units and non-dominant group determination unit group all sub-bands into a dominant group that contains the dominant frequency band and a non-dominant group that contains no dominant frequency band. Group bit distribution unit distributes bits to each group on the basis of the energy and norm variance of each group. Sub-band bit distribution unit redistributes the bits that have been distributed to each group to each sub-band in accordance with the ratio of the norm to the energy of the groups.

    Abstract translation: 提供了有效地执行位分配并提高声音质量的语音音频编码装置,语音音频解码装置,语音音频编码方法和语音音频解码方法。 主导频带识别单元识别具有作为输入语音音频信号的频谱内的最大值的范数因子值的主频带。 优势组确定单位和非优势组确定单元将所有子带分组为包含主频带的显性组和不包含主频带的非优势组。 组位分配单元根据每个组的能量和范数方差分配每个组的位。 子带位分配单元根据标准与组的能量的比例,将已经分配给每个组的位重新分配给每个子带。

    ENCODING APPARATUS, DECODING APPARATUS, AND METHODS

    公开(公告)号:US20180158466A1

    公开(公告)日:2018-06-07

    申请号:US15843842

    申请日:2017-12-15

    CPC classification number: G10L19/265 G10L19/0204 G10L21/0388

    Abstract: A coding apparatus, including a processor that performs operations including encoding a first band of an input audio signal to be a first spectrum, dividing the first spectrum into a plurality of subbands, at equal intervals each including a predetermined number of samples for flattening the first spectrum, searching a largest amplitude value of the divided first spectrum in each of the subbands, normalizing the divided first spectrum with the largest amplitude values searched in each of the subbands, searching best bands among each normalized divided first spectrum which has a largest correlation value between each divided band of a second band spectrum and each normalized divided first spectrum, the second spectrum being higher than a predetermined frequency, and encoding the second spectrum using lag information identifying the best bands for transmitting the lag information to a decoder side.

    ENCODING APPARATUS, DECODING APPARATUS, AND METHODS

    公开(公告)号:US20170337931A1

    公开(公告)日:2017-11-23

    申请号:US15646645

    申请日:2017-07-11

    CPC classification number: G10L19/265 G10L19/0204 G10L21/0388

    Abstract: A coding apparatus encodes a first band of an input audio signal, normalizes a first spectrum included in each sub-band of the first band using a spectrum power envelope, performs a clipping process on the normalized first spectrum, the clipping process comparing between a predetermined threshold and the absolute value of an amplitude of the spectrum and replaces the amplitude value of the spectrum with the threshold if the absolute value of the amplitude of the spectrum exceeds the threshold, calculates a correlation between a spectrum in each divided band of a second band and a spectrum in a plurality of candidate bands containing the clipped normalized first spectrum, the second spectrum being higher than a predetermined frequency, identifies the best bands of the plurality of candidate bands, and encodes the second spectrum using lag information identifying the best band for transmitting the lag information to a decoder.

    ENCODING APPARATUS, DECODING APPARATUS, AND METHODS
    10.
    发明申请
    ENCODING APPARATUS, DECODING APPARATUS, AND METHODS 审中-公开
    编码设备,解码设备和方法

    公开(公告)号:US20160293178A1

    公开(公告)日:2016-10-06

    申请号:US15168805

    申请日:2016-05-31

    CPC classification number: G10L19/265 G10L19/0204 G10L21/0388

    Abstract: A coding apparatus normalizes a low-frequency spectrum included in each of sub-bands obtained from dividing a low band part, using a largest amplitude value among the low-frequency spectrum included in each sub-band, obtains a normalized low-frequency spectrum by decoding the first encoded data, and calculates a correlation between each divided band of a high-frequency spectrum and a plurality of candidate bands of the normalized low-frequency spectrum. The best bands of a plurality of candidate bands are identified, each candidate band having a starting frequency position with non-zero amplitude in the normalized low-frequency spectrum, the high-frequency spectrum being in a high band part of the input audio signal that is higher than the predetermined frequency, and the high-frequency spectrum is encoded using lag information identifying the best band for transmitting the lag information to a decoder.

    Abstract translation: 编码装置使包含在每个子带中的低频频谱中的最大振幅值,通过分割低频部分而获得的每个子频带中包括的低频频谱进行归一化,通过以下方式获得归一化的低频谱: 对第一编码数据进行解码,并且计算高频谱的每个划分频带与归一化低频频谱的多个候选频带之间的相关性。 识别多个候选频带中的最佳频带,每个候选频带具有归一化低频频谱中具有非零幅度的起始频率位置,高频频谱位于输入音频信号的高频部分中, 高于预定频率,并且使用识别用于将滞后信息发送到解码器的最佳频带的滞后信息来编码高频频谱。

Patent Agency Ranking