Information processing for retrieving coded audiovisual data
    1.
    发明授权
    Information processing for retrieving coded audiovisual data 有权
    用于检索编码的视听数据的信息处理

    公开(公告)号:US06532445B1

    公开(公告)日:2003-03-11

    申请号:US09397234

    申请日:1999-09-16

    IPC分类号: G10L1508

    摘要: In order to efficiently retrieve AV data by using signal characteristics as retrieval conditions, in a first step, a comparison and determination section computes a correlation coefficient (degree of similarity) of a spectrum coefficient of coded audio data and a spectrum coefficient of a sample waveform, and extracts correlation coefficients such that the value of the computed spectrum coefficient is larger than a threshold value which is set in the first step, and assumes them to be retrieval results. In a second step, the comparison and determination section determines whether or not the retrieval result is satisfactory. When it is determined that the number of pieces of audio data retrieved in the first step is equal to or greater than the predetermined threshold value and the retrieval result is not satisfactory, the process proceeds to a third step. In the third step, the comparison and determination section determines whether or not the number of frequency bands of the sample waveform, which is the retrieval conditions, is less than its maximum value. When it is determined that the number of frequency bands is less than its maximum value, in a fourth step, the number of frequency bands of the waveform signal, which is the retrieval conditions, is incremented by 1, and the process returns to the first step.

    摘要翻译: 为了通过使用信号特性作为检索条件有效地检索AV数据,在第一步骤中,比较和确定部分计算编码音频数据的频谱系数和样本波形的频谱系数的相关系数(相似度) 并且提取相关系数,使得所计算的频谱系数的值大于在第一步骤中设置的阈值,并且假设它们是检索结果。 在第二步骤中,比较确定部分确定检索结果是否令人满意。 当确定在第一步骤中检索的音频数据的数量等于或大于预定阈值并且检索结果不令人满意时,处理进行到第三步骤。 在第三步骤中,比较确定部分确定作为检索条件的采样波形的频带数是否小于其最大值。 当确定频带的数量小于其最大值时,在第四步骤中,作为检索条件的波形信号的频带数增加1,并且处理返回到第一 步。

    Decoding apparatus and method, encoding apparatus and method, and program
    2.
    发明授权
    Decoding apparatus and method, encoding apparatus and method, and program 有权
    解码装置和方法,编码装置和方法以及程序

    公开(公告)号:US08972249B2

    公开(公告)日:2015-03-03

    申请号:US13634658

    申请日:2011-03-15

    IPC分类号: G10L19/02 G10L21/038

    CPC分类号: G10L21/038 G10L19/0212

    摘要: The present invention relates to a decoding apparatus, a decoding method, an encoding apparatus, an encoding method, and programs that can shorten the delay time caused by the band extension at the time of decoding, and restrain increases in resources on the decoding side.A higher frequency component generating unit (73) generates a pseudo higher frequency spectrum by using a lower frequency spectrum (SP-L) and a higher frequency envelope (ENV-H). A phase randomizing unit (74) randomizes the phase of the pseudo higher frequency spectrum, based on a random flag (RND). An inverse MDCT unit (75) denormalizes the lower frequency spectrum (SP-L) by using a lower frequency envelope (ENV-L), and combines the pseudo higher frequency spectrum supplied from the phase randomizing unit (74) with the denormalized lower frequency spectrum (SP-L). The combination result is used as the spectrum of the entire band. The present invention can be applied to a decoding apparatus that performs band extension decoding, for example.

    摘要翻译: 解码装置,解码方法,编码装置,编码方法和程序技术领域本发明涉及可以缩短在解码时由频带扩展引起的延迟时间的解码装置,解码方法,编码装置,编码方法和程序,并且抑制解码侧的资源增加。 较高频率分量产生单元(73)通过使用较低频谱(SP-L)和较高频率包络(ENV-H)产生伪较高频谱。 相位随机化单元(74)基于随机标记(RND)使伪较高频谱的相位随机化。 逆MDCT单元(75)通过使用较低频率包络(ENV-L)对较低频谱(SP-L)进行非归一化,并且将从相位随机化单元(74)提供的伪高频频谱与非归一化较低频率 光谱(SP-L)。 组合结果用作整个频带的频谱。 本发明可以应用于例如进行频带扩展解码的解码装置。

    Noise shaping for predictive audio coding apparatus
    3.
    发明授权
    Noise shaping for predictive audio coding apparatus 有权
    预测音频编码装置的噪声整形

    公开(公告)号:US08311816B2

    公开(公告)日:2012-11-13

    申请号:US12639676

    申请日:2009-12-16

    IPC分类号: G10L21/02 G10L19/00

    CPC分类号: H04B14/068 H03M7/3046

    摘要: An information coding apparatus includes a predictive signal generator that generates a predictive signal; a predictive residual signal generator that generates a predictive residual signal; a quantizer that quantizes a quantization input signal generated based on the predictive residual signal; a quantization error signal generator that generates a quantization error signal; a feedback signal generator that generates a feedback signal for controlling the frequency characteristic of the quantization noise after decoding based on the quantization error signal; and a quantization input signal generator that generates the quantization input signal. The feedback signal generator is configured by a pole-zero filter that includes a filter coefficient of an all-pole filter which is based on spectral envelope information estimated by the input audio signal, a parameter for adjusting a peak level in the frequency characteristic of the quantization noise caused by the all-pole filter, and the predictive filter coefficient.

    摘要翻译: 一种信息编码装置,包括产生预测信号的预测信号发生器; 产生预测残差信号的预测残差信号发生器; 量化器,其量化基于所述预测残差信号产生的量化输入信号; 产生量化误差信号的量化误差信号发生器; 反馈信号发生器,其基于量化误差信号生成用于控制解码之后的量化噪声的频率特性的反馈信号; 以及产生量化输入信号的量化输入信号发生器。 反馈信号发生器由极零滤波器构成,该极零滤波器包括基于由输入音频信号估计的频谱包络信息的全极滤波器的滤波器系数,用于调整频率特性中的峰值电平的参数 由全极滤波器引起的量化噪声和预测滤波器系数。

    DECODING DEVICE, DECODING METHOD, AND PROGRAM
    4.
    发明申请
    DECODING DEVICE, DECODING METHOD, AND PROGRAM 有权
    解码设备,解码方法和程序

    公开(公告)号:US20120026861A1

    公开(公告)日:2012-02-02

    申请号:US13191216

    申请日:2011-07-26

    IPC分类号: H04J11/00

    CPC分类号: H04L27/2649 H04L27/2639

    摘要: A decoding device includes an acquisition unit configured to acquire a first frequency signal including a narrowband signal and a wideband signal, a direct inverse orthogonal transform unit configured to perform a direct matrix operation with respect to the narrowband signal of the first frequency signal so as to perform inverse orthogonal transform, and a high-speed inverse orthogonal transform unit configured to perform inverse orthogonal transform employing a high-speed operation method with respect to the wideband signal of the first frequency signal.

    摘要翻译: 解码装置包括:获取单元,被配置为获取包括窄带信号和宽带信号的第一频率信号;直接逆正交变换单元,被配置为对所述第一频率信号的窄带信号执行直接矩阵运算,以便 执行逆正交变换,以及高速逆正交变换单元,被配置为使用相对于第一频率信号的宽带信号的高速运算方法执行逆正交变换。

    Information retrieving method and apparatus
    5.
    发明授权
    Information retrieving method and apparatus 有权
    信息检索方法和装置

    公开(公告)号:US07747435B2

    公开(公告)日:2010-06-29

    申请号:US12075872

    申请日:2008-03-15

    IPC分类号: G10L17/00

    CPC分类号: G10L17/00

    摘要: A speaker of encoded speech data recorded in a semiconductor storage device in an IC recorder is to be retrieved easily. An information receiving unit 10 in a speaker retrieval apparatus 1 reads out the encoded speech data recorded in a semiconductor storage device 107 in an IC recorder 100. A speech decoding unit 12 decodes the encoded speech data. A speaker frequency detection unit 13 discriminates the speaker based on a feature of the speech waveform decoded to find the frequency of conversation (frequency of occurrence) of the speaker in a preset time interval. A speaker frequency graph displaying unit 14 displays the speaker frequency on a picture as a two-dimensional graph having time and the frequency as two axes. A speech reproducing unit 16 reads out the portion of the encoded speech data corresponding to a time position or a time range specified by a reproducing position input unit 15 based on this two-dimensional graph from the storage device 11 and decodes the read-out data to output the decoded data to a speech outputting unit 17.

    摘要翻译: 记录在IC记录器中的半导体存储装置中的编码语音数据的扬声器容易被检索。 扬声器检索装置1中的信息接收单元10读出记录在IC记录器100中的半导体存储装置107中的编码语音数据。语音解码单元12对编码的语音数据进行解码。 扬声器频率检测单元13基于解码的语音波形的特征来区分扬声器,以在预设的时间间隔内找到说话者的会话频率(出现频率)。 扬声器频率图显示单元14将作为具有时间和频率的二维图形的图像上的扬声器频率显示为两个轴。 语音再现单元16基于来自存储装置11的二维图形读出对应于由再现位置输入单元15指定的时间位置或时间范围的编码语音数据的部分,并对读出的数据进行解码 以将解码的数据输出到语音输出单元17。

    Apparatus for performing speaker identification and speaker searching in speech or sound image data, and method thereof
    6.
    发明授权
    Apparatus for performing speaker identification and speaker searching in speech or sound image data, and method thereof 有权
    用于在语音或声音图像数据中执行说话者识别和说话者搜索的装置及其方法

    公开(公告)号:US07315819B2

    公开(公告)日:2008-01-01

    申请号:US10201069

    申请日:2002-07-23

    IPC分类号: G10L17/00

    CPC分类号: G10L17/02 G10L17/00 G10L19/04

    摘要: A process of identifying a speaker in coded speech data and a process of searching for the speaker are efficiently performed with fewer computations and with a smaller storage capacity. In an information search apparatus, an LSP decoding section extracts and decodes only LSP information from coded speech data which is read for each block. An LPC conversion section converts the LSP information into LPC information. A Cepstrum conversion section converts the obtained LPC information into an LPC Cepstrum which represents features of speech. A vector quantization section performs vector quantization on the LPC Cepstrum. A speaker identification section identifies a speaker on the basis of the result of the vector quantization. Furthermore, the identified speaker is compared with a search condition in a condition comparison section, and based on the result, the search result is output.

    摘要翻译: 通过较少的计算和较小的存储容量来有效地执行识别编码语音数据中的扬声器的处理和搜索扬声器的处理。 在信息搜索装置中,LSP解码部仅从针对每个块读取的编码语音数据提取并解码LSP信息。 LPC转换部将LSP信息转换为LPC信息。 倒谱转换部分将获得的LPC信息转换成表示语音特征的LPC倒频谱。 矢量量化部分对LPC倒频谱进行矢量量化。 扬声器识别部根据矢量量化的结果识别扬声器。 此外,将所识别的扬声器与条件比较部分中的搜索条件进行比较,并且基于该结果,输出搜索结果。

    Decoding device, decoding method, and program
    7.
    发明授权
    Decoding device, decoding method, and program 有权
    解码设备,解码方法和程序

    公开(公告)号:US08976642B2

    公开(公告)日:2015-03-10

    申请号:US13191216

    申请日:2011-07-26

    IPC分类号: H04J11/00 H04L27/26

    CPC分类号: H04L27/2649 H04L27/2639

    摘要: A decoding device includes an acquisition unit configured to acquire a first frequency signal including a narrowband signal and a wideband signal, a direct inverse orthogonal transform unit configured to perform a direct matrix operation with respect to the narrowband signal of the first frequency signal so as to perform inverse orthogonal transform, and a high-speed inverse orthogonal transform unit configured to perform inverse orthogonal transform employing a high-speed operation method with respect to the wideband signal of the first frequency signal.

    摘要翻译: 解码装置包括:获取单元,被配置为获取包括窄带信号和宽带信号的第一频率信号;直接逆正交变换单元,被配置为对所述第一频率信号的窄带信号执行直接矩阵运算,以便 执行逆正交变换,以及高速逆正交变换单元,被配置为使用相对于第一频率信号的宽带信号的高速运算方法执行逆正交变换。

    Decoding device, decoding method, and program for generating a substitute signal when an error has occurred during decoding
    8.
    发明授权
    Decoding device, decoding method, and program for generating a substitute signal when an error has occurred during decoding 有权
    解码装置,解码方法以及在解码时发生错误时产生替代信号的程序

    公开(公告)号:US08812927B2

    公开(公告)日:2014-08-19

    申请号:US13301542

    申请日:2011-11-21

    IPC分类号: H03M13/00 H04J11/00 H04L27/00

    摘要: A decoding device including a decoding unit which decodes encoded data, an inverse orthogonal transformation unit which performs inverse orthogonal transformation for the encoded data and obtains a time series waveform element in a unit of blocks, a correlation calculation unit which obtains a correlation between a time series waveform element of a block arranged immediately before an error block which is a block in which an error has occurred during decoding by the decoding unit and a time series waveform element of a block arranged a predetermined number of blocks before the block, a cycle calculation unit which obtains a basic cycle of a block unit of the error block based on the correlation obtained by the correlation calculation unit, and a generation unit which generates a substitute signal of the time series waveform element of the error block.

    摘要翻译: 一种解码装置,包括对编码数据进行解码的解码单元,对编码数据执行逆正交变换的逆正交变换单元,以块为单位求出时间序列波形要素;相关计算单元, 紧接在作为在解码单元进行解码期间发生错误的块的错误块之前布置的块的串联波形元素以及在块前排列预定数量的块的块的时间序列波形元素,周期计算 单元,其基于由相关计算单元获得的相关性获得误差块的块单位的基本周期;以及生成单元,其生成误差块的时间序列波形元素的替代信号。

    Apparatus and method for encoding and decoding of audio data using a rounding off unit which eliminates residual sign bit without loss of precision
    9.
    发明授权
    Apparatus and method for encoding and decoding of audio data using a rounding off unit which eliminates residual sign bit without loss of precision 有权
    使用舍入单元对音频数据进行编码和解码的装置和方法,其消除残余符号位而不损失精度

    公开(公告)号:US08566105B2

    公开(公告)日:2013-10-22

    申请号:US11459513

    申请日:2006-07-24

    摘要: A method and apparatus for encoding audio data and a method and apparatus for decoding audio data, which can generate and decode, respectively, scalable lossless streams and which can shorten the time necessary to generate and decode lossless streams. A lossy-core encoder unit performs lossy compression on an input audio signal, generating a core stream. A simplified lossy-core decoding unit decodes only spectral signals of a specified band, e.g., a lower frequency band to generate a lossy decoded audio signal. A subtracter subtracts a lossy decoded audio signal from the input audio signal delayed to generate a residual signal. A rounding-off unit performs a process of rounding off the number of bits constituting the residual signal by eliminating the residual sign bit without loss of precision. A lossless-enhance encoder unit performs lossless compression on the residual signal to generate an enhanced stream. A stream-combining unit combines the core stream and the enhanced stream to generate a scalable lossless stream.

    摘要翻译: 用于对音频数据进行编码的方法和装置以及用于对音频数据进行解码的方法和装置,其可以分别生成和解码可伸缩的无损流,并且可以缩短生成和解码无损流所需的时间。 有损核心编码器单元对输入音频信号执行有损压缩,产生核心流。 简化的有损码解码单元仅解码指定频带(例如较低频带)的频谱信号,以产生有损解码音频信号。 减法器从延迟的输入音频信号中减去有损解码音频信号以产生残留信号。 四舍五入单元通过在不损失精度的情况下消除残留符号位来执行舍弃构成残留信号的位数的处理。 无损增强编码器单元对残余信号执行无损压缩以产生增强的流。 流合并单元组合核心流和增强流,以生成可伸缩的无损流。

    ENCODING DEVICE AND ENCODING METHOD, DECODING DEVICE AND DECODING METHOD, AND PROGRAM
    10.
    发明申请
    ENCODING DEVICE AND ENCODING METHOD, DECODING DEVICE AND DECODING METHOD, AND PROGRAM 有权
    编码设备和编码方法,解码设备和解码方法以及程序

    公开(公告)号:US20130006647A1

    公开(公告)日:2013-01-03

    申请号:US13583994

    申请日:2011-03-08

    IPC分类号: G10L19/00

    CPC分类号: G10L19/035 G10L19/0212

    摘要: The present invention relates to an encoding device and an encoding method, a decoding device and a decoding method, and a program that reduce deterioration of sound quality due to encoding of audio signals.An envelope emphasis part (51) emphasizes an envelope (ENV). A noise shaping part (52) divides an emphasized envelope (D) formed by emphasis of the envelope (ENV) by a value larger than 1, and subtracts noise shaping (G) specified by information (NS) from a result of the division. A quantization part (14) sets a result of the subtraction as a quantization bit count (WL), and quantizes a normalized spectrum (S1) formed by normalization of a spectrum (S0) based on the quantization bit count (WL). A multiplexing part (53) multiplexes the information (NS), a quantized spectrum (QS) formed by quantization of the normalized spectrum (S1), and the envelope (ENV). The present invention can be applied to an encoding device encoding audio signals, for example.

    摘要翻译: 编码装置和编码方法,解码装置和解码方法技术领域本发明涉及一种减少音频信号编码导致的音质劣化的程序。 信封重点部分(51)强调信封(ENV)。 噪声整形部分(52)将由包络(ENV)的强调形成的强调包络(D)除以大于1的值,并从分割结果中减去由信息(NS)指定的噪声整形(G)。 量化部分(14)将减法的结果设置为量化位计数(WL),并且通过基于量化位计数(WL)对通过频谱归一化形成的归一化频谱(S1)进行量化。 复用部分(53)复用信息(NS),通过归一化频谱(S1)的量化形成的量化频谱(QS)和信封(ENV)。 例如,本发明可以应用于编码音频信号的编码装置。