Bitrate constrained variable bitrate audio encoding

    公开(公告)号:US08442838B2

    公开(公告)日:2013-05-14

    申请号:US13031963

    申请日:2011-02-22

    IPC分类号: G10L19/00 G10L19/02

    CPC分类号: G10L19/035

    摘要: A hybrid audio encoding technique incorporates both ABR, or CBR, and VBR encoding modes. For each audio coding block, after a VBR quantization loop meets the NMR target, a second quantization loop might be called to adaptively control the final bitrate. That is, if the NMR-based quantization loop results in a bitrate that is not within a specified range, then a bitrate-based CBR or ABR quantization loop determines a final bitrate that is within the range and is adaptively determined based on the encoding difficulty of the audio data. Excessive bitrates from use of conventional VBR mode are eliminated, while still providing much more constant perceptual sound quality than use of conventional CBR mode can achieve.

    BITRATE CONSTRAINED VARIABLE BITRATE AUDIO ENCODING
    2.
    发明申请
    BITRATE CONSTRAINED VARIABLE BITRATE AUDIO ENCODING 有权
    双向限制可变的双极音频编码

    公开(公告)号:US20110145004A1

    公开(公告)日:2011-06-16

    申请号:US13031963

    申请日:2011-02-22

    IPC分类号: G10L19/00

    CPC分类号: G10L19/035

    摘要: A hybrid audio encoding technique incorporates both ABR, or CBR, and VBR encoding modes. For each audio coding block, after a VBR quantization loop meets the NMR target, a second quantization loop might be called to adaptively control the final bitrate. That is, if the NMR-based quantization loop results in a bitrate that is not within a specified range, then a bitrate-based CBR or ABR quantization loop determines a final bitrate that is within the range and is adaptively determined based on the encoding difficulty of the audio data. Excessive bitrates from use of conventional VBR mode are eliminated, while still providing much more constant perceptual sound quality than use of conventional CBR mode can achieve.

    摘要翻译: 混合音频编码技术包括ABR或CBR和VBR编码模式。 对于每个音频编码块,在VBR量化循环满足NMR目标之后,可以调用第二量化循环来自适应地控制最终比特率。 也就是说,如果基于NMR的量化循环导致不在特定范围内的比特率,则基于比特率的CBR或ABR量化循环确定在该范围内的最终比特率,并且基于编码难度自适应地确定 的音频数据。 消除了使用常规VBR模式的过多比特率,同时仍然提供比使用常规CBR模式可以实现更多的恒定的感知音质。

    BITRATE CONSTRAINED VARIABLE BITRATE AUDIO ENCODING
    3.
    发明申请
    BITRATE CONSTRAINED VARIABLE BITRATE AUDIO ENCODING 有权
    双向限制可变的双极音频编码

    公开(公告)号:US20100049532A1

    公开(公告)日:2010-02-25

    申请号:US12610615

    申请日:2009-11-02

    IPC分类号: G10L19/00

    CPC分类号: G10L19/035

    摘要: A hybrid audio encoding technique incorporates both ABR, or CBR, and VBR encoding modes. For each audio coding block, after a VBR quantization loop meets the NMR target, a second quantization loop might be called to adaptively control the final bitrate. That is, if the NMR-based quantization loop results in a bitrate that is not within a specified range, then a bitrate-based CBR or ABR quantization loop determines a final bitrate that is within the range and is adaptively determined based on the encoding difficulty of the audio data. Excessive bitrates from use of conventional VBR mode are eliminated, while still providing much more constant perceptual sound quality than use of conventional CBR mode can achieve.

    摘要翻译: 混合音频编码技术包括ABR或CBR和VBR编码模式。 对于每个音频编码块,在VBR量化循环满足NMR目标之后,可以调用第二量化循环来自适应地控制最终比特率。 也就是说,如果基于NMR的量化循环导致不在特定范围内的比特率,则基于比特率的CBR或ABR量化循环确定在该范围内的最终比特率,并且基于编码难度自适应地确定 的音频数据。 消除了使用常规VBR模式的过多比特率,同时仍然提供比使用常规CBR模式可以实现更多的恒定的感知音质。

    Enhanced audio decoder
    4.
    发明授权
    Enhanced audio decoder 有权
    增强音频解码器

    公开(公告)号:US08515768B2

    公开(公告)日:2013-08-20

    申请号:US12551450

    申请日:2009-08-31

    IPC分类号: G10L19/00

    CPC分类号: G10L19/24

    摘要: Methods, systems, and apparatus are presented for decoding an audio signal that includes bandwidth extension data. An audio signal that includes core audio data and bandwidth extension data can be received in a decoder. The core audio data can be associated with a core portion of an audio signal, such as the frequency range below a cutoff frequency, and the bandwidth extension data can be associated with an extended portion of the audio signal, such as a frequency range above the cutoff frequency. The core audio data can be decoded to generate a decoded core audio signal in a time domain representation. Further, an extended portion of the audio signal can be reconstructed in accordance with extension data and decoded core audio signal. Additionally, the decoded core audio signal can be lowpass filtered and the extended portion can be highpass filtered before being combined to generate a decoded output signal.

    摘要翻译: 呈现用于对包括带宽扩展数据的音频信号进行解码的方法,系统和装置。 可以在解码器中接收包括核心音频数据和带宽扩展数据的音频信号。 核心音频数据可以与诸如低于截止频率的频率范围的音频信号的核心部分相关联,并且带宽扩展数据可以与音频信号的扩展部分相关联,例如高于 截止频率。 核心音频数据可以被解码以在时域表示中产生解码的核心音频信号。 此外,可以根据扩展数据和解码的核心音频信号来重构音频信号的扩展部分。 此外,解码的核心音频信号可以被低通滤波,并且扩展部分可以在被组合之前被高通滤波以产生解码的输出信号。

    System and method of retrieving a watermark within a signal
    5.
    发明授权
    System and method of retrieving a watermark within a signal 失效
    在信号中检索水印的系统和方法

    公开(公告)号:US07802101B2

    公开(公告)日:2010-09-21

    申请号:US12414602

    申请日:2009-03-30

    CPC分类号: G10L19/018

    摘要: A system and method of retrieving a watermark in a watermarked signal are disclosed. The watermarked signal comprises odd and even overlapped blocks where the watermark is contained in the even blocks. The method comprises, for each k-th even block, subtracting the two adjacent odd numbered blocks from the k-th even block of the watermarked signal to retrieve s *k(n), transforming s *k(n) into the frequency domain to generate S k(f), calculating a phase of S k(f) as φ (f) and a phase of Sk(f) as φ(f), calculating the difference Ψ (f) between φ (f) and φ(f), unwrapping Ψ (f) to obtain the phase modulation {tilde over (Φ)} k(f), and using a Viterbi search to retrieve the watermark embedded in {tilde over (Φ)} k(f).

    摘要翻译: 公开了一种在水印信号中检索水印的系统和方法。 水印信号包括奇偶重叠块,其中水印包含在偶数块中。 该方法包括对于每个第k个偶数块,从水印信号的第k个偶数块中减去两个相邻的奇数块,以检索s * k(n),将s * k(n)变换为频域 以产生S k(f),计算S k(f)的相位为&phgr; (f)和Sk(f)的阶段,(f),计算&phgr;(f)之间的差Ψ(f) (f)和(f),展开Ψ(f)以获得相位调制(())(k)(f)的波形,并使用维特比搜索来检索嵌入在{ k(f)。

    SYSTEM AND METHOD FOR DEPLOYING FILTERS FOR PROCESSING SIGNALS
    6.
    发明申请
    SYSTEM AND METHOD FOR DEPLOYING FILTERS FOR PROCESSING SIGNALS 有权
    用于处理信号的滤波器的系统和方法

    公开(公告)号:US20090180645A1

    公开(公告)日:2009-07-16

    申请号:US12396732

    申请日:2009-03-03

    IPC分类号: H02B1/00

    CPC分类号: G10L19/03

    摘要: A system, method and computer-readable medium are disclosed for using filters signal processing. The system includes a module that calculates a filter for each of a plurality of frequency bands, a module that groups the filters into a plurality of groups, a module that determines a representative filter for each group of the plurality of groups and a module that uses the representative filter of each group for frequency bands of the each group. The filters are temporal noise shaping filters (TNS) filters.

    摘要翻译: 公开了一种使用滤波器信号处理的系统,方法和计算机可读介质。 该系统包括为多个频带中的每一个计算滤波器的模块,将滤波器分组为多个组的模块,确定多个组中的每个组的代表性滤波器的模块以及使用 每个组的频带的代表性滤波器。 滤波器是时间噪声整形滤波器(TNS)滤波器。

    System and method of watermarking a signal
    7.
    发明授权
    System and method of watermarking a signal 有权
    信号水印的系统和方法

    公开(公告)号:US07451319B1

    公开(公告)日:2008-11-11

    申请号:US11553133

    申请日:2006-10-26

    IPC分类号: H04L9/00 H04K1/02 H04N7/167

    摘要: A system and method of generating a watermarked signal are disclosed. The system segments the signal into overlapping blocks using a window function and processes the overlapping blocks according to whether each block is odd- or even-numbered. The system windows the odd-numbered blocks, modulates the phase of each block in the frequency domain, transforms each modulated block in the time domain, windows each block transformed into the time domain and overlap-adds each odd-numbered block with each even-numbered block to generate the watermarked signal.

    摘要翻译: 公开了一种产生水印信号的系统和方法。 该系统使用窗口功能将信号分割成重叠块,并根据每个块是奇数还是偶数编号来处理重叠块。 系统对奇数块进行窗口调制,对频域中每个块的相位进行调制,对时域中的每个调制块进行变换,将每个块变换为时域,将每个奇数块与每个偶数块相加, 编号块生成水印信号。

    Method for enhancing connected and degraded text recognition
    8.
    发明授权
    Method for enhancing connected and degraded text recognition 失效
    增强连接和降级文本识别的方法

    公开(公告)号:US5559902A

    公开(公告)日:1996-09-24

    申请号:US251676

    申请日:1994-05-31

    摘要: The present invention provides a method and apparatus for enhancing and recognizing connected and degraded text. The enhancement process comprises filtering a scanned image to determine whether a binary image value of an image pixel should be complemented, determining whether complementing the value of the pixel reduces the sharpness of wedge-like figures in the image, and complementing the binary value of the pixel when doing so does not reduce sharpness. The recognition process may comprise determining primitive strokes in a scanned image, segmenting the scanned image into sub-character segments based on the primitive strokes, identifying features which characterize the sub-character segments, and comparing identified features to stochastic models of known characters and determining an optimum sequence of known characters based on the comparisons through the use of Viterbi scoring and level building procedures.

    摘要翻译: 本发明提供一种用于增强和识别连接和退化文本的方法和装置。 增强处理包括对扫描图像进行滤波以确定图像像素的二进制图像值是否应被补充,确定是否补充像素的值降低图像中的楔形图形的清晰度,并且补充图像的二进制值 这样做时,像素不会降低锐度。 识别过程可以包括确定扫描图像中的原始笔画,基于原始笔划将扫描图像分割成子字符段,识别表征子字符段的特征,以及将识别的特征与已知字符的随机模型进行比较并确定 基于通过使用维特比评分和水平建立程序进行比较的已知角色的最佳顺序。

    ADAPTING MASKING THRESHOLDS FOR ENCODING AUDIO DATA
    9.
    发明申请
    ADAPTING MASKING THRESHOLDS FOR ENCODING AUDIO DATA 有权
    适应编码音频数据的屏蔽功能

    公开(公告)号:US20120016679A1

    公开(公告)日:2012-01-19

    申请号:US13244542

    申请日:2011-09-25

    IPC分类号: G10L19/00

    CPC分类号: G10L19/025

    摘要: According to one embodiment, an improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds calculated for the 8 short blocks corresponding to the long block are calculated. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.

    摘要翻译: 根据一个实施例,改进的音频编码技术使用长块编码具有低频瞬态信号的音频,但是具有一组适配的屏蔽阈值。 在识别包含低频瞬态信号的音频窗口时,可以照常计算长块的掩蔽阈值。 计算对应于长块的8个短块计算的一组掩蔽阈值。 基于为短块计算的阈值来适应低频临界频带的掩蔽阈值,并且使用所产生的适应屏蔽阈值对长音频数据块进行编码。 结果是具有丰富的谐波含量的编码音频和由低频瞬态信号产生的可忽略的编码器噪声。

    System and method for switching between a first filter and a second filter for a received audio signal
    10.
    发明授权
    System and method for switching between a first filter and a second filter for a received audio signal 有权
    用于在接收到的音频信号的第一滤波器和第二滤波器之间切换的系统和方法

    公开(公告)号:US07970604B2

    公开(公告)日:2011-06-28

    申请号:US12396732

    申请日:2009-03-03

    IPC分类号: G10L19/00

    CPC分类号: G10L19/03

    摘要: System, method and computer-readable medium are disclosed for using filters signal processing. The system includes a module that receives information regarding a first filter, a module that receives information regarding a second filter, and a module that receives date to indicate switching between the first filter and the second filter across the spectrum of the received audio signal, and a module that processes the received audio signal according to the received data and switching between the first filter and the second filter, wherein at least one of the first filter and the second filter represent a merger of two initial filters.

    摘要翻译: 公开了使用滤波器信号处理的系统,方法和计算机可读介质。 该系统包括接收关于第一滤波器的信息的模块,接收关于第二滤波器的信息的模块以及接收日期以在接收到的音频信号的频谱上指示在第一滤波器和第二滤波器之间切换的模块的模块,以及 模块,其根据所接收的数据处理所接收的音频信号,并且在所述第一滤波器和所述第二滤波器之间切换,其中所述第一滤波器和所述第二滤波器中的至少一个表示两个初始滤波器的合并。