专利检索 ap:("Sanjeev Mehrotra" OR "Kishore Kotteri" OR "Bharath Siravara" OR "Thomas W. Holcomb" OR "Hui Gao" OR "Serge Smirnov") AND inv:"Sanjeev Mehrotra" 第 2 页

11.

发明申请
Multi-Channel Hole-Filling For Audio Compression 审中-公开
标题翻译：用于音频压缩的多通道孔填充

公开(公告)号：US20090210222A1

公开(公告)日：2009-08-20

申请号：US12032119

申请日：2008-02-15

申请人： Sanjeev Mehrotra , Hui Gao , Kazuhito Kioshida , Chao He

发明人： Sanjeev Mehrotra , Hui Gao , Kazuhito Kioshida , Chao He

IPC分类号： G10L19/00

CPC分类号： G10L19/008 , G10L19/0204

摘要： Multi-channel hole-filling for audio compression is disclosed. Channel dependency groups (CDGs) are explicitly extracted based on channel transform information. Holes are detected within each CDG for each bark, and a CDG hole is identified as requiring filling as a particular section of frequency bandwidth larger than a predetermined hole bandwidth threshold and with all zero-value coefficients in all channels after quantizing. Bark weights are adjusted by multiplying the original bark weights with one calculated scalar so as to remove each detected CDG hole.

摘要翻译： 公开了用于音频压缩的多通道孔填充。基于信道变换信息明确提取频道依赖组（CDG）。在每个CDG内为每个树皮检测孔，并且将CDG孔识别为需要填充作为大于预定孔带宽阈值的频率带宽的特定部分，并且在量化之后具有所有通道中的所有零值系数。通过将原始树皮重量乘以一个计算标量来调整树皮重量，以便去除每个检测到的CDG孔。

12.

发明授权
Bitstream syntax for multi-process audio decoding 有权
标题翻译：多进程音频解码的比特流语法

公开(公告)号：US08645146B2

公开(公告)日：2014-02-04

申请号：US13595939

申请日：2012-08-27

申请人： Kazuhito Koishida , Sanjeev Mehrotra , Chao He , Wei-Ge Chen

发明人： Kazuhito Koishida , Sanjeev Mehrotra , Chao He , Wei-Ge Chen

IPC分类号： G10L19/00

CPC分类号： G10L19/167 , G10L19/002 , G10L19/008 , G10L19/022 , G10L19/03 , G10L19/038 , G10L19/04 , G10L19/24

摘要： An audio decoder provides a combination of decoding components including components implementing base band decoding, spectral peak decoding, frequency extension decoding and channel extension decoding techniques. The audio decoder decodes a compressed bitstream structured by a bitstream syntax scheme to permit the various decoding components to extract the appropriate parameters for their respective decoding technique.

摘要翻译： 音频解码器提供包括实现基带解码，频谱峰值解码，频率扩展解码和信道扩展解码技术的组件的解码组件的组合。音频解码器解码由比特流语法方案构成的压缩比特流，以允许各种解码组件为它们各自的解码技术提取适当的参数。

13.

发明授权
Efficient coding of digital media spectral data using wide-sense perceptual similarity 有权
标题翻译：使用广义感知相似性对数字媒体光谱数据进行高效编码

公开(公告)号：US08645127B2

公开(公告)日：2014-02-04

申请号：US12324689

申请日：2008-11-26

申请人： Sanjeev Mehrotra , Wei-Ge Chen

发明人： Sanjeev Mehrotra , Wei-Ge Chen

IPC分类号： G10L11/04

CPC分类号： G10L19/0208 , G10L19/0204 , G10L19/035

摘要： Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.

摘要翻译： 传统的音频编码器可以通过编码少于所有频谱系数来节省编码比特率，这可以在重建中产生模糊的低通声音。使用广义感知相似性的音频编码器通过编码被忽略的频谱系数的感知相似版本来提高质量，表示为已编码频谱的缩放版本。省略的频谱系数被划分为多个子带。子带被编码为两个参数：比例因子，其可以表示频带中的能量; 以及形状参数，其可以表示带的形状。形状参数可以是指向已编码频谱的一部分的运动矢量的形式，固定码本中的频谱形状的索引或随机噪声向量。因此，编码有效地表示在解码时要复制的类似形状的频谱部分的缩放版本。

14.

发明申请
ADAPTIVE BANDWIDTH ESTIMATION 有权
标题翻译：自适应带宽估计

公开(公告)号：US20130114421A1

公开(公告)日：2013-05-09

申请号：US13288968

申请日：2011-11-04

申请人： Tin Qian , Jin Li , Tanner M. Hodgeson , Sanjeev Mehrotra , Jiannan Zheng , Timothy M. Moore

发明人： Tin Qian , Jin Li , Tanner M. Hodgeson , Sanjeev Mehrotra , Jiannan Zheng , Timothy M. Moore

IPC分类号： H04L12/26

CPC分类号： H04L43/0829 , H04L43/0852 , H04L43/16

摘要： It can be determined whether relative one way delay for data packets in a data stream exceeds a delay threshold. If so, then a delay congestion signal indicating that the relative one way delay exceeds the delay threshold can be generated. The delay congestion signal can be used in calculating an adaptive bandwidth estimate for the data stream. A packet loss rate congestion signal may also be used in calculating the bandwidth estimate. It can be determined whether a data stream of data packets is in a contention state. If the data stream is in the contention state, then an adaptive bandwidth estimate can be calculated for the data stream using a first bandwidth estimation technique. If the data stream is not in the contention state, then the bandwidth estimate for the data stream can be calculated using a second bandwidth estimation technique.

摘要翻译： 可以确定数据流中的数据分组的相对单向延迟是否超过延迟阈值。如果是，则可以产生指示相对单向延迟超过延迟阈值的延迟拥塞信号。延迟拥塞信号可用于计算数据流的自适应带宽估计。丢包率拥塞信号也可用于计算带宽估计。可以确定数据包的数据流是否处于争用状态。如果数据流处于竞争状态，则可以使用第一带宽估计技术对数据流计算自适应带宽估计。如果数据流不处于竞争状态，则可以使用第二带宽估计技术来计算数据流的带宽估计。

15.

发明授权
Low complexity decoder for complex transform coding of multi-channel sound 有权
标题翻译：低复杂度解码器，用于多声道声音的复杂变换编码

公开(公告)号：US08046214B2

公开(公告)日：2011-10-25

申请号：US11767457

申请日：2007-06-22

申请人： Sanjeev Mehrotra , Wei-Ge Chen

发明人： Sanjeev Mehrotra , Wei-Ge Chen

IPC分类号： G10L19/00

CPC分类号： G10L19/008

摘要： A multi-channel audio decoder provides a reduced complexity processing to reconstruct multi-channel audio from an encoded bitstream in which the multi-channel audio is represented as a coded subset of the channels along with a complex channel correlation matrix parameterization. The decoder translates the complex channel correlation matrix parameterization to a real transform that satisfies the magnitude of the complex channel correlation matrix. The multi-channel audio is derived from the coded subset of channels via channel extension processing using a real value effect signal and real number scaling.

摘要翻译： 多声道音频解码器提供了一种降低复杂度的处理，从编码比特流重建多声道音频，其中多声道音频被表示为频道的编码子集以及复信道相关矩阵参数化。解码器将复信道相关矩阵参数化转换为满足复信道相关矩阵幅度的实数变换。多声道音频通过使用实数值效应信号和实数缩放的信道扩展处理从编码的信道子集导出。

16.

发明授权
Adaptive vector Huffman coding and decoding based on a sum of values of audio data symbols 有权
标题翻译：基于音频数据符号值的和的自适应向量霍夫曼编码和解码

公开(公告)号：US07822601B2

公开(公告)日：2010-10-26

申请号：US12122553

申请日：2008-05-16

申请人： Sanjeev Mehrotra , Wei-Ge Chen

发明人： Sanjeev Mehrotra , Wei-Ge Chen

IPC分类号： G10L19/00 , H03M7/38 , H03M7/40

CPC分类号： G10L19/032 , H03M7/40 , H03M7/4006 , H03M7/4093 , H03M7/46

摘要： An audio encoder performs entropy encoding of audio data. For example, an audio encoder determines a Huffman code from a Huffman code table to use for encoding a vector of audio data symbols, where the determining is based on a sum of values of the audio data symbols. An audio decoder performs corresponding entropy decoding.

摘要翻译： 音频编码器执行音频数据的熵编码。例如，音频编码器从霍夫曼码表确定霍夫曼码，以用于对音频数据符号的矢量进行编码，其中该确定基于音频数据符号的值之和。音频解码器执行相应的熵解码。

17.

发明授权
Coding of sparse digital media spectral data 有权
标题翻译：稀疏数字媒体光谱数据编码

公开(公告)号：US07774205B2

公开(公告)日：2010-08-10

申请号：US11764108

申请日：2007-06-15

申请人： Kazuhito Koishida , Sanjeev Mehrotra , Wei-Ge Chen

发明人： Kazuhito Koishida , Sanjeev Mehrotra , Wei-Ge Chen

IPC分类号： G10L21/04

CPC分类号： G10L19/02 , G10L19/0212 , G10L19/032 , G10L19/18

摘要： An audio encoder/decoder provides efficient compression of spectral transform coefficient data characterized by sparse spectral peaks. The audio encoder/decoder applies a temporal prediction of the frequency position of spectral peaks. The spectral peaks in the transform coefficients that are predicted from those in a preceding transform coding block are encoded as a shift in frequency position from the previous transform coding block and two non-zero coefficient levels. The prediction may avoid coding very large zero-level transform coefficient runs as compared to conventional run length coding. For spectral peaks not predicted from those in a preceding transform coding block, the spectral peaks are encoded as a value trio of a length of a run of zero-level spectral transform coefficients, and two non-zero coefficient levels.

摘要翻译： 音频编码器/解码器提供以稀疏频谱峰值为特征的频谱变换系数数据的有效压缩。音频编码器/解码器对频谱峰值的频率位置进行时间预测。从前一变换编码块中预测的变换系数中的频谱峰值被编码为来自先前变换编码块和两个非零系数电平的频率位置的移位。与常规游程长度编码相比，预测可以避免编码非常大的零电平变换系数运行。对于未在前面的变换编码块中预测的频谱峰值，频谱峰值被编码为零电平频谱变换系数的行程的长度和两个非零系数电平的三值。

18.

发明申请
ENTROPY CODING BY ADAPTING CODING BETWEEN LEVEL AND RUN LENGTH/LEVEL MODES 有权

公开(公告)号：US20080262855A1

公开(公告)日：2008-10-23

申请号：US12122553

申请日：2008-05-16

申请人： Sanjeev Mehrotra , Wei-Ge Chen

发明人： Sanjeev Mehrotra , Wei-Ge Chen

IPC分类号： G10L19/00

CPC分类号： G10L19/032 , H03M7/40 , H03M7/4006 , H03M7/4093 , H03M7/46

摘要： An audio encoder performs adaptive entropy encoding of audio data. For example, an audio encoder switches between variable dimension vector Huffman coding of direct levels of quantized audio data and run-level coding of run lengths and levels of quantized audio data. The encoder can use, for example, context-based arithmetic coding for coding run lengths and levels. The encoder can determine when to switch between coding modes by counting consecutive coefficients having a predominant value (e.g., zero). An audio decoder performs corresponding adaptive entropy decoding.

19.

发明申请
Selectively using multiple entropy models in adaptive coding and decoding 有权
标题翻译：在自适应编码和解码中选择性地使用多个熵模型

公开(公告)号：US20070016418A1

公开(公告)日：2007-01-18

申请号：US11183266

申请日：2005-07-15

申请人： Sanjeev Mehrotra , Wei-Ge Chen

发明人： Sanjeev Mehrotra , Wei-Ge Chen

IPC分类号： G10L15/00

CPC分类号： G10L19/0017 , H03M7/40 , H04N19/13 , H04N19/46

摘要： Techniques and tools for selectively using multiple entropy models in adaptive coding and decoding are described herein. For example, for multiple symbols, an audio encoder selects an entropy model from a first model set that includes multiple entropy models. Each of the multiple entropy models includes a model switch point for switching to a second model set that includes one or more entropy models. The encoder processes the multiple symbols using the selected entropy model and outputs results. Techniques and tools for generating entropy models are also described.

摘要翻译： 本文描述了在自适应编码和解码中选择性地使用多个熵模型的技术和工具。例如，对于多个符号，音频编码器从包括多个熵模型的第一模型集合中选择熵模型。多个熵模型中的每一个包括用于切换到包括一个或多个熵模型的第二模型集合的模型切换点。编码器使用所选择的熵模型处理多个符号并输出结果。还描述了用于生成熵模型的技术和工具。

20.

发明申请
Modification of codewords in dictionary used for efficient coding of digital media spectral data 有权
标题翻译：用于数字媒体频谱数据有效编码的字典中的码字的修改

公开(公告)号：US20070016414A1

公开(公告)日：2007-01-18

申请号：US11183084

申请日：2005-07-15

申请人： Sanjeev Mehrotra , Wei-Ge Chen , Kazuhito Koishida

发明人： Sanjeev Mehrotra , Wei-Ge Chen , Kazuhito Koishida

IPC分类号： G10L19/00

CPC分类号： G10L19/038 , G10L19/24

摘要： Coding of spectral data by representing certain portions of the spectral data as a scaled version of a code-vector, where the code-vector is chosen from either a fixed predetermined codebook or a codebook taken from a baseband. Various optional features are described for modifying the code-vectors in the codebook according to some rules which allow the code-vector to better represent the data they are modeling. The code-vector modification comprises a linear or non-linear transform of one or more code-vectors, such as, by exponentiation, negation, reversing, or combining elements from plural code-vectors.

摘要翻译： 通过将频谱数据的某些部分表示为编码矢量的缩放版本来编码频谱数据，其中从固定预定码本或从基带获取的码本中选择码矢量。描述了各种可选特征，用于根据允许代码矢量更好地表示其正在建模的数据的一些规则来修改码本中的码矢量。码矢量修改包括一个或多个码矢量的线性或非线性变换，例如通过取幂，否定，反转或从多个码矢量组合元素。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类