Patent search: ap:("Sanjeev Mehrotra" OR "Kishore Kotteri") AND inv:"Sanjeev Mehrotra" (page 1)

1.

Patent application
CHANNEL EXTENSION CODING FOR MULTI-CHANNEL SOURCE (in force)

Publication No.: US20090112606A1

Publication date: 2009-04-30

Application No.: US11925733

Filing date: 2007-10-26

Applicants: Sanjeev Mehrotra, Kishore Kotteri

Inventors: Sanjeev Mehrotra, Kishore Kotteri

IPC classification: G10L19/00

CPC classification: G10L19/008

Abstract: A multi-channel audio decoder reconstructs multi-channel audio of more than two physical channels from a reduced set of coded channels based on correlation parameters that specify a full power cross-correlation matrix of the physical channels, or merely preserve a partial correlation matrix (such as power of the physical channels, and some subset of cross-correlations between the physical channels, or cross-correlations of the physical channels with coded or virtual channels).

2.

Granted patent
Optimized client side rate control and indexed file layout for streaming media (in force)

Publication No.: US08379851B2

Publication date: 2013-02-19

Application No.: US12119364

Filing date: 2008-05-12

Applicants: Sanjeev Mehrotra, Kishore Kotteri, Bharath Siravara, Thomas W. Holcomb, Hui Gao, Serge Smirnov

Inventors: Sanjeev Mehrotra, Kishore Kotteri, Bharath Siravara, Thomas W. Holcomb, Hui Gao, Serge Smirnov

IPC classification: H04N7/167, G06F15/16

CPC classification: H04L65/607, H04L65/608, H04N21/00

Abstract: An indexed file layout, comprising index information, is defined for segmented streaming of multimedia content. The index information can comprise program description information and streaming segment index information. In addition, the layout can comprise files containing streaming segments of the program, where the streaming segments are each encoded at one or more bitrates independently of other streaming segments of the program. The layout supports client switching between different bitrates at segment boundaries. Optimized client-side rate control of streaming content can be provided by defining a plurality of states, selecting available paths based on constraint conditions, and selecting a best path through the states (e.g., based on a distortion measure). In one client-side rate control solution states correspond to a specific bitrate of a specific streaming segment, and in another client-side rate control solution states correspond to a measure of client buffer fullness.
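
The path selection described in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: it assumes a constant, known bandwidth, takes "no buffer underflow" as the only constraint condition, and searches all paths by brute force where a real client would more likely use dynamic programming. The names `feasible`, `best_path`, `sizes`, and `distortion` are hypothetical.

```python
from itertools import product

def feasible(path, sizes, seg_dur, bandwidth, start_buffer):
    """Return True if the client buffer never underflows along this path.

    path[i] is the bitrate index chosen for segment i, and sizes[i][b] is
    the size in bits of segment i at bitrate index b. The buffer level is
    measured in seconds of media. Constant bandwidth is an assumption.
    """
    buf = start_buffer
    for seg, b in enumerate(path):
        buf -= sizes[seg][b] / bandwidth  # playback drains the buffer during download
        if buf < 0:
            return False                  # constraint violated: playback would stall
        buf += seg_dur                    # the downloaded segment joins the buffer
    return True

def best_path(sizes, distortion, seg_dur, bandwidth, start_buffer):
    """Pick the feasible path with the lowest total distortion, or None."""
    n_rates = len(sizes[0])
    candidates = [
        p for p in product(range(n_rates), repeat=len(sizes))
        if feasible(p, sizes, seg_dur, bandwidth, start_buffer)
    ]
    if not candidates:
        return None
    return min(candidates,
               key=lambda p: sum(distortion[s][b] for s, b in enumerate(p)))

# Three 2-second segments, each at 1 Mbit/s (index 0) or 3 Mbit/s (index 1).
sizes = [[2_000_000, 6_000_000]] * 3
distortion = [[10, 4], [10, 3], [10, 5]]  # illustrative values only
choice = best_path(sizes, distortion, seg_dur=2.0,
                   bandwidth=2_000_000, start_buffer=2.0)
```

With these illustrative numbers `choice` is `(0, 1, 0)`: the client can afford the high bitrate for only one segment without stalling, and it spends it where the distortion reduction is largest.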

3.

Granted patent
Channel extension coding for multi-channel source (in force)

Publication No.: US08249883B2

Publication date: 2012-08-21

Application No.: US11925733

Filing date: 2007-10-26

Applicants: Sanjeev Mehrotra, Kishore Kotteri

Inventors: Sanjeev Mehrotra, Kishore Kotteri

IPC classification: G10L19/00

CPC classification: G10L19/008

Abstract: A multi-channel audio decoder reconstructs multi-channel audio of more than two physical channels from a reduced set of coded channels based on correlation parameters that specify a full power cross-correlation matrix of the physical channels, or merely preserve a partial correlation matrix (such as power of the physical channels, and some subset of cross-correlations between the physical channels, or cross-correlations of the physical channels with coded or virtual channels).

4.

Patent application
OPTIMIZED CLIENT SIDE RATE CONTROL AND INDEXED FILE LAYOUT FOR STREAMING MEDIA (in force)

Publication No.: US20090282162A1

Publication date: 2009-11-12

Application No.: US12119364

Filing date: 2008-05-12

Applicants: Sanjeev Mehrotra, Kishore Kotteri, Bharath Siravara, Thomas W. Holcomb, Hui Gao, Serge Smirnov

Inventors: Sanjeev Mehrotra, Kishore Kotteri, Bharath Siravara, Thomas W. Holcomb, Hui Gao, Serge Smirnov

IPC classification: G06F15/173

CPC classification: H04L65/607, H04L65/608, H04N21/00

Abstract: An indexed file layout, comprising index information, is defined for segmented streaming of multimedia content. The index information can comprise program description information and streaming segment index information. In addition, the layout can comprise files containing streaming segments of the program, where the streaming segments are each encoded at one or more bitrates independently of other streaming segments of the program. The layout supports client switching between different bitrates at segment boundaries. Optimized client-side rate control of streaming content can be provided by defining a plurality of states, selecting available paths based on constraint conditions, and selecting a best path through the states (e.g., based on a distortion measure). In one client-side rate control solution states correspond to a specific bitrate of a specific streaming segment, and in another client-side rate control solution states correspond to a measure of client buffer fullness.

5.

Granted patent
Bitstream syntax for multi-process audio decoding (in force)

Publication No.: US08645146B2

Publication date: 2014-02-04

Application No.: US13595939

Filing date: 2012-08-27

Applicants: Kazuhito Koishida, Sanjeev Mehrotra, Chao He, Wei-Ge Chen

Inventors: Kazuhito Koishida, Sanjeev Mehrotra, Chao He, Wei-Ge Chen

IPC classification: G10L19/00

CPC classification: G10L19/167, G10L19/002, G10L19/008, G10L19/022, G10L19/03, G10L19/038, G10L19/04, G10L19/24

Abstract: An audio decoder provides a combination of decoding components including components implementing base band decoding, spectral peak decoding, frequency extension decoding and channel extension decoding techniques. The audio decoder decodes a compressed bitstream structured by a bitstream syntax scheme to permit the various decoding components to extract the appropriate parameters for their respective decoding technique.

6.

Granted patent
Efficient coding of digital media spectral data using wide-sense perceptual similarity (in force)

Publication No.: US08645127B2

Publication date: 2014-02-04

Application No.: US12324689

Filing date: 2008-11-26

Applicants: Sanjeev Mehrotra, Wei-Ge Chen

Inventors: Sanjeev Mehrotra, Wei-Ge Chen

IPC classification: G10L11/04

CPC classification: G10L19/0208, G10L19/0204, G10L19/035

Abstract: Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.
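
As a rough illustration of the decoder side of this scheme, the Python sketch below rebuilds one omitted sub-band from the two parameters named in the abstract. It covers only the motion-vector case and assumes the scale factor is the target RMS level of the band; the function name `reconstruct_band` and its arguments are hypothetical and do not come from the patent.

```python
import math

def reconstruct_band(coded, offset, width, scale):
    """Rebuild an omitted sub-band from already coded spectrum.

    coded  : already decoded spectral coefficients
    offset : "motion vector", the start index of the source region in `coded`
    width  : number of coefficients in the sub-band
    scale  : scale factor, assumed here to be the target RMS of the band
    """
    shape = coded[offset:offset + width]
    rms = math.sqrt(sum(c * c for c in shape) / width)
    if rms == 0.0:
        return [0.0] * width              # silent source region: nothing to scale
    # Normalise the copied shape to unit RMS, then scale to the coded energy.
    return [scale * c / rms for c in shape]

band = reconstruct_band([3.0, -4.0, 1.0, 2.0], offset=0, width=2, scale=5.0)
```

The reconstructed band keeps the sign pattern and relative magnitudes of the copied region while its RMS level matches the transmitted scale factor, which is why only two parameters per sub-band need to be sent.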

7.

Patent application
ADAPTIVE BANDWIDTH ESTIMATION (in force)

Publication No.: US20130114421A1

Publication date: 2013-05-09

Application No.: US13288968

Filing date: 2011-11-04

Applicants: Tin Qian, Jin Li, Tanner M. Hodgeson, Sanjeev Mehrotra, Jiannan Zheng, Timothy M. Moore

Inventors: Tin Qian, Jin Li, Tanner M. Hodgeson, Sanjeev Mehrotra, Jiannan Zheng, Timothy M. Moore

IPC classification: H04L12/26

CPC classification: H04L43/0829, H04L43/0852, H04L43/16

Abstract: It can be determined whether relative one way delay for data packets in a data stream exceeds a delay threshold. If so, then a delay congestion signal indicating that the relative one way delay exceeds the delay threshold can be generated. The delay congestion signal can be used in calculating an adaptive bandwidth estimate for the data stream. A packet loss rate congestion signal may also be used in calculating the bandwidth estimate. It can be determined whether a data stream of data packets is in a contention state. If the data stream is in the contention state, then an adaptive bandwidth estimate can be calculated for the data stream using a first bandwidth estimation technique. If the data stream is not in the contention state, then the bandwidth estimate for the data stream can be calculated using a second bandwidth estimation technique.
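
The delay and loss congestion signals can be sketched in Python as follows. The abstract does not say how the signals feed into the estimate, so the multiplicative-decrease, additive-increase update below is purely an assumption, as are all threshold values and the names `relative_one_way_delays` and `update_estimate`. Relative delay is taken here as each packet's delay minus the smallest delay observed, which cancels a constant clock offset between sender and receiver.

```python
def relative_one_way_delays(send_ms, recv_ms):
    """Per-packet one-way delay relative to the minimum observed delay (ms)."""
    raw = [r - s for s, r in zip(send_ms, recv_ms)]
    base = min(raw)                       # baseline absorbs any fixed clock offset
    return [d - base for d in raw]

def update_estimate(estimate, rel_delay_ms, loss_rate,
                    delay_threshold_ms=50, loss_threshold=0.02,
                    decrease=0.85, increase=50_000):
    """Return a new bandwidth estimate in bit/s.

    The update rule and every default value are assumptions made for
    illustration; the abstract only states that both signals may be used.
    """
    delay_congested = rel_delay_ms > delay_threshold_ms  # delay congestion signal
    loss_congested = loss_rate > loss_threshold          # loss-rate congestion signal
    if delay_congested or loss_congested:
        return estimate * decrease        # back off when either signal fires
    return estimate + increase            # otherwise probe for more bandwidth

delays = relative_one_way_delays([0, 100, 200], [1000, 1100, 1300])
new_estimate = update_estimate(1_000_000, delays[-1], loss_rate=0.0)
```

Here the third packet arrives 100 ms later than the baseline, which exceeds the assumed 50 ms threshold, so the estimate is reduced even though no packets were lost.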

8.

Granted patent
Encoding streaming media as a high bit rate layer, a low bit rate layer, and one or more intermediate bit rate layers (in force)

Publication No.: US08325800B2

Publication date: 2012-12-04

Application No.: US12116878

Filing date: 2008-05-07

Applicants: Thomas W. Holcomb, Sanjeev Mehrotra, Serge Smirnov, Bharath Siravara

Inventors: Thomas W. Holcomb, Sanjeev Mehrotra, Serge Smirnov, Bharath Siravara

IPC classification: H04N7/12, H04N11/02, H04N11/04

CPC classification: H04N19/34, H04N19/114, H04N19/115, H04N19/124, H04N19/14, H04N19/147, H04N19/164, H04N19/166, H04N19/179, H04N19/187

Abstract: A method of encoding an input video stream comprising a video component and an audio component is disclosed. The input video stream is split into a plurality of segments, each comprising a plurality of frames. Each of the segments is encoded as a low bit rate layer, a high bit rate layer, and one or more intermediate bit rate layers. The bit rate of the low bit rate layer is selected such that a network streaming the segment will always be able to stream the segment encoded as the low bit rate layer. The bit rate of the high bit rate layer is selected such that the segment is able to be decoded and played back at or above a quality threshold. The bit rates of the intermediate bit rate layers are produced by applying a bit rate factor to another bit rate.
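
One way to read the last sentence of this abstract is as a geometric ladder of bit rates. The Python sketch below assumes the factor is applied upward, starting from the low layer and stopping before the high layer; the abstract only says the factor is applied to "another bit rate", so the direction and the name `layer_bitrates` are hypothetical.

```python
def layer_bitrates(low, high, factor):
    """Return the bit rates of all layers, lowest first.

    Intermediate rates are derived by repeatedly multiplying the previous
    layer's rate by `factor` (an assumed reading of the abstract).
    """
    if low <= 0 or high <= low or factor <= 1:
        raise ValueError("need 0 < low < high and factor > 1")
    rates = [low]
    nxt = low * factor
    while nxt < high:                     # stop before reaching the high layer
        rates.append(nxt)
        nxt *= factor
    rates.append(high)                    # the high layer is chosen independently
    return rates

ladder = layer_bitrates(200, 3000, 2)     # kbit/s, illustrative values
```

With a low layer of 200 kbit/s, a high layer of 3000 kbit/s, and a factor of 2, `ladder` is `[200, 400, 800, 1600, 3000]`.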

9.

Granted patent
Low complexity decoder for complex transform coding of multi-channel sound (in force)

Publication No.: US08046214B2

Publication date: 2011-10-25

Application No.: US11767457

Filing date: 2007-06-22

Applicants: Sanjeev Mehrotra, Wei-Ge Chen

Inventors: Sanjeev Mehrotra, Wei-Ge Chen

IPC classification: G10L19/00

CPC classification: G10L19/008

Abstract: A multi-channel audio decoder provides a reduced complexity processing to reconstruct multi-channel audio from an encoded bitstream in which the multi-channel audio is represented as a coded subset of the channels along with a complex channel correlation matrix parameterization. The decoder translates the complex channel correlation matrix parameterization to a real transform that satisfies the magnitude of the complex channel correlation matrix. The multi-channel audio is derived from the coded subset of channels via channel extension processing using a real value effect signal and real number scaling.

10.

Granted patent
Adaptive vector Huffman coding and decoding based on a sum of values of audio data symbols (in force)

Publication No.: US07822601B2

Publication date: 2010-10-26

Application No.: US12122553

Filing date: 2008-05-16

Applicants: Sanjeev Mehrotra, Wei-Ge Chen

Inventors: Sanjeev Mehrotra, Wei-Ge Chen

IPC classification: G10L19/00, H03M7/38, H03M7/40

CPC classification: G10L19/032, H03M7/40, H03M7/4006, H03M7/4093, H03M7/46

Abstract: An audio encoder performs entropy encoding of audio data. For example, an audio encoder determines a Huffman code from a Huffman code table to use for encoding a vector of audio data symbols, where the determining is based on a sum of values of the audio data symbols. An audio decoder performs corresponding entropy decoding.
