Patent search: ap:("Sharath Manjunath" OR "William Gardner") AND inv:"Sharath Manjunath" (page 4)

31.

Granted invention patent
Deblock filtering techniques for video coding according to multiple video standards [In force]

Publication No.: US08045615B2

Publication date: 2011-10-25

Application No.: US11136980

Filing date: 2005-05-25

Applicant(s): Yi Liang, Sharath Manjunath

Inventor(s): Yi Liang, Sharath Manjunath

IPC classification: H04B1/66

CPC classification: H04N19/86, H04N19/117, H04N19/42, H04N19/61, H04N19/82

Abstract: This disclosure describes deblock filtering techniques in which an in-loop deblock filter of a first codec is used as a post deblock filter of a second codec. A number of techniques are also described to facilitate input parameter adjustments and allow for the effective use of the filter with both codecs. The techniques can simplify the architecture of a device that includes multiple codecs operating according to different coding standards. Specifically, the different codecs can use the same deblocking filter regardless of whether the coding standard calls for in-loop filtering or whether post filtering is used. For example, a filter designed as an in-loop deblocking filter for a codec that complies with the ITU-T H.264 coding standard can be used as a post deblocking filter for MPEG-4 video.

32.

Granted invention patent
Arbitrary average data rates for variable rate coders [In force]

Publication No.: US08032369B2

Publication date: 2011-10-04

Application No.: US11625788

Filing date: 2007-01-22

Applicant(s): Sharath Manjunath, Ananthapadmanabhan A. Kandhadai

Inventor(s): Sharath Manjunath, Ananthapadmanabhan A. Kandhadai

IPC classification: G10L19/02

CPC classification: G10L19/22, G10L19/24

Abstract: Methods and apparatus are provided for achieving an arbitrary average data rate for a variable rate coder. One method includes selecting a set (e.g., a pair) of initial composite rates surrounding the arbitrary average data rate. A reallocation fraction is then calculated based on the initial composite rates. The reallocation fraction is used to reassign a number of frames from one component rate of an initial composite rate to another in order to achieve the arbitrary average data rate. Such a method may be configured such that selecting an initial composite rate on one side of (e.g., less than) the arbitrary average data rate implicitly selects the initial composite rate on the other side of the arbitrary average data rate.
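
The idea of a reallocation fraction between two rates that bracket a target can be illustrated with plain linear interpolation. The sketch below is only an illustration under that assumption and is not taken from the patent: the function names, the kbit/s figures, and the blending rule are all hypothetical.

```python
def reallocation_fraction(target, rate_low, rate_high):
    """Fraction f such that f * rate_high + (1 - f) * rate_low == target.

    Assumption (not from the patent): the two initial composite rates
    are blended linearly to reach the target average rate.
    """
    if not rate_low < rate_high:
        raise ValueError("rate_low must be below rate_high")
    if not rate_low <= target <= rate_high:
        raise ValueError("target must lie between the two rates")
    return (target - rate_low) / (rate_high - rate_low)


def frames_to_reassign(total_frames, target, rate_low, rate_high):
    """Number of frames to move from the lower rate to the higher rate."""
    fraction = reallocation_fraction(target, rate_low, rate_high)
    return round(total_frames * fraction)


# Hypothetical example: composite rates of 4 and 8 kbit/s, target 5 kbit/s.
f = reallocation_fraction(5.0, 4.0, 8.0)
print(f, frames_to_reassign(100, 5.0, 4.0, 8.0))
```

With these hypothetical numbers, a quarter of the frames would be coded at the higher rate, which gives an average of 5 kbit/s.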

33.

Invention patent application
CODING OF TRANSITIONAL SPEECH FRAMES FOR LOW-BIT-RATE APPLICATIONS [Under examination, published]

Publication No.: US20090319263A1

Publication date: 2009-12-24

Application No.: US12261518

Filing date: 2008-10-30

Applicant(s): Alok Kumar Gupta, Sharath Manjunath

Inventor(s): Alok Kumar Gupta, Sharath Manjunath

IPC classification: G10L19/02, G10L19/00

CPC classification: G10L19/10, G10L19/125, G10L25/90

Abstract: Systems, methods, and apparatus for low-bit-rate coding of transitional speech frames are disclosed.

34.

Invention patent application
CODING OF TRANSITIONAL SPEECH FRAMES FOR LOW-BIT-RATE APPLICATIONS [Under examination, published]

Publication No.: US20090319261A1

Publication date: 2009-12-24

Application No.: US12143719

Filing date: 2008-06-20

Applicant(s): Alok Kumar Gupta, Sharath Manjunath, Ananthapadmanabhan A. Kandhadai

Inventor(s): Alok Kumar Gupta, Sharath Manjunath, Ananthapadmanabhan A. Kandhadai

IPC classification: G10L21/00

CPC classification: G10L19/20, G10L19/025, G10L19/125, G10L19/22, G10L25/90

Abstract: Systems, methods, and apparatus for low-bit-rate coding of transitional speech frames are disclosed.

35.

Invention patent application
Methods of Performing Error Concealment For Digital Video [In force]

Publication No.: US20080232478A1

Publication date: 2008-09-25

Application No.: US11690132

Filing date: 2007-03-23

Applicant(s): Chia-Yuan Teng, Sharath Manjunath

Inventor(s): Chia-Yuan Teng, Sharath Manjunath

IPC classification: H04N7/68

CPC classification: H04N19/00939, H04N19/107, H04N19/117, H04N19/142, H04N19/159, H04N19/166, H04N19/174, H04N19/895

Abstract: Error concealment is used to hide the effects of errors detected within digital video information. A complex error concealment mode decision is disclosed to determine whether spatial error concealment (SEC) or temporal error concealment (TEC) should be used. The error concealment mode decision system uses different methods depending on whether the damaged frame is an intra-frame or an inter-frame. If the video frame is an intra-frame then a similarity metric is used to determine if the intra-frame represents a scene-change or not. If the video frame is an inter-frame, a complex multi-termed equation is used to determine whether SEC or TEC should be used. A novel spatial error concealment technique is disclosed for use when the error concealment mode decision determines that spatial error concealment should be used for reconstruction. The novel spatial error concealment technique divides a corrupt macroblock into four different regions, a corner region, a row adjacent to the corner region, a column adjacent to the corner region, and a remainder main region. Those regions are then reconstructed in that order and information from earlier reconstructed regions may be used in later reconstructed regions. Finally, a macroblock refreshment technique is disclosed for preventing error propagation from harming non-corrupt inter-blocks. Specifically, an inter-macroblock may be ‘refreshed’ using spatial error concealment if there has been significant error-caused damage that may cause the inter-block to propagate the errors.

36.

Granted invention patent
Method and apparatus for quantizing pitch, amplitude, phase and linear spectrum of voiced speech [In force]

Publication No.: US07426466B2

Publication date: 2008-09-16

Application No.: US10897746

Filing date: 2004-07-22

Applicant(s): Arasanipalai K. Ananthapadmanabhan, Sharath Manjunath, Pengjun Huang, Eddie-Lun Tik Choy, Andrew P. DeJaco

Inventor(s): Arasanipalai K. Ananthapadmanabhan, Sharath Manjunath, Pengjun Huang, Eddie-Lun Tik Choy, Andrew P. DeJaco

IPC classification: G10L19/14

CPC classification: G10L19/04, G10L19/0204, G10L19/032, G10L19/08, G10L19/097, G10L19/26, G10L25/12

Abstract: A method and apparatus for predictively quantizing voiced speech includes a parameter generator and a quantizer. The parameter generator is configured to extract parameters from frames of predictive speech such as voiced speech, and to transform the extracted information to a frequency-domain representation. The quantizer is configured to subtract a weighted sum of the parameters for previous frames from the parameter for the current frame. The quantizer is configured to quantize the difference value. A prototype extractor may be added to first extract a pitch period prototype to be processed by the parameter generator.
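
The quantizer step described in the abstract (subtract a weighted sum of previous-frame parameters, then quantize the difference) can be sketched generically as follows. This is a minimal illustration, not the patented scheme: the uniform scalar quantizer, the use of previously reconstructed values as the prediction history, and the weights and step size are all assumptions.

```python
def predictive_quantize(values, weights, step):
    """Predictively quantize one scalar parameter per frame.

    Assumptions (not from the patent): a uniform quantizer with step
    `step`, and a prediction formed from previously *reconstructed*
    values so that encoder and decoder stay in sync.
    """
    history = [0.0] * len(weights)  # most recent reconstruction first
    indices, reconstructed = [], []
    for x in values:
        prediction = sum(w * h for w, h in zip(weights, history))
        index = round((x - prediction) / step)  # quantize the difference
        x_hat = prediction + index * step       # decoder-side reconstruction
        indices.append(index)
        reconstructed.append(x_hat)
        history = [x_hat] + history[:-1]
    return indices, reconstructed


# Hypothetical per-frame parameter track (for example a normalized pitch lag).
indices, recon = predictive_quantize([1.0, 1.2, 1.1, 0.9], [0.5, 0.3], 0.1)
print(indices, recon)
```

Because each difference is rounded to the nearest step, every reconstructed value stays within half a step of its input.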

37.

Invention patent application
RENDERING 3D VIDEO IMAGES ON A STEREO-ENABLED DISPLAY [In force]

Publication No.: US20080165181A1

Publication date: 2008-07-10

Application No.: US11620621

Filing date: 2007-01-05

Applicant(s): Haohong Wang, Hsiang-Tsun Li, Sharath Manjunath, Yingyong Qi

Inventor(s): Haohong Wang, Hsiang-Tsun Li, Sharath Manjunath, Yingyong Qi

IPC classification: G06T15/00

CPC classification: G06T15/10, H04N13/275

Abstract: The rendering of 3D video images on a stereo-enabled display (e.g., stereoscopic or autostereoscopic display) is described. The process includes culling facets facing away from a viewer, defining foreground facets for Left and Right Views and common background facets, determining lighting for these facets, and performing screen mapping and scene rendering for one view (e.g., Right View) using computational results for facets of the other view (i.e., Left View). In one embodiment, visualization of images is provided on the stereo-enabled display of a low-power device, such as a mobile phone, a computer, a video game platform, or a Personal Digital Assistant (PDA) device.
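
The first step named in the abstract, culling facets that face away from the viewer, is commonly done with a dot-product test. The sketch below shows that standard back-face test only; the dictionary layout with `center` and `normal` keys is a hypothetical data structure, and nothing here is taken from the patent itself.

```python
def is_back_facing(normal, view_to_facet):
    """True if the facet normal points away from the viewer.

    `view_to_facet` is the vector from the eye to a point on the facet.
    """
    return sum(n * v for n, v in zip(normal, view_to_facet)) >= 0


def cull_back_facets(facets, eye):
    """Keep only the facets whose normals point toward the eye position."""
    kept = []
    for facet in facets:
        view = tuple(c - e for c, e in zip(facet["center"], eye))
        if not is_back_facing(facet["normal"], view):
            kept.append(facet)
    return kept


# Hypothetical scene: two facets at the origin, viewer on the +z axis.
facets = [
    {"center": (0.0, 0.0, 0.0), "normal": (0.0, 0.0, 1.0)},   # faces viewer
    {"center": (0.0, 0.0, 0.0), "normal": (0.0, 0.0, -1.0)},  # faces away
]
visible = cull_back_facets(facets, eye=(0.0, 0.0, 5.0))
print(len(visible))
```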

38.

Invention patent application
Mobile device with dual digital camera sensors and methods of using the same [In force]

Publication No.: US20080024614A1

Publication date: 2008-01-31

Application No.: US11493439

Filing date: 2006-07-25

Applicant(s): Hsiang-Tsun Li, Behnam Katibian, Haohong Wang, Sharath Manjunath

Inventor(s): Hsiang-Tsun Li, Behnam Katibian, Haohong Wang, Sharath Manjunath

IPC classification: H04N5/225

CPC classification: H04N5/225, H04N5/2258, H04N5/23238, H04N13/239, H04N13/296

Abstract: A mobile device comprising a first image sensor, a second image sensor configured to change position with respect to the first image sensor, a controller configured to control the position of the second image sensor, and an image processing module configured to process and combine images captured by the first and second image sensors.

39.

Invention patent application
SYSTEMS, METHODS, AND APPARATUS FOR DETECTION OF TONAL COMPONENTS [In force]

Publication No.: US20070174052A1

Publication date: 2007-07-26

Application No.: US11567052

Filing date: 2006-12-05

Applicant(s): Sharath Manjunath, Ananthapadmanabhan Kandhadai

Inventor(s): Sharath Manjunath, Ananthapadmanabhan Kandhadai

IPC classification: G10L19/00

CPC classification: G10L19/22, G10L19/18, G10L25/78

Abstract: Systems, methods, and apparatus for the detection of signals having spectral peaks with narrow bandwidth are described herein. The range of described configurations includes implementations that perform such detection using parameters of a linear prediction coding (LPC) analysis scheme.

40.

Granted invention patent
Low bit-rate coding of unvoiced segments of speech [In force]

Publication No.: US07146310B2

Publication date: 2006-12-05

Application No.: US10954851

Filing date: 2004-09-29

Applicant(s): Amitava Das, Sharath Manjunath

Inventor(s): Amitava Das, Sharath Manjunath

IPC classification: G10L11/06

CPC classification: G10L19/18, G10L19/08, G10L25/21

Abstract: A low-bit-rate coding technique for unvoiced segments of speech includes the steps of extracting high-time-resolution energy coefficients from a frame of speech, quantizing the energy coefficients, generating a high-time-resolution energy envelope from the quantized energy coefficients, and reconstituting a residue signal by shaping a randomly generated noise vector with quantized values of the energy envelope. The energy envelope may be generated with a linear interpolation technique. A post-processing measure may be obtained and compared with a predefined threshold to determine whether the coding algorithm is performing adequately.
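
The four steps listed in this abstract (extract energy coefficients, quantize them, interpolate an envelope, shape random noise) can be sketched roughly as below. This is a simplified illustration under stated assumptions, not the patented coder: RMS sub-block energies, a uniform quantizer, Gaussian noise, and all function names and parameter values are hypothetical choices.

```python
import math
import random


def subframe_energies(frame, n_sub):
    """RMS energy of each of n_sub equal-length sub-blocks (assumed measure)."""
    size = len(frame) // n_sub
    return [
        math.sqrt(sum(s * s for s in frame[i * size:(i + 1) * size]) / size)
        for i in range(n_sub)
    ]


def quantize(values, step):
    """Uniform scalar quantizer (an assumption, not the patent's codebook)."""
    return [round(v / step) * step for v in values]


def interpolate_envelope(coeffs, length):
    """Linearly interpolate coefficients (at least two) to `length` samples."""
    size = length / len(coeffs)
    envelope = []
    for n in range(length):
        # Fractional coefficient index, measured from sub-block centres.
        pos = (n - (size - 1) / 2) / size
        pos = min(max(pos, 0.0), len(coeffs) - 1.0)
        i = min(int(pos), len(coeffs) - 2)
        frac = pos - i
        envelope.append((1 - frac) * coeffs[i] + frac * coeffs[i + 1])
    return envelope


def shape_noise(envelope, seed=0):
    """Scale a seeded Gaussian noise vector sample by sample."""
    rng = random.Random(seed)
    return [e * rng.gauss(0.0, 1.0) for e in envelope]


frame = [1.0, -1.0, 1.0, -1.0, 2.0, -2.0, 2.0, -2.0]
coeffs = quantize(subframe_energies(frame, 2), step=0.5)
residue = shape_noise(interpolate_envelope(coeffs, len(frame)))
print(coeffs, len(residue))
```

Only the quantized coefficients would need to be transmitted in such a scheme; the decoder can regenerate its own noise vector and apply the same envelope.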
