专利检索 ap:("Alok Kumar Gupta" OR "Sharath Manjunath") AND inv:"Sharath Manjunath" 第 1 页

1.

发明申请
CODING OF TRANSITIONAL SPEECH FRAMES FOR LOW-BIT-RATE APPLICATIONS 审中-公开

公开(公告)号：US20090319263A1

公开(公告)日：2009-12-24

申请号：US12261518

申请日：2008-10-30

申请人： Alok Kumar Gupta , Sharath Manjunath

发明人： Alok Kumar Gupta , Sharath Manjunath

IPC分类号： G10L19/02 , G10L19/00

CPC分类号： G10L19/10 , G10L19/125 , G10L25/90

摘要： Systems, methods, and apparatus for low-bit-rate coding of transitional speech frames are disclosed.

2.

发明申请
CODING OF TRANSITIONAL SPEECH FRAMES FOR LOW-BIT-RATE APPLICATIONS 审中-公开
标题翻译：用于低比特率应用的过渡语音框架的编码

公开(公告)号：US20090319261A1

公开(公告)日：2009-12-24

申请号：US12143719

申请日：2008-06-20

申请人： Alok Kumar Gupta , Sharath Manjunath , Ananthapadmanabhan A. Kandhadai

发明人： Alok Kumar Gupta , Sharath Manjunath , Ananthapadmanabhan A. Kandhadai

IPC分类号： G10L21/00

CPC分类号： G10L19/20 , G10L19/025 , G10L19/125 , G10L19/22 , G10L25/90

摘要： Systems, methods, and apparatus for low-bit-rate coding of transitional speech frames are disclosed.

摘要翻译： 公开了用于过渡语音帧的低比特率编码的系统，方法和装置。

3.

发明授权
Adaptive intra-refresh for digital video encoding 有权
标题翻译：适用于数字视频编码的内部刷新

公开(公告)号：US08948266B2

公开(公告)日：2015-02-03

申请号：US11025297

申请日：2004-12-28

申请人： Yi Liang , Khaled Helmi El-Maleh , Sharath Manjunath

发明人： Yi Liang , Khaled Helmi El-Maleh , Sharath Manjunath

IPC分类号： H04N7/12 , H04N19/176 , H04N19/164 , H04N19/89 , H04N19/14 , H04N19/107 , H04N19/172 , H04N19/147 , H04N19/61

CPC分类号： H04N19/00278 , H04N19/107 , H04N19/14 , H04N19/147 , H04N19/164 , H04N19/172 , H04N19/176 , H04N19/61 , H04N19/89

摘要： An adaptive Intra-refresh (IR) technique for digital video encoding adjusts IR rate based on video content, or a combination of video content and channel condition. The IR rate may be applied at the frame level or macroblock (MB) level. At the frame level, the IR rate specifies the percentage of MBs to be Intra-coded within the frame. At the MB level, the IR rate defines a statistical probability that a particular MB is to be Intra-coded. The IR rate is adjusted in proportion to a combined metric that weighs estimated channel loss probability, frame-to-frame variation, and texture information. The IR rate can be determined using a close-form solution that requires relatively low implementation complexity. For example, such a close-form does not require iteration or an exhaustive search. In addition, the IR rate can be determined from parameters that are available before motion estimation and compensation are performed.

摘要翻译： 用于数字视频编码的自适应内部刷新（IR）技术基于视频内容或视频内容和频道条件的组合来调整IR速率。可以在帧级或宏块（MB）级应用IR速率。在帧级别，IR速率指定帧内帧内编码的百分比。在MB级别，IR率定义了特定MB被内部编码的统计概率。 IR速率与重量估计的信道丢失概率，帧到帧变化和纹理信息的组合度量成比例地调整。 IR速率可以使用需要较低实现复杂度的紧密形式的解决方案来确定。例如，这种关闭形式不需要迭代或穷尽搜索。另外，可以在执行运动估计和补偿之前可用的参数来确定IR速率。

4.

发明授权
3D video encoding 有权
标题翻译： 3D视频编码

公开(公告)号：US08594180B2

公开(公告)日：2013-11-26

申请号：US11677335

申请日：2007-02-21

申请人： Kai Chieh Yang , Haohong Wang , Khaled Helmi El-Maleh , Sharath Manjunath

发明人： Kai Chieh Yang , Haohong Wang , Khaled Helmi El-Maleh , Sharath Manjunath

IPC分类号： G06F21/00

CPC分类号： H04N19/194 , H04N13/122 , H04N19/124 , H04N19/147 , H04N19/149 , H04N19/172 , H04N19/176 , H04N19/597 , H04N19/61

摘要： A stereo 3D video frame includes left and right components that are combined to produce a stereo image. For a given amount of distortion, the left and right components may have different impacts on perceptual visual quality of the stereo image due to asymmetry in the distortion response of the human eye. A 3D video encoder adjusts an allocation of coding bits between left and right components of the 3D video based on a frame-level bit budget and a weighting between the left and right components. The video encoder may generate the bit allocation in the rho (ρ) domain. The weighted bit allocation may be derived based on a quality metric that indicates overall quality produced by the left and right components. The weighted bit allocation compensates for the asymmetric distortion response to reduce overall perceptual distortion in the stereo image and thereby enhance or maintain visual quality.

摘要翻译： 立体3D视频帧包括组合以产生立体图像的左和右组件。对于给定量的失真，由于人眼的失真响应的不对称，左和右分量可能对立体图像的感知视觉质量具有不同的影响。 3D视频编码器基于帧级比特预算和左右分量之间的加权来调整3D视频的左和右分量之间的编码比特的分配。视频编码器可以在rho（rho）域中生成比特分配。可以基于指示左组件和右组件产生的总体质量的质量度量来导出加权比特分配。加权比特分配补偿非对称失真响应，以减少立体图像中的整体感知失真，从而增强或维持视觉质量。

5.

发明授权
Methods of performing error concealment for digital video 有权
标题翻译：对数字视频执行错误隐藏的方法

公开(公告)号：US08379734B2

公开(公告)日：2013-02-19

申请号：US11690132

申请日：2007-03-23

申请人： Chia-Yuan Teng , Sharath Manjunath

发明人： Chia-Yuan Teng , Sharath Manjunath

IPC分类号： H04N7/68

CPC分类号： H04N19/00939 , H04N19/107 , H04N19/117 , H04N19/142 , H04N19/159 , H04N19/166 , H04N19/174 , H04N19/895

摘要： Error concealment is used to hide the effects of errors detected within digital video information. A complex error concealment mode decision is disclosed to determine whether spatial error concealment (SEC) or temporal error concealment (TEC) should be used. The error concealment mode decision system uses different methods depending on whether the damaged frame is an intra-frame or an inter-frame. If the video frame is an intra-frame then a similarity metric is used to determine if the intra-frame represents a scene-change or not. If the video frame is an intra-frame, a complex multi-termed equation is used to determine whether SEC or TEC should be used. A novel spatial error concealment technique is disclosed for use when the error concealment mode decision determines that spatial error concealment should be used for reconstruction. The novel spatial error concealment technique divides a corrupt macroblock into four different regions, a corner region, a row adjacent to the corner region, a column adjacent to the corner region, and a remainder main region. Those regions are then reconstructed in that order and information from earlier reconstructed regions may be used in later reconstructed regions. Finally, a macroblock refreshment technique is disclosed for preventing error propagation from harming non-corrupt inter-blocks. Specifically, an inter-macroblock may be ‘refreshed’ using spatial error concealment if there has been significant error caused damage that may cause the inter-block to propagate the errors.

摘要翻译： 错误隐藏用于隐藏数字视频信息中检测到的错误的影响。公开了一种复杂的错误隐藏模式决定，以确定是否应使用空间误差隐藏（SEC）或时间误差隐藏（TEC）。错误隐藏模式决策系统使用不同的方法，取决于损坏的帧是帧内还是帧间。如果视频帧是帧内帧，则使用相似性度量来确定帧内是否表示场景改变。如果视频帧是帧内帧，则使用复数多方程来确定是否应使用SEC或TEC。当错误隐藏模式决定确定空间误差隐藏应用于重建时，公开了一种新颖的空间误差隐藏技术。新颖的空间误差隐藏技术将腐败的宏块分为四个不同的区域，一个角区域，一个与拐角区域相邻的一行，一个邻近拐角区域的列以及一个剩余的主区域。然后按照该顺序重建那些区域，并且可以在稍后的重建区域中使用来自较早重建区域的信息。最后，公开了一种宏块刷新技术，用于防止错误传播损害非损坏的块间。具体地，如果存在可能导致块间传播错误的严重错误引起的损坏，则可以使用空间错误隐藏来刷新宏块间宏块。

6.

发明授权
Systems, methods, and apparatus for computationally efficient, iterative alignment of speech waveforms 有权
标题翻译：系统，方法和设备，用于语音波形的计算高效，迭代对齐

公开(公告)号：US08145477B2

公开(公告)日：2012-03-27

申请号：US11566039

申请日：2006-12-01

申请人： Sharath Manjunath , Ananthapadmanabhan A. Kandhadai

发明人： Sharath Manjunath , Ananthapadmanabhan A. Kandhadai

IPC分类号： G10L11/00 , G06F17/15

CPC分类号： G10L19/097

摘要： Systems, methods, and apparatus described include waveform alignment operations in which a single set of evaluated cosines and sines is used to calculate cross-correlations of two periodic waveforms at two different phase shifts.

摘要翻译： 所描述的系统，方法和装置包括波形对准操作，其中使用单组估计的余弦和正弦来计算两个不同相移处的两个周期波形的互相关。

7.

发明授权
Bandwidth-adaptive quantization 有权
标题翻译：带宽自适应量化

公开(公告)号：US08090577B2

公开(公告)日：2012-01-03

申请号：US10215533

申请日：2002-08-08

申请人： Khaled Helmi El-Maleh , Ananthapadmanabhan Arasanipalai Kandhadai , Sharath Manjunath

发明人： Khaled Helmi El-Maleh , Ananthapadmanabhan Arasanipalai Kandhadai , Sharath Manjunath

IPC分类号： G10L19/00

CPC分类号： G10L19/002 , G10L19/0208 , G10L19/12 , G10L2019/0005

摘要： Methods and apparatus are presented for determining the type of acoustic signal and the type of frequency spectrum exhibited by the acoustic signal in order to selectively delete parameter information before vector quantization. The bits that would otherwise be allocated to the deleted parameters can then be re-allocated to the quantization of the remaining parameters, which results in an improvement of the perceptual quality of the synthesized acoustic signal. Alternatively, the bits that would have been allocated to the deleted parameters are dropped, resulting in an overall bit-rate reduction.

摘要翻译： 提出了用于确定声信号的类型和声信号显示的频谱的类型的方法和装置，以便在矢量量化之前选择性地删除参数信息。否则将分配给删除的参数的位可以被重新分配给剩余参数的量化，这导致合成声信号的感知质量的改善。或者，将分配给删除的参数的位将被丢弃，导致整体比特率降低。

8.

发明授权
Method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions 有权
标题翻译：在预测语音编码器中使用编码方案选择模式以降低对帧错误状况的灵敏度的方法和装置

公开(公告)号：US06438518B1

公开(公告)日：2002-08-20

申请号：US09429754

申请日：1999-10-28

申请人： Sharath Manjunath , Andrew P. Dejaco , Arasanipalai K. Ananthapadmanabhan , Eddie Lun Tik Choy

发明人： Sharath Manjunath , Andrew P. Dejaco , Arasanipalai K. Ananthapadmanabhan , Eddie Lun Tik Choy

IPC分类号： G10L1904

CPC分类号： G10L19/18 , G10L19/02

摘要： A method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions includes a speech coder configured to select from among various predictive coding modes. After a predefined number of speech frames have been predictively coded, the speech coder codes one frame with a nonpredictive coding mode or a mildly predictive coding mode. The predefined number of frames can be determined in advance from the subjective standpoint of a listener. The predefined number of frames may be varied periodically. An average coding bit rate may be maintained for the speech coder by ensuring that an average coding bit rate is maintained for each successive pattern, or group, of predictively coded speech frames including at least one nonpredictively coded or mildly predictively coded speech frame.

摘要翻译： 一种用于在预测语音编码器中使用编码方案选择模式以降低对帧错误状况的灵敏度的方法和装置包括配置成从各种预测编码模式中进行选择的语音编码器。在预定数量的语音帧已被预测编码之后，语音编码器以非预测编码模式或轻度预测编码模式对一帧进行编码。可以从收听者的主观角度预先确定预定数量的帧。预定数量的帧可以周期性地改变。可以通过确保针对包括至少一个非预测编码或温和预测编码的语音帧的预测编码语音帧的每个连续模式或组维持平均编码比特率来维持语音编码器的平均编码比特率。

9.

发明授权
Method and apparatus for maintaining a target bit rate in a speech coder 有权
标题翻译：用于在语音编码器中维持目标比特率的方法和装置

公开(公告)号：US06330532B1

公开(公告)日：2001-12-11

申请号：US09356493

申请日：1999-07-19

申请人： Sharath Manjunath , Andrew P. Dejaco

发明人： Sharath Manjunath , Andrew P. Dejaco

IPC分类号： G10L2104

CPC分类号： G10L19/002 , G10L19/18

摘要： A method and apparatus for maintaining a target bit rate in a speech coder includes a speech coder for encoding a frame at a preselected encoding rate, computing a running average bit rate for a predefined number of encoded frames, subtracting the running average bit rate from a predefined target average bit rate, and dividing the difference by the preselected encoding rate. If the quotient value is negative, a predefined number of possible occurrence counts of speech coder performance threshold values that are less than a current performance threshold value is accumulated, the accumulated number being greater than the absolute value of the quotient. The product of a decrement-per-occurrence-count-value and the predefined number of occurrence counts is subtracted from the current performance threshold value to obtain a new performance threshold value. If the quotient value is positive, a predefined number of possible occurrence counts of speech coder performance threshold values that are greater than the current performance threshold value is accumulated, the accumulated number being greater than the quotient. The product of an increment-per-occurrence-count-value and the predefined number of occurrence counts is added to the current performance threshold value to obtain a new performance.

摘要翻译： 用于在语音编码器中维持目标比特率的方法和装置包括语音编码器，用于以预先选择的编码速率对帧进行编码，计算预定数量编码帧的运行平均比特率，从预定义的目标平均比特率，并且将差除以预选的编码率。如果商值为负，则累积小于当前性能阈值的语音编码器性能阈值的预定数量的可能发生计数，累积数大于商的绝对值。从当前性能阈值中减去每次出现计数值递减和预定发生次数的乘积，以获得新的性能阈值。如果商值为正，则累积大于当前性能阈值的语音编码器性能阈值的预定数量的可能发生计数，累积数大于商。将每个出现次数增量值和预定发生次数的乘积加到当前性能阈值以获得新的性能。

10.

发明授权
Method and apparatus for generating and encoding line spectral square roots 失效
标题翻译：用于生成和编码线谱平方根的方法和装置

公开(公告)号：US5754733A

公开(公告)日：1998-05-19

申请号：US509848

申请日：1995-08-01

申请人： William R. Gardner , Sharath Manjunath , Peter A. Monta

发明人： William R. Gardner , Sharath Manjunath , Peter A. Monta

IPC分类号： G10L19/02 , G10L19/00 , G10L19/04 , G10L19/06 , H03M7/30 , H03M7/36 , H04B14/04 , G10L3/02

CPC分类号： G10L19/07

摘要： A novel and improved method and apparatus for encoding line predictive coding (LPC) data in a speech compression system using line spectral square root values is disclosed. A novel and computationally efficient procedure for determining the set of quantization sensitivities for the line spectral square root values is disclosed, which results in a computationally efficient error measure for use in vector quantization of the line spectral square root values. A novel method of weighting the quantization error is disclosed, which accumulates the quantization error in each line spectral square root value and weights that error by the sensitivity of that line spectral square root value.

摘要翻译： 公开了一种用于使用线谱平方根值在语音压缩系统中对行预测编码（LPC）数据进行编码的新颖和改进的方法和装置。公开了一种用于确定线谱平方根值的量化灵敏度集合的新颖且计算上有效的过程，其导致在线谱平方根值的矢量量化中使用的计算有效的误差测量。公开了一种加权量化误差的新颖方法，其在每个线谱平方根中累积量化误差，并通过该线谱平方根值的灵敏度对该误差进行加权。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类