专利检索 ap:("Eddie Lun Tik Choy" OR "Sharath Manjunath") AND inv:"Sharath Manjunath" 第 1 页

1.

发明授权
Method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions 有权
标题翻译：在预测语音编码器中使用编码方案选择模式以降低对帧错误状况的灵敏度的方法和装置

公开(公告)号：US06438518B1

公开(公告)日：2002-08-20

申请号：US09429754

申请日：1999-10-28

申请人： Sharath Manjunath , Andrew P. Dejaco , Arasanipalai K. Ananthapadmanabhan , Eddie Lun Tik Choy

发明人： Sharath Manjunath , Andrew P. Dejaco , Arasanipalai K. Ananthapadmanabhan , Eddie Lun Tik Choy

IPC分类号： G10L1904

CPC分类号： G10L19/18 , G10L19/02

摘要： A method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions includes a speech coder configured to select from among various predictive coding modes. After a predefined number of speech frames have been predictively coded, the speech coder codes one frame with a nonpredictive coding mode or a mildly predictive coding mode. The predefined number of frames can be determined in advance from the subjective standpoint of a listener. The predefined number of frames may be varied periodically. An average coding bit rate may be maintained for the speech coder by ensuring that an average coding bit rate is maintained for each successive pattern, or group, of predictively coded speech frames including at least one nonpredictively coded or mildly predictively coded speech frame.

摘要翻译： 一种用于在预测语音编码器中使用编码方案选择模式以降低对帧错误状况的灵敏度的方法和装置包括配置成从各种预测编码模式中进行选择的语音编码器。在预定数量的语音帧已被预测编码之后，语音编码器以非预测编码模式或轻度预测编码模式对一帧进行编码。可以从收听者的主观角度预先确定预定数量的帧。预定数量的帧可以周期性地改变。可以通过确保针对包括至少一个非预测编码或温和预测编码的语音帧的预测编码语音帧的每个连续模式或组维持平均编码比特率来维持语音编码器的平均编码比特率。

2.

发明授权
Frame erasure compensation method in a variable rate speech coder 有权
标题翻译：可变速率语音编码器中的帧擦除补偿方法

公开(公告)号：US06584438B1

公开(公告)日：2003-06-24

申请号：US09557283

申请日：2000-04-24

申请人： Sharath Manjunath , Pengjun Huang , Eddie-Lun Tik Choy

发明人： Sharath Manjunath , Pengjun Huang , Eddie-Lun Tik Choy

IPC分类号： G10L1300

CPC分类号： G10L21/02 , G10L19/005 , G10L19/097

摘要： A frame erasure compensation method in a variable-rate speech coder includes quantizing, with a first encoder, a pitch lag value for a current frame and a first delta pitch lag value equal to the difference between the pitch lag value for the current frame and the pitch lag value for the previous frame. A second, predictive encoder quantizes only a second delta pitch lag value for the previous frame (equal to the difference between the pitch lag value for the previous frame and the pitch lag value for the frame prior to that frame). If the frame prior to the previous frame is processed as a frame erasure, the pitch lag value for the previous frame is obtained by subtracting the first delta pitch lag value from the pitch lag value for the current frame. The pitch lag value for the erasure frame is then obtained by subtracting the second delta pitch lag value from the pitch lag value for the previous frame. Additionally, a waveform interpolation method may be used to smooth discontinuities caused by changes in the coder pitch memory.

摘要翻译： 可变速率语音编码器中的帧擦除补偿方法包括：利用第一编码器量化当前帧的音调滞后值，以及等于当前帧的音调滞后值与第前一帧的音调滞后值。第二预测编码器仅量化前一帧的第二增量音调滞后值（等于先前帧的音调滞后值与该帧之前的帧的音调滞后值之间的差）。如果先前帧之前的帧被作为帧擦除处理，则通过从当前帧的音调滞后值中减去第一增量音调滞后值来获得先前帧的音调滞后值。然后通过从前一帧的音调滞后值减去第二增量音调滞后值来获得擦除帧的音调滞后值。此外，可以使用波形插值方法来平滑由编码器音调存储器的变化引起的不连续性。

3.

发明授权
Method and apparatus for quantizing pitch, amplitude, phase and linear spectrum of voiced speech 有权
标题翻译：用于量化浊音的音调，幅度，相位和线性频谱的方法和装置

公开(公告)号：US07426466B2

公开(公告)日：2008-09-16

申请号：US10897746

申请日：2004-07-22

申请人： Arasanipalai K. Ananthapadmanabhan , Sharath Manjunath , Pengjun Huang , Eddie-Lun Tik Choy , Andrew P. DeJaco

发明人： Arasanipalai K. Ananthapadmanabhan , Sharath Manjunath , Pengjun Huang , Eddie-Lun Tik Choy , Andrew P. DeJaco

IPC分类号： G10L19/14

CPC分类号： G10L19/04 , G10L19/0204 , G10L19/032 , G10L19/08 , G10L19/097 , G10L19/26 , G10L25/12

摘要： A method and apparatus for predictively quantizing voiced speech includes a parameter generator and a quantizer. The parameter generator is configured to extract parameters from frames of predictive speech such as voiced speech, and to transform the extracted information to a frequency-domain representation. The quantizer is configured to subtract a weighted sum of the parameters for previous frames from the parameter for the current frame. The quantizer is configured to quantize the difference value. A prototype extractor may be added to first extract a pitch period prototype to be processed by the parameter generator.

摘要翻译： 用于预测量化浊音的方法和装置包括参数发生器和量化器。参数发生器被配置为从诸如有声语音的预测语音的帧中提取参数，并且将提取的信息变换为频域表示。量化器被配置为从当前帧的参数中减去先前帧的参数的加权和。量化器被配置为量化差值。可以添加原型提取器以首先提取要由参数发生器处理的音调周期原型。

4.

发明授权
Method and apparatus for identifying frequency bands to compute linear phase shifts between frame prototypes in a speech coder 有权
标题翻译：用于识别频带以计算语音编码器中的帧原型之间的线性相移的方法和装置

公开(公告)号：US06434519B1

公开(公告)日：2002-08-13

申请号：US09356861

申请日：1999-07-19

申请人： Sharath Manjunath , Andrew P. Dejaco , Arasanipalai K. Ananthapadmanabhan , Pengjun Huang , Eddie Lun Tik Choy

发明人： Sharath Manjunath , Andrew P. Dejaco , Arasanipalai K. Ananthapadmanabhan , Pengjun Huang , Eddie Lun Tik Choy

IPC分类号： G10L1914

CPC分类号： G10L19/0208 , G10L19/10

摘要： A method and apparatus for identifying frequency bands to compute linear phase shifts between frame prototypes in a speech coder includes partitioning the frequency spectrum of a prototype of a frame by dividing the frequency spectrum into segments, assigning one or more bands to each segment, and establishing, for each segment, a set of bandwidths for the bands. The bandwidths may be fixed and uniformly distributed in any given segment. The bandwidths may be fixed and non-uniformly distributed in any segment. The bandwidths may be variable and non-uniformly distributed in any given segment.

摘要翻译： 用于识别用于计算语音编码器中的帧原型之间的线性相移的频带的方法和装置包括：通过将频谱划分成段，将一个或多个频带分配给每个分段来建立帧的原型的频谱，并建立，对于每个段，一组带宽的带宽。带宽可以是固定的，并且均匀分布在任何给定的段中。带宽可以是固定的，并且不均匀地分布在任何段中。带宽可以是可变的，并且不均匀地分布在任何给定的段中。

5.

发明申请
METHOD AND APPARATUS FOR PREDICTIVELY QUANTIZING VOICED SPEECH 有权
标题翻译：用于预测定语音的方法和装置

公开(公告)号：US20080312917A1

公开(公告)日：2008-12-18

申请号：US12190524

申请日：2008-08-12

申请人： Arasanipalai K. Ananthapadmanabhan , Sharath Manjunath , Pengjun Huang , Eddie-Lun Tik Choy , Andrew P. Dejaco

发明人： Arasanipalai K. Ananthapadmanabhan , Sharath Manjunath , Pengjun Huang , Eddie-Lun Tik Choy , Andrew P. Dejaco

IPC分类号： G10L19/00

CPC分类号： G10L19/04 , G10L19/0204 , G10L19/032 , G10L19/08 , G10L19/097 , G10L19/26 , G10L25/12

摘要： A method and apparatus for predictively quantizing voiced speech includes a parameter generator and a quantizer. The parameter generator is configured to extract parameters from frames of predictive speech such as voiced speech, and to transform the extracted information to a frequency-domain representation. The quantizer is configured to subtract a weighted sum of the parameters for previous frames from the parameter for the current frame. The quantizer is configured to quantize the difference value. A prototype extractor may be added to first extract a pitch period prototype to be processed by the parameter generator.

摘要翻译： 用于预测量化浊音的方法和装置包括参数发生器和量化器。参数发生器被配置为从诸如有声语音的预测语音的帧中提取参数，并且将提取的信息变换为频域表示。量化器被配置为从当前帧的参数中减去先前帧的参数的加权和。量化器被配置为量化差值。可以添加原型提取器以首先提取要由参数发生器处理的音调周期原型。

6.

发明授权
Amplitude quantization scheme for low-bit-rate speech coders 有权
标题翻译：低比特率语音编码器的幅度量化方案

公开(公告)号：US06324505B1

公开(公告)日：2001-11-27

申请号：US09356756

申请日：1999-07-19

申请人： Eddie Lun Tik Choy , Sharath Manjunath

发明人： Eddie Lun Tik Choy , Sharath Manjunath

IPC分类号： G10L2102

CPC分类号： G10L19/0204 , G10L25/18

摘要： An amplitude quantization scheme for low-bit-rate speech coders includes the first step of extracting a vector of spectral information from a frame. The energy of the vector is normalized to generate gain factors. The gain factors are differentially vector quantized. The normalized gain factors are non-uniformly downsampled to generate a fixed-dimension vector with elements associated with a set of non-uniform frequency bands. The fixed-dimension vector is split into two or more sub-vectors. The sub-vectors are differentially quantized, to best advantage with a harmonic cloning process.

摘要翻译： 用于低比特率语音编码器的幅度量化方案包括从帧提取频谱信息的向量的第一步骤。向量的能量被归一化以产生增益因子。增益因子是差分矢量量化的。归一化的增益因子被非均匀地下采样以产生具有与一组非均匀频带相关联的元素的固定维度向量。固定维度向量被分成两个或多个子向量。子矢量被差分量化，以利用谐波克隆过程的最佳优势。

7.

发明授权
Adaptive intra-refresh for digital video encoding 有权
标题翻译：适用于数字视频编码的内部刷新

公开(公告)号：US08948266B2

公开(公告)日：2015-02-03

申请号：US11025297

申请日：2004-12-28

申请人： Yi Liang , Khaled Helmi El-Maleh , Sharath Manjunath

发明人： Yi Liang , Khaled Helmi El-Maleh , Sharath Manjunath

IPC分类号： H04N7/12 , H04N19/176 , H04N19/164 , H04N19/89 , H04N19/14 , H04N19/107 , H04N19/172 , H04N19/147 , H04N19/61

CPC分类号： H04N19/00278 , H04N19/107 , H04N19/14 , H04N19/147 , H04N19/164 , H04N19/172 , H04N19/176 , H04N19/61 , H04N19/89

摘要： An adaptive Intra-refresh (IR) technique for digital video encoding adjusts IR rate based on video content, or a combination of video content and channel condition. The IR rate may be applied at the frame level or macroblock (MB) level. At the frame level, the IR rate specifies the percentage of MBs to be Intra-coded within the frame. At the MB level, the IR rate defines a statistical probability that a particular MB is to be Intra-coded. The IR rate is adjusted in proportion to a combined metric that weighs estimated channel loss probability, frame-to-frame variation, and texture information. The IR rate can be determined using a close-form solution that requires relatively low implementation complexity. For example, such a close-form does not require iteration or an exhaustive search. In addition, the IR rate can be determined from parameters that are available before motion estimation and compensation are performed.

摘要翻译： 用于数字视频编码的自适应内部刷新（IR）技术基于视频内容或视频内容和频道条件的组合来调整IR速率。可以在帧级或宏块（MB）级应用IR速率。在帧级别，IR速率指定帧内帧内编码的百分比。在MB级别，IR率定义了特定MB被内部编码的统计概率。 IR速率与重量估计的信道丢失概率，帧到帧变化和纹理信息的组合度量成比例地调整。 IR速率可以使用需要较低实现复杂度的紧密形式的解决方案来确定。例如，这种关闭形式不需要迭代或穷尽搜索。另外，可以在执行运动估计和补偿之前可用的参数来确定IR速率。

8.

发明授权
3D video encoding 有权
标题翻译： 3D视频编码

公开(公告)号：US08594180B2

公开(公告)日：2013-11-26

申请号：US11677335

申请日：2007-02-21

申请人： Kai Chieh Yang , Haohong Wang , Khaled Helmi El-Maleh , Sharath Manjunath

发明人： Kai Chieh Yang , Haohong Wang , Khaled Helmi El-Maleh , Sharath Manjunath

IPC分类号： G06F21/00

CPC分类号： H04N19/194 , H04N13/122 , H04N19/124 , H04N19/147 , H04N19/149 , H04N19/172 , H04N19/176 , H04N19/597 , H04N19/61

摘要： A stereo 3D video frame includes left and right components that are combined to produce a stereo image. For a given amount of distortion, the left and right components may have different impacts on perceptual visual quality of the stereo image due to asymmetry in the distortion response of the human eye. A 3D video encoder adjusts an allocation of coding bits between left and right components of the 3D video based on a frame-level bit budget and a weighting between the left and right components. The video encoder may generate the bit allocation in the rho (ρ) domain. The weighted bit allocation may be derived based on a quality metric that indicates overall quality produced by the left and right components. The weighted bit allocation compensates for the asymmetric distortion response to reduce overall perceptual distortion in the stereo image and thereby enhance or maintain visual quality.

摘要翻译： 立体3D视频帧包括组合以产生立体图像的左和右组件。对于给定量的失真，由于人眼的失真响应的不对称，左和右分量可能对立体图像的感知视觉质量具有不同的影响。 3D视频编码器基于帧级比特预算和左右分量之间的加权来调整3D视频的左和右分量之间的编码比特的分配。视频编码器可以在rho（rho）域中生成比特分配。可以基于指示左组件和右组件产生的总体质量的质量度量来导出加权比特分配。加权比特分配补偿非对称失真响应，以减少立体图像中的整体感知失真，从而增强或维持视觉质量。

9.

发明授权
Methods of performing error concealment for digital video 有权
标题翻译：对数字视频执行错误隐藏的方法

公开(公告)号：US08379734B2

公开(公告)日：2013-02-19

申请号：US11690132

申请日：2007-03-23

申请人： Chia-Yuan Teng , Sharath Manjunath

发明人： Chia-Yuan Teng , Sharath Manjunath

IPC分类号： H04N7/68

CPC分类号： H04N19/00939 , H04N19/107 , H04N19/117 , H04N19/142 , H04N19/159 , H04N19/166 , H04N19/174 , H04N19/895

摘要： Error concealment is used to hide the effects of errors detected within digital video information. A complex error concealment mode decision is disclosed to determine whether spatial error concealment (SEC) or temporal error concealment (TEC) should be used. The error concealment mode decision system uses different methods depending on whether the damaged frame is an intra-frame or an inter-frame. If the video frame is an intra-frame then a similarity metric is used to determine if the intra-frame represents a scene-change or not. If the video frame is an intra-frame, a complex multi-termed equation is used to determine whether SEC or TEC should be used. A novel spatial error concealment technique is disclosed for use when the error concealment mode decision determines that spatial error concealment should be used for reconstruction. The novel spatial error concealment technique divides a corrupt macroblock into four different regions, a corner region, a row adjacent to the corner region, a column adjacent to the corner region, and a remainder main region. Those regions are then reconstructed in that order and information from earlier reconstructed regions may be used in later reconstructed regions. Finally, a macroblock refreshment technique is disclosed for preventing error propagation from harming non-corrupt inter-blocks. Specifically, an inter-macroblock may be ‘refreshed’ using spatial error concealment if there has been significant error caused damage that may cause the inter-block to propagate the errors.

摘要翻译： 错误隐藏用于隐藏数字视频信息中检测到的错误的影响。公开了一种复杂的错误隐藏模式决定，以确定是否应使用空间误差隐藏（SEC）或时间误差隐藏（TEC）。错误隐藏模式决策系统使用不同的方法，取决于损坏的帧是帧内还是帧间。如果视频帧是帧内帧，则使用相似性度量来确定帧内是否表示场景改变。如果视频帧是帧内帧，则使用复数多方程来确定是否应使用SEC或TEC。当错误隐藏模式决定确定空间误差隐藏应用于重建时，公开了一种新颖的空间误差隐藏技术。新颖的空间误差隐藏技术将腐败的宏块分为四个不同的区域，一个角区域，一个与拐角区域相邻的一行，一个邻近拐角区域的列以及一个剩余的主区域。然后按照该顺序重建那些区域，并且可以在稍后的重建区域中使用来自较早重建区域的信息。最后，公开了一种宏块刷新技术，用于防止错误传播损害非损坏的块间。具体地，如果存在可能导致块间传播错误的严重错误引起的损坏，则可以使用空间错误隐藏来刷新宏块间宏块。

10.

发明授权
Systems, methods, and apparatus for computationally efficient, iterative alignment of speech waveforms 有权
标题翻译：系统，方法和设备，用于语音波形的计算高效，迭代对齐

公开(公告)号：US08145477B2

公开(公告)日：2012-03-27

申请号：US11566039

申请日：2006-12-01

申请人： Sharath Manjunath , Ananthapadmanabhan A. Kandhadai

发明人： Sharath Manjunath , Ananthapadmanabhan A. Kandhadai

IPC分类号： G10L11/00 , G06F17/15

CPC分类号： G10L19/097

摘要： Systems, methods, and apparatus described include waveform alignment operations in which a single set of evaluated cosines and sines is used to calculate cross-correlations of two periodic waveforms at two different phase shifts.

摘要翻译： 所描述的系统，方法和装置包括波形对准操作，其中使用单组估计的余弦和正弦来计算两个不同相移处的两个周期波形的互相关。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类