Method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions
    11.
    发明授权
    Method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions 有权
    在预测语音编码器中使用编码方案选择模式以降低对帧错误状况的灵敏度的方法和装置

    公开(公告)号:US06438518B1

    公开(公告)日:2002-08-20

    申请号:US09429754

    申请日:1999-10-28

    IPC分类号: G10L1904

    CPC分类号: G10L19/18 G10L19/02

    摘要: A method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions includes a speech coder configured to select from among various predictive coding modes. After a predefined number of speech frames have been predictively coded, the speech coder codes one frame with a nonpredictive coding mode or a mildly predictive coding mode. The predefined number of frames can be determined in advance from the subjective standpoint of a listener. The predefined number of frames may be varied periodically. An average coding bit rate may be maintained for the speech coder by ensuring that an average coding bit rate is maintained for each successive pattern, or group, of predictively coded speech frames including at least one nonpredictively coded or mildly predictively coded speech frame.

    摘要翻译: 一种用于在预测语音编码器中使用编码方案选择模式以降低对帧错误状况的灵敏度的方法和装置包括配置成从各种预测编码模式中进行选择的语音编码器。 在预定数量的语音帧已被预测编码之后,语音编码器以非预测编码模式或轻度预测编码模式对一帧进行编码。 可以从收听者的主观角度预先确定预定数量的帧。 预定数量的帧可以周期性地改变。 可以通过确保针对包括至少一个非预测编码或温和预测编码的语音帧的预测编码语音帧的每个连续模式或组维持平均编码比特率来维持语音编码器的平均编码比特率。

    Method and apparatus for maintaining a target bit rate in a speech coder
    12.
    发明授权
    Method and apparatus for maintaining a target bit rate in a speech coder 有权
    用于在语音编码器中维持目标比特率的方法和装置

    公开(公告)号:US06330532B1

    公开(公告)日:2001-12-11

    申请号:US09356493

    申请日:1999-07-19

    IPC分类号: G10L2104

    CPC分类号: G10L19/002 G10L19/18

    摘要: A method and apparatus for maintaining a target bit rate in a speech coder includes a speech coder for encoding a frame at a preselected encoding rate, computing a running average bit rate for a predefined number of encoded frames, subtracting the running average bit rate from a predefined target average bit rate, and dividing the difference by the preselected encoding rate. If the quotient value is negative, a predefined number of possible occurrence counts of speech coder performance threshold values that are less than a current performance threshold value is accumulated, the accumulated number being greater than the absolute value of the quotient. The product of a decrement-per-occurrence-count-value and the predefined number of occurrence counts is subtracted from the current performance threshold value to obtain a new performance threshold value. If the quotient value is positive, a predefined number of possible occurrence counts of speech coder performance threshold values that are greater than the current performance threshold value is accumulated, the accumulated number being greater than the quotient. The product of an increment-per-occurrence-count-value and the predefined number of occurrence counts is added to the current performance threshold value to obtain a new performance.

    摘要翻译: 用于在语音编码器中维持目标比特率的方法和装置包括语音编码器,用于以预先选择的编码速率对帧进行编码,计算预定数量编码帧的运行平均比特率,从 预定义的目标平均比特率,并且将差除以预选的编码率。 如果商值为负,则累积小于当前性能阈值的语音编码器性能阈值的预定数量的可能发生计数,累积数大于商的绝对值。 从当前性能阈值中减去每次出现计数值递减和预定发生次数的乘积,以获得新的性能阈值。 如果商值为正,则累积大于当前性能阈值的语音编码器性能阈值的预定数量的可能发生计数,累积数大于商。 将每个出现次数增量值和预定发生次数的乘积加到当前性能阈值以获得新的性能。

    Method and apparatus for generating and encoding line spectral square
roots
    13.
    发明授权
    Method and apparatus for generating and encoding line spectral square roots 失效
    用于生成和编码线谱平方根的方法和装置

    公开(公告)号:US5754733A

    公开(公告)日:1998-05-19

    申请号:US509848

    申请日:1995-08-01

    CPC分类号: G10L19/07

    摘要: A novel and improved method and apparatus for encoding line predictive coding (LPC) data in a speech compression system using line spectral square root values is disclosed. A novel and computationally efficient procedure for determining the set of quantization sensitivities for the line spectral square root values is disclosed, which results in a computationally efficient error measure for use in vector quantization of the line spectral square root values. A novel method of weighting the quantization error is disclosed, which accumulates the quantization error in each line spectral square root value and weights that error by the sensitivity of that line spectral square root value.

    摘要翻译: 公开了一种用于使用线谱平方根值在语音压缩系统中对行预测编码(LPC)数据进行编码的新颖和改进的方法和装置。 公开了一种用于确定线谱平方根值的量化灵敏度集合的新颖且计算上有效的过程,其导致在线谱平方根值的矢量量化中使用的计算有效的误差测量。 公开了一种加权量化误差的新颖方法,其在每个线谱平方根中累积量化误差,并通过该线谱平方根值的灵敏度对该误差进行加权。

    Methods of performing spatial error concealment for digital video
    15.
    发明授权
    Methods of performing spatial error concealment for digital video 失效
    数字视频空间误差隐藏方法

    公开(公告)号:US08526507B2

    公开(公告)日:2013-09-03

    申请号:US13616756

    申请日:2012-09-14

    IPC分类号: H04N7/26

    摘要: Error concealment is used to hide the effects of errors detected within digital video information. A novel spatial error concealment technique is disclosed for use when the error concealment mode decision determines that spatial error concealment should be used for reconstruction. The novel spatial error concealment technique divides a corrupt macroblock into multiple regions, such as, a corner region, a row adjacent to the corner region, a column adjacent to the corner region, and a remainder main region. Those regions are then reconstructed and information from earlier reconstructed regions may be used in later reconstructed regions. Finally, a macroblock refreshment technique is disclosed for preventing error propagation from harming non-corrupt inter-blocks. Specifically, an inter-macroblock may be ‘refreshed’ using spatial error concealment if there has been significant error caused damage that may cause the inter-block to propagate the errors.

    摘要翻译: 错误隐藏用于隐藏数字视频信息中检测到的错误的影响。 当错误隐藏模式决定确定空间误差隐藏应用于重建时,公开了一种新颖的空间误差隐藏技术。 新颖的空间误差隐藏技术将损坏的宏块划分成多个区域,例如角区域,与角区域相邻的行,与角区域相邻的列,以及余数主区域。 然后重建那些区域,并且可以在稍后的重建区域中使用来自较早重建区域的信息。 最后,公开了一种宏块刷新技术,用于防止错误传播损害非破坏的块间。 具体地说,如果存在可能导致块间传播错误的严重错误导致的损坏,则可以使用空间错误隐藏来“刷新”宏块间宏块。

    Stereo image and video directional mapping of offset
    16.
    发明授权
    Stereo image and video directional mapping of offset 有权
    立体图像和视频方向映射偏移

    公开(公告)号:US08456515B2

    公开(公告)日:2013-06-04

    申请号:US11493434

    申请日:2006-07-25

    IPC分类号: H04N13/00

    摘要: A method and apparatus for generating stereoscopic images of a scene is described. The apparatus may have a first image sensor, a second image sensor spaced apart from the first image sensor, a diversity combine module to combine image data from the first and second image sensors, and an image processing module configured to process combined image data from the diversity combine module may be used to generate stereoscopic images of a scene.

    摘要翻译: 描述用于产生场景的立体图像的方法和装置。 该装置可以具有第一图像传感器,与第一图像传感器间隔开的第二图像传感器,用于组合来自第一和第二图像传感器的图像数据的分集组合模块,以及被配置为处理来自第一图像传感器的组合图像数据的图像处理模块 分集组合模块可用于生成场景的立体图像。

    Tandem-free intersystem voice communication
    17.
    发明授权
    Tandem-free intersystem voice communication 有权
    无串联系统间语音通信

    公开(公告)号:US08432935B2

    公开(公告)日:2013-04-30

    申请号:US12181972

    申请日:2008-07-29

    IPC分类号: H04J3/16 G10L11/06

    CPC分类号: G10L19/173 H04W88/181

    摘要: Techniques are presented herein to provide tandem-free operation between two wireless terminals through two otherwise incompatible wireless networks. Specifically, embodiments provide tandem-free operation between a wireless terminal communicating through a continuous transmission (CTX) wireless channel to a wireless terminal communicating through a discontinuous transmission (DTX) wireless channel. In a first aspect, inactive speech frames are translated between DTX and CTX formats. In a second aspect, each wireless terminal includes an active speech decoder that is compatible with the active speech encoder on the opposite end of the mobile-to-mobile connection.

    摘要翻译: 本文提供了技术,以通过两个否则不兼容的无线网络在两个无线终端之间提供无串联操作。 具体地,实施例在通过连续传输(CTX)无线信道通信到通过不连续传输(DTX)无线信道进行通信的无线终端的无线终端之间提供无串联操作。 在第一方面,非活动语音帧在DTX和CTX格式之间被转换。 在第二方面,每个无线终端包括与移动到移动连接的相对端上的活动语音编码器兼容的活动语音解码器。

    Methods of Performing Spatial Error Concealment For Digital Video
    18.
    发明申请
    Methods of Performing Spatial Error Concealment For Digital Video 失效
    执行数字视频空间误差隐藏的方法

    公开(公告)号:US20130010876A1

    公开(公告)日:2013-01-10

    申请号:US13616756

    申请日:2012-09-14

    IPC分类号: H04N7/26

    摘要: Error concealment is used to hide the effects of errors detected within digital video information. A novel spatial error concealment technique is disclosed for use when the error concealment mode decision determines that spatial error concealment should be used for reconstruction. The novel spatial error concealment technique divides a corrupt macroblock into multiple regions, such as, a corner region, a row adjacent to the corner region, a column adjacent to the corner region, and a remainder main region. Those regions are then reconstructed and information from earlier reconstructed regions may be used in later reconstructed regions. Finally, a macroblock refreshment technique is disclosed for preventing error propagation from harming non-corrupt inter-blocks. Specifically, an inter-macroblock may be ‘refreshed’ using spatial error concealment if there has been significant error caused damage that may cause the inter-block to propagate the errors.

    摘要翻译: 错误隐藏用于隐藏数字视频信息中检测到的错误的影响。 当错误隐藏模式决定确定空间误差隐藏应用于重建时,公开了一种新颖的空间误差隐藏技术。 新颖的空间误差隐藏技术将损坏的宏块划分成多个区域,例如角区域,与角区域相邻的行,与角区域相邻的列,以及余数主区域。 然后重建那些区域,并且可以在稍后的重建区域中使用来自较早重建区域的信息。 最后,公开了一种宏块刷新技术,用于防止错误传播损害非破坏的块间。 具体地,如果存在可能导致块间传播错误的严重错误引起的损坏,则可以使用空间错误隐藏来刷新宏块间宏块。

    Selection of encoding modes and/or encoding rates for speech compression with open loop re-decision
    20.
    发明授权
    Selection of encoding modes and/or encoding rates for speech compression with open loop re-decision 有权
    使用开环重新决定来选择语音压缩的编码模式和/或编码率

    公开(公告)号:US08090573B2

    公开(公告)日:2012-01-03

    申请号:US11625797

    申请日:2007-01-22

    IPC分类号: G10L19/00

    CPC分类号: G10L19/22

    摘要: In a device configurable to encode speech performing an open loop re-decision may comprise representing a speech signal by amplitude components and phase components for a current frame and a past frame. During the current frame, there may be an extraction of uncompressed amplitude components and uncompressed phase components. The amplitude components and the phase components from the past frame may then be retrieved. A set of features may be generated based on the uncompressed amplitude components from the current frame, the uncompressed phase components from the current frame, the amplitude components from the past frame, and the phase components from the past frame. The set of features may be checked as part of the open loop re-decision, and determining a final encoding decision based on the checking may be performed. The final encoding decision may be an encoding mode and/or encoding rate.

    摘要翻译: 在可配置为对执行开环重新判定的语音进行编码的装置中的装置可以包括通过当前帧和过去帧的幅度分量和相位分量表示语音信号。 在当前帧中,可以提取未压缩幅度分量和未压缩相位分量。 然后可以检索来自过去帧的幅度分量和相位分量。 可以基于来自当前帧的未压缩幅度分量,来自当前帧的未压缩相位分量,来自过去帧的幅度分量和来自过去帧的相位分量来生成一组特征。 可以将这组特征作为开环重新判定的一部分进行检查,并且可以执行基于检查来确定最终编码决定。 最终编码决定可以是编码模式和/或编码速率。