Audio time scale modification algorithm for dynamic playback speed control
    31.
    发明授权
    Audio time scale modification algorithm for dynamic playback speed control 失效
    用于动态播放速度控制的音频时标修改算法

    公开(公告)号:US08078456B2

    公开(公告)日:2011-12-13

    申请号:US12119033

    申请日:2008-05-12

    IPC分类号: G10L19/00

    CPC分类号: G10L21/04

    摘要: A modified synchronized overlap add (SOLA) algorithm for performing high-quality, low-complexity audio time scale modification (TSM) is described. The algorithm produces good output audio quality with a very low complexity and without producing additional audible distortion during dynamic change of the audio playback speed. The algorithm may achieve complexity reduction by performing the maximization of normalized cross-correlation using decimated signals. By updating the input buffer and the output buffer in a precise sequence with careful checking of the appropriate array bounds, the algorithm may also achieve seamless audio playback during dynamic speed change with a minimal requirement on memory usage.

    摘要翻译: 描述了用于执行高质量,低复杂度的音频时标修正(TSM)的修改的同步重叠加法(SOLA)算法。 该算法以非常低的复杂度产生良好的输出音频质量,并且在音频播放速度的动态变化期间不产生额外的可听失真。 该算法可以通过使用抽取信号执行归一化互相关的最大化来实现复杂度降低。 通过仔细检查适当的阵列边界,以精确的顺​​序更新输入缓冲器和输出缓冲器,该算法还可以在动态速度改变期间实现无缝音频播放,同时对存储器使用的要求最低。

    PACKET LOSS CONCEALMENT FOR SUB-BAND PREDICTIVE CODING BASED ON EXTRAPOLATION OF SUB-BAND AUDIO WAVEFORMS
    32.
    发明申请
    PACKET LOSS CONCEALMENT FOR SUB-BAND PREDICTIVE CODING BASED ON EXTRAPOLATION OF SUB-BAND AUDIO WAVEFORMS 有权
    基于子带音频波形扩展的子带预测编码的分组丢失隐藏

    公开(公告)号:US20090240492A1

    公开(公告)日:2009-09-24

    申请号:US12474855

    申请日:2009-05-29

    IPC分类号: G10L19/00

    摘要: A technique is described for concealing the effect of a lost frame in a series of frames representing an encoded audio signal in a sub-band predictive coding system. In accordance with the technique, a first synthesized sub-band audio signal is synthesized, wherein synthesizing the first synthesized sub-band audio signal comprises performing waveform extrapolation based on a stored first sub-band decoded audio signal. A second synthesized sub-band audio signal is also synthesized, wherein synthesizing the second synthesized sub-band audio signal comprises performing waveform extrapolation based on the stored second sub-band decoded audio signal. The first synthesized sub-band audio signal and the second synthesized sub-band audio signal are combined to generate a synthesized full-band output audio signal corresponding to a lost frame.

    摘要翻译: 描述了一种用于在子带预测编码系统中隐藏表示编码音频信号的一系列帧中的丢失帧的影响的技术。 根据该技术,合成第一合成子带音频信号,其中合成第一合成子带音频信号包括基于存储的第一子带解码音频信号执行波形外推。 还合成了第二合成子带音频信号,其中合成第二合成子带音频信号包括基于所存储的第二子带解码音频信号执行波形外推。 第一合成子带音频信号和第二合成子带音频信号被组合以产生对应于丢失帧的合成全频带输出音频信号。

    Re-phasing of Decoder States After Packet Loss
    33.
    发明申请
    Re-phasing of Decoder States After Packet Loss 有权
    数据包丢失后重新分解解码器状态

    公开(公告)号:US20080046237A1

    公开(公告)日:2008-02-21

    申请号:US11838905

    申请日:2007-08-15

    IPC分类号: G10L21/00

    摘要: A technique is described herein for updating a state of a decoder configured to decode a series of frames representing an encoded audio signal. In accordance with the technique, an output audio signal associated with a lost frame in the series of frames is synthesized. The decoder state is set to align with the synthesized output audio signal at a frame boundary. An extrapolated signal is generated based on the synthesized output audio signal. A time lag is calculated between the extrapolated signal and a decoded audio signal associated with a first received frame after the lost frame in the series of frames, wherein the time lag represents a phase difference between the extrapolated signal and the decoded audio signal. The decoder state is then reset based on the time lag.

    摘要翻译: 本文描述了一种用于更新被配置为对表示编码音频信号的一系列帧进行解码的解码器的状态的技术。 根据该技术,合成与一系列帧中的丢失帧相关联的输出音频信号。 解码器状态被设置为与帧边界处的合成输出音频信号对准。 基于合成的输出音频信号产生外插信号。 在所述一系列帧中的丢失帧之后,在外推信号和与第一接收帧相关联的解码音频信号之间计算时间滞后,其中所述时间延迟表示外推信号和解码音频信号之间的相位差。 然后根据时间滞后重新设置解码器状态。

    Constrained and Controlled Decoding After Packet Loss
    34.
    发明申请
    Constrained and Controlled Decoding After Packet Loss 审中-公开
    丢包后约束和受控解码

    公开(公告)号:US20080046236A1

    公开(公告)日:2008-02-21

    申请号:US11838899

    申请日:2007-08-15

    IPC分类号: G10L21/02

    摘要: A technique is described herein for reducing audible artifacts in an audio output signal generated by decoding a received frame in a series of frames representing an encoded audio signal in a predictive coding system. In accordance with the technique, it is determined if the received frame is one of a predefined number of received frames that follow a lost frame in the series of the frames. Responsive to determining that the received frame is one of the predefined number of received frames, at least one parameter or signal associated with the decoding of the received frame is altered from a state associated with normal decoding. The received frame is then decoded in accordance with the at least one parameter or signal to generate a decoded audio signal. The audio output signal is then generated based on the decoded audio signal.

    摘要翻译: 本文描述了一种技术,用于通过在代表预测编码系统中的编码音频信号的一系列帧中对接收到的帧进行解码而产生的音频输出信号中减少可听见的伪影。 根据该技术,确定接收到的帧是否是在一系列帧中的丢失帧之后的预定数量的接收帧中的一个。 响应于确定接收到的帧是预定数量的接收帧之一,与所接收的帧的解码相关联的至少一个参数或信号从与正常解码相关联的状态改变。 接收的帧然后根据至少一个参数或信号被解码以产生解码的音频信号。 然后基于解码的音频信号产生音频输出信号。

    Packet Loss Concealment for Sub-band Predictive Coding Based on Extrapolation of Full-band Audio Waveform
    35.
    发明申请
    Packet Loss Concealment for Sub-band Predictive Coding Based on Extrapolation of Full-band Audio Waveform 审中-公开
    基于全波段音频波形外推的子带预测编码的丢包隐藏

    公开(公告)号:US20080046233A1

    公开(公告)日:2008-02-21

    申请号:US11838885

    申请日:2007-08-15

    IPC分类号: G10L11/00

    摘要: A technique for concealing the effect of a lost frame in a series of frames representing an encoded audio signal in a sub-band predictive coding system is provided. In accordance with the technique, one or more received frames in the series of frames are decoded to generate a full-band output audio signal, wherein the full-band output audio signal comprises a combination of at least a first sub-band decoded audio signal and a second sub-band decoded audio signal. The full-band output audio signal corresponding to the one or more received frames is stored. Then, a full-band output audio signal corresponding to the lost frame is synthesized, wherein synthesizing the full-band output audio signal corresponding to the lost frame comprises performing waveform extrapolation based on the stored full-band output audio signal corresponding to the one or more received frames.

    摘要翻译: 提供了一种用于在子带预测编码系统中隐藏表示编码音频信号的一系列帧中的丢失帧的影响的技术。 根据该技术,对一系列帧中的一个或多个接收帧进行解码以产生全频带输出音频信号,其中全频带输出音频信号包括至少第一子带解码音频信号 和第二子带解码音频信号。 存储对应于一个或多个接收帧的全频带输出音频信号。 然后,合成对应于丢失帧的全频带输出音频信号,其中合成对应于丢失帧的全频带输出音频信号包括基于所存储的全频带输出音频信号对应于一个或者 更多收到的帧。

    Packet Loss Concealment for a Sub-band Predictive Coder Based on Extrapolation of Excitation Waveform
    36.
    发明申请
    Packet Loss Concealment for a Sub-band Predictive Coder Based on Extrapolation of Excitation Waveform 有权
    基于激发波形外推的子带预测编码器的丢包隐藏

    公开(公告)号:US20080040122A1

    公开(公告)日:2008-02-14

    申请号:US11835716

    申请日:2007-08-08

    IPC分类号: G10L19/00

    CPC分类号: G10L19/0208 G10L19/005

    摘要: Systems and methods are described for performing packet loss concealment using an extrapolation of an excitation waveform in a sub-band predictive speech coder, such as an ITU-T Recommendation G.722 wideband speech coder. The systems and methods are useful for concealing the quality-degrading effects of packet loss in a sub-band predictive coder and address some sub-band architectural issues when applying excitation extrapolation techniques to such sub-band predictive coders.

    摘要翻译: 描述了使用在诸如ITU-T G.722建议书G.722宽带语音编码器的子带预测语音编码器中外推激励波形来执行分组丢失隐藏的系统和方法。 这些系统和方法对于隐藏子带预测编码器中的分组丢失的质量降级效应是有用的,并且当向这种子带预测编码器应用激励外推技术时,解决某些子带架构问题。

    Decimated Bisectional Pitch Refinement
    37.
    发明申请
    Decimated Bisectional Pitch Refinement 有权
    抽取二等分调幅

    公开(公告)号:US20080033585A1

    公开(公告)日:2008-02-07

    申请号:US11734824

    申请日:2007-04-13

    申请人: Robert W. Zopf

    发明人: Robert W. Zopf

    IPC分类号: G06F17/00

    CPC分类号: G10L25/90 G10L19/005

    摘要: A method and system for refining an estimated pitch period estimate based on a coarse pitch useful for performing frame loss concealment in an audio decoder as well as for other applications. A normalized correlation at the coarse pitch lag is computed and used as the current best candidate. The normalized correlation is then evaluated at the midpoint of the refinement pitch range on either side of the current best candidate. If the normalized correlation at either midpoint is greater than the current best lag, the midpoint with the maximum correlation is selected as the current best lag. After each iteration, the refinement range is decreased by a factor of two and centered on the current best lag. This bisectional search continues until the pitch has been refined to an acceptable tolerance or until the refinement range has been exhausted. During each step of the bisectional pitch refinement, the signal is decimated to reduce the complexity of computing the normalized correlation.

    摘要翻译: 一种用于基于用于在音频解码器中执行帧丢失隐藏以及用于其它应用的粗略音调来精确估计音调周期估计的方法和系统。 计算粗音调滞后的归一化相关,并将其用作当前最佳候选。 然后在当前最佳候选者的任一侧上的细化间距范围的中点评估归一化相关性。 如果在任一中点处的归一化相关性大于当前最佳滞后,则具有最大相关性的中点被选为当前最佳滞后。 每次迭代后,细化范围减少一倍,并以当前最佳滞后为中心。 这种二等分搜索继续进行,直到音调已经被精炼到可接受的容限,或者直到细化范围已经耗尽。 在二等分音调细化的每个步骤期间,信号被抽取以降低计算归一化相关性的复杂度。

    User attribute derivation and update for network/peer assisted speech coding
    38.
    发明授权
    User attribute derivation and update for network/peer assisted speech coding 有权
    网络/对等辅助语音编码的用户属性导出和更新

    公开(公告)号:US09058818B2

    公开(公告)日:2015-06-16

    申请号:US12887329

    申请日:2010-09-21

    申请人: Robert W. Zopf

    发明人: Robert W. Zopf

    摘要: Systems, methods and apparatuses are described for deriving and updating user attribute information about users of a communications system. A communications network is then used to transfer the user attribute information to communication terminals, which use the user attribute information to configure a speech codec to operate in a speaker-dependent manner during a communication session, thereby improving speech coding efficiency. In a network-assisted model, the user attribute information is stored on the communications network and selectively transmitted to the communication terminals while in a peer-assisted model, the user attribute information is derived by and transferred between communication terminals.

    摘要翻译: 描述了用于导出和更新关于通信系统的用户的用户属性信息的系统,方法和装置。 然后使用通信网络将用户属性信息传送到通信终端,通信终端使用用户属性信息来配置语音编解码器,以在通信会话期间以说话者相关的方式操作,从而提高语音编码效率。 在网络辅助模型中,用户属性信息存储在通信网络上,并且在对等辅助模型中选择性地发送到通信终端,用户属性信息由通信终端之间导出和传送。

    Network/peer assisted speech coding
    39.
    发明授权
    Network/peer assisted speech coding 有权
    网络/对等辅助语音编码

    公开(公告)号:US08818817B2

    公开(公告)日:2014-08-26

    申请号:US12901832

    申请日:2010-10-11

    IPC分类号: G10L21/00 G10L19/16

    摘要: A communications network is used to transfer user attribute information about participants in a communication session to their respective communication terminals for storage and use thereon to configure a speech codec to operate in a speaker-dependent manner, thereby improving speech coding efficiency. In a network-assisted model, the user attribute information is stored on the communications network and selectively transmitted to the communication terminals while in a peer-assisted model, the user attribute information is derived by and transferred between communication terminals.

    摘要翻译: 通信网络用于将通信会话中的参与者的用户属性信息传送到其各自的通信终端以进行存储和使用,以将语音编解码器配置为以说话者相关的方式操作,从而提高语音编码效率。 在网络辅助模型中,用户属性信息存储在通信网络上,并且在对等辅助模型中选择性地发送到通信终端,用户属性信息由通信终端之间导出和传送。

    Bit error concealment for audio coding systems
    40.
    发明授权
    Bit error concealment for audio coding systems 有权
    音频编码系统的位错误隐藏

    公开(公告)号:US08301440B2

    公开(公告)日:2012-10-30

    申请号:US12431155

    申请日:2009-04-28

    IPC分类号: G10L21/02 G10L19/00

    CPC分类号: G10L19/005

    摘要: A bit error concealment (BEC) system and method is described herein that detects and conceals the presence of click-like artifacts in an audio signal caused by bit errors introduced during transmission of the audio signal within an audio communications system. A particular embodiment of the present invention utilizes a low-complexity design that introduces no added delay and that is particularly well-suited for applications such as Bluetooth® wireless audio devices which have low cost and low power dissipation requirements.

    摘要翻译: 本文描述了一种错误隐藏(BEC)系统和方法,其检测和隐藏由音频通信系统内的音频信号传输期间引入的位错误引起的音频信号中出现的点击样伪像。 本发明的一个具体实施例利用低复杂度的设计,其不引入额外的延迟,并且特别适合于具有低成本和低功耗要求的蓝牙无线音频设备的应用。