Method, system and apparatus for providing signal based packet loss concealment for memoryless codecs
    11.
    Granted patent
    Legal status: in force

    Publication No.: US07929520B2

    Publication date: 2011-04-19

    Application No.: US12113964

    Filing date: 2008-05-02

    Applicant: Dunling Li

    Inventor: Dunling Li

    IPC classification: H04L12/66

    Abstract: In a method, apparatus and system for transmitting packet loss concealment (PLC) information, a subscriber device divides a voice sample into a plurality of packets, each including a plurality of successive frames having portions of the voice sample. The subscriber device determines if a predetermined look ahead time duration from the final frame of the plurality of successive frames in a current packet of the plurality of packets includes a noise to voice transition. When the predetermined look ahead time duration is determined to include the noise to voice transition, the subscriber device packs information regarding the predetermined look ahead time duration into the current packet. Finally, the subscriber device encodes the plurality of successive frames into the current packet for transmission.
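The sender-side decision in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `has_noise_to_voice_transition` and `build_packet`, the per-frame voice flags, and the choice to attach the look-ahead frames themselves as the packed information are all assumptions made for illustration.

```python
def has_noise_to_voice_transition(voiced_flags):
    """True if any frame flagged as noise is directly followed by a voiced frame."""
    return any(not a and b for a, b in zip(voiced_flags, voiced_flags[1:]))


def build_packet(frames, voiced, start, frames_per_packet, lookahead):
    """Build one packet; attach look-ahead data only on a noise-to-voice onset.

    frames: list of frame payloads (hypothetical representation)
    voiced: list of booleans, one per frame (True = voice, False = noise)
    """
    end = start + frames_per_packet
    packet = {"frames": frames[start:end], "lookahead": None}
    # Window runs from the packet's final frame through the look-ahead frames.
    window = voiced[end - 1:end + lookahead]
    if has_noise_to_voice_transition(window):
        # Assumption: the packed information is the look-ahead frames themselves.
        packet["lookahead"] = frames[end:end + lookahead]
    return packet
```

With this layout a receiver that loses the next packet would still hold the start of the voice onset, which is where a memoryless codec has no past signal to extrapolate from.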

    Speech coder and method
    12.
    Granted patent
    Legal status: in force

    Publication No.: US07386447B2

    Publication date: 2008-06-10

    Application No.: US10287572

    Filing date: 2002-11-04

    IPC classification: G10L21/00

    CPC classification: G10L19/07

    Abstract: LSF quantization in G.729 Annex B speech encoding has an overflow problem that may lead to non-assignment of a codebook index. Preferred embodiments fix the problem with default or limited random variable assignments, or by flagging the overflow and adjusting the frame encoding, such as by limiting spectral components or changing quantization targets.

    Memory optimization packet loss concealment in a voice over packet network

    Publication No.: US20060182086A1

    Publication date: 2006-08-17

    Application No.: US11057638

    Filing date: 2005-02-14

    IPC classification: H04L12/66

    CPC classification: H04L65/80

    Abstract: A method to reduce memory requirements for a packet loss concealment algorithm in the event of packet loss in a receiver of pulse code modulated voice signals. A voice playout unit in the receiver shares its nominal delay buffer with a history buffer of a packet loss concealment algorithm up to a maximum limit described in a standard. This reduces or eliminates the need to allocate memory for the history buffer. A history buffer can also be extended to retain an original portion of voice signal packets received prior to a packet loss as well as generated voice signals as they are generated. A scratch buffer is used as a working buffer and replaces the function of a pitch buffer.

    Voice activity identification for speaker tracking in a packet based conferencing system with distributed processing
    14.
    Granted patent
    Legal status: in force

    Publication No.: US07020257B2

    Publication date: 2006-03-28

    Application No.: US10123483

    Filing date: 2002-04-17

    Applicant: Dunling Li

    Inventor: Dunling Li

    IPC classification: G10L21/02

    Abstract: A distributed conferencing system has a plurality of conferencing nodes to connect groups of participants to a conference. Each of the conferencing nodes provides for the connection of one or more participants to the conference. Each node includes a DSP for distributed signal processing. The node DSP includes a signal measuring device for measuring features of the signals from each of the participants, such as power, zero crossing rate and short term energy. The nodes include voice activity determination and a communication device for communicating the measured signal characteristics for a plurality of participant input signals to all other conferencing nodes. Muting means mute individual participant input signals so that only selected signals are transmitted over the conference bus to the other participants. The voice activity detection utilizes a state machine with three states (voice state, transition state and noise state), dependent upon the measured energy level, zero crossing rate and other features of the signals. A high energy threshold and a low energy threshold, zero crossing rates, average energies, energy level means and variances, and other features are used in differentiating voice and noise. The state machine will not move directly from voice to noise state but will move to a transition state first, to reduce the likelihood of misclassification of a weak voice signal as noise and to avoid the frequent clipping that can be caused if the state machine moves to noise state during brief pauses in voice.
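The three-state behaviour described in this abstract can be illustrated with a minimal Python sketch. It uses frame energy only and leaves out zero crossing rate and the other features the abstract mentions. The names `State`, `step`, `track`, the `hangover` frame count, and the specific transition rules are assumptions for illustration, not the patent's actual logic.

```python
from enum import Enum


class State(Enum):
    NOISE = "noise"
    TRANSITION = "transition"
    VOICE = "voice"


def step(state, quiet_frames, energy, low_thr, high_thr, hangover=3):
    """Advance the state machine by one frame; returns (state, quiet_frames)."""
    if energy >= high_thr:
        return State.VOICE, 0
    if state is State.VOICE:
        # Voice never drops straight to noise; it passes through TRANSITION.
        return State.TRANSITION, 0
    if state is State.TRANSITION:
        if energy < low_thr:
            quiet_frames += 1
            if quiet_frames >= hangover:
                return State.NOISE, 0
            return State.TRANSITION, quiet_frames
        # Energy between the two thresholds: keep waiting, reset the count.
        return State.TRANSITION, 0
    return State.NOISE, 0


def track(energies, low_thr, high_thr, hangover=3):
    """Classify a sequence of frame energies, starting from the noise state."""
    state, quiet = State.NOISE, 0
    states = []
    for energy in energies:
        state, quiet = step(state, quiet, energy, low_thr, high_thr, hangover)
        states.append(state)
    return states
```

In this sketch a brief pause between two loud frames stays in the transition state and returns to voice, so the pause is not treated as noise.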

    Adaptive two-threshold method for discriminating noise from speech in a communication signal
    15.
    Granted patent
    Legal status: in force

    Publication No.: US06381570B2

    Publication date: 2002-04-30

    Application No.: US09249108

    Filing date: 1999-02-12

    IPC classification: G10L15/00

    CPC classification: G10L19/18 G10L2021/02168

    Abstract: A method of discriminating noise and voice energy in a communication signal. A signal is measured in a plurality of block periods, which are sampled to obtain a measurement of the block energy value for the signal. The blocks are compared to a noise threshold and to a voice threshold to discriminate between noise and voice. The thresholds for noise and voice are periodically updated based on the minimum and maximum energy levels measured for block energies. In a preferred embodiment, the voice energy threshold and noise energy threshold values are updated according to a formula where the revised thresholds are based upon a factor of the minimum and maximum energy levels of the current block and the most recent past block and the average energy of the previous blocks. Updating of threshold levels allows for more accurate estimation of noise and voice during changes in either noise, voice or both to avoid misclassification of noise and/or voice.
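A minimal Python sketch of a two-threshold classifier with periodic threshold updates follows. The abstract does not give the update formula, so the rule here (placing each threshold at a fixed fraction of the span between the minimum and maximum block energy) is an assumption, as are the names `block_energy`, `update_thresholds`, `classify` and the default fractions.

```python
def block_energy(samples):
    """Mean squared amplitude of one block of samples."""
    return sum(s * s for s in samples) / len(samples)


def update_thresholds(energies, noise_frac=0.1, voice_frac=0.4):
    """Recompute (noise_thr, voice_thr) from recent block energies.

    Hypothetical rule: each threshold sits at a fixed fraction of the way
    from the minimum to the maximum observed block energy.
    """
    e_min, e_max = min(energies), max(energies)
    span = e_max - e_min
    return e_min + noise_frac * span, e_min + voice_frac * span


def classify(energy, noise_thr, voice_thr):
    """Label a block as noise, voice, or uncertain (between the thresholds)."""
    if energy <= noise_thr:
        return "noise"
    if energy >= voice_thr:
        return "voice"
    return "uncertain"
```

Calling `update_thresholds` on a sliding window of recent block energies lets both thresholds follow changes in background noise or speech level.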

    Memory optimization packet loss concealment in a voice over packet network
    17.
    Granted patent
    Legal status: in force

    Publication No.: US07590047B2

    Publication date: 2009-09-15

    Application No.: US11057638

    Filing date: 2005-02-14

    IPC classification: H04L12/26

    CPC classification: H04L65/80

    Abstract: A method to reduce memory requirements for a packet loss concealment algorithm in the event of packet loss in a receiver of pulse code modulated voice signals. A voice playout unit in the receiver shares its nominal delay buffer with a history buffer of a packet loss concealment algorithm up to a maximum limit described in a standard. This reduces or eliminates the need to allocate memory for the history buffer. A history buffer can also be extended to retain an original portion of voice signal packets received prior to a packet loss as well as generated voice signals as they are generated. A scratch buffer is used as a working buffer and replaces the function of a pitch buffer.

    METHOD, SYSTEM AND APPARATUS FOR PROVIDING SIGNAL BASED PACKET LOSS CONCEALMENT FOR MEMORYLESS CODECS
    18.
    Patent application
    Legal status: in force

    Publication No.: US20090059806A1

    Publication date: 2009-03-05

    Application No.: US12113964

    Filing date: 2008-05-02

    Applicant: Dunling Li

    Inventor: Dunling Li

    IPC classification: H04L12/26

    Abstract: In a method, apparatus and system for transmitting packet loss concealment (PLC) information, a subscriber device divides a voice sample into a plurality of packets, each including a plurality of successive frames having portions of the voice sample. The subscriber device determines if a predetermined look ahead time duration from the final frame of the plurality of successive frames in a current packet of the plurality of packets includes a noise to voice transition. When the predetermined look ahead time duration is determined to include the noise to voice transition, the subscriber device packs information regarding the predetermined look ahead time duration into the current packet. Finally, the subscriber device encodes the plurality of successive frames into the current packet for transmission.

    Method and Apparatus for Processing Analytical-Form Compression Noise in Images with Known Statistics
    19.
    Patent application
    Legal status: in force

    Publication No.: US20080013847A1

    Publication date: 2008-01-17

    Application No.: US11621889

    Filing date: 2007-01-10

    Applicant: Dunling Li

    Inventor: Dunling Li

    IPC classification: G06K9/40 G06K9/36

    Abstract: Embodiments of the invention provide methods to calculate compression noise statistics of decompressed images in transform coding. They can be used in compressed image quality assessment, compression algorithm optimization, compression noise reduction, and other quantization and compression related applications.

    Tone, Modulated Tone, and Saturated Tone Detection in a Voice Activity Detection Device
    20.
    Patent application
    Legal status: in force

    Publication No.: US20070291928A1

    Publication date: 2007-12-20

    Application No.: US11846951

    Filing date: 2007-08-29

    Applicant: Dunling Li

    Inventor: Dunling Li

    IPC classification: H04M1/00

    CPC classification: H04Q1/44 G10L25/78

    Abstract: In a voice activity detection (VAD) device, a method for defining tone signals comprises defining a threshold for zero amplitude change, calculating a zero crossing rate of a signal, extracting a set of parameters from a plurality of duration periods of the signal, defining a tolerance threshold between the plurality of duration periods when a zero amplitude change occurs, calculating a maximum difference between the plurality of duration periods, and comparing the maximum difference with the threshold. The method is implemented in the International Telecommunication Union (ITU) Recommendation G.729 Annex B VAD.
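One loose reading of the duration comparison in this abstract is sketched below in Python: measure the intervals between successive zero crossings and treat the signal as tone-like when the largest difference between those intervals stays within a tolerance. The names `zero_crossing_intervals` and `looks_like_tone`, the sign-change test, and the default tolerance of one sample are assumptions. The sketch does not reproduce the G.729 Annex B integration or the modulated and saturated tone cases.

```python
def zero_crossing_intervals(signal):
    """Sample counts between successive sign changes in the signal."""
    crossings = [
        i for i in range(1, len(signal))
        if (signal[i - 1] < 0) != (signal[i] < 0)
    ]
    return [b - a for a, b in zip(crossings, crossings[1:])]


def looks_like_tone(signal, tolerance=1):
    """True if all zero-crossing intervals agree to within `tolerance` samples."""
    intervals = zero_crossing_intervals(signal)
    if len(intervals) < 2:
        return False  # too few crossings to compare durations
    # The maximum difference between any two durations is max minus min.
    return max(intervals) - min(intervals) <= tolerance
```

A steady sinusoid produces evenly spaced zero crossings, while speech and noise produce irregular spacing.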
