System and method for winding audio content using a voice activity detection algorithm
    41.
    发明申请
    System and method for winding audio content using a voice activity detection algorithm 有权
    使用语音活动检测算法对音频内容进行绕线的系统和方法

    公开(公告)号:US20070112562A1

    公开(公告)日:2007-05-17

    申请号:US11274612

    申请日:2005-11-15

    IPC分类号: G10L11/06

    摘要: A system and method for locating a preferable playback start location after a winding or rewinding action in an audio playing device. In response to an adjustment of the playing location for audio content to a desired playing position, the system determines whether at least one non-speech or silent period of at least a predetermined duration exists within the vicinity of the desired playing position. If at least one such non-speech or silent period exists within the vicinity of the desired playing position, the system adjusts the playing position to fall within one of the at least one non-speech period or silent period.

    摘要翻译: 一种用于在音频播放设备中的卷绕或重绕动作之后定位优选回放开始位置的系统和方法。 响应于将音频内容的播放位置调整到期望的播放位置,系统确定在期望的播放位置附近是否存在至少一个预定持续时间的至少一个非语音或静音时段。 如果在期望的播放位置附近存在至少一个这样的非语音或静音时段,则系统将播放位置调整为落入至少一个非语音周期或无声时段之一。

    Multirate speech codecs
    43.
    发明授权
    Multirate speech codecs 有权
    多速率语音编解码器

    公开(公告)号:US06940967B2

    公开(公告)日:2005-09-06

    申请号:US10804099

    申请日:2004-03-19

    IPC分类号: G10L19/22 G10L19/00

    CPC分类号: G10L19/22

    摘要: A method of determining a codec mode for encoding a frame in a communications system, the method comprising the steps of: receiving a sequence of signal samples arranged in frames; analysing a current frame to select a codec mode appropriate for the current frame; predicting the characteristics of a subsequent frame using lookahead samples from the subsequent frame; and determining a codec mode for the current frame and the subsequent frame which suits the current frame and also suits a subsequent frame based on the predicted characteristics.

    摘要翻译: 一种确定用于对通信系统中的帧进行编码的编解码器模式的方法,所述方法包括以下步骤:接收以帧排列的信号样本序列; 分析当前帧以选择适合于当前帧的编解码器模式; 使用来自后续帧的前视采样来预测后续帧的特性; 以及基于所述预测特性,确定适合当前帧的当前帧和后续帧的编解码器模式,并适合后续帧。

    MULTIRATE SPEECH CODECS
    44.
    发明申请
    MULTIRATE SPEECH CODECS 有权
    多语音编解码器

    公开(公告)号:US20050143984A1

    公开(公告)日:2005-06-30

    申请号:US10804099

    申请日:2004-03-19

    IPC分类号: G10L19/22 G10L19/04

    CPC分类号: G10L19/22

    摘要: A method of determining a codec mode for encoding a frame in a communications system, the method comprising the steps of: receiving a sequence of signal samples arranged in frames; analysing a current frame to select a codec mode appropriate for the current frame; predicting the characteristics of a subsequent frame using lookahead samples from the subsequent frame; and determining a codec mode for the current frame and the subsequent frame which suits the current frame and also suits a subsequent frame based on the predicted characteristics.

    摘要翻译: 一种确定用于对通信系统中的帧进行编码的编解码器模式的方法,所述方法包括以下步骤:接收以帧排列的信号样本序列; 分析当前帧以选择适合于当前帧的编解码器模式; 使用来自后续帧的前视采样来预测后续帧的特性; 以及基于所述预测特性,确定适合当前帧的当前帧和后续帧的编解码器模式,并适合后续帧。

    Speech codecs
    45.
    发明申请
    Speech codecs 有权
    语音编解码器

    公开(公告)号:US20050102136A1

    公开(公告)日:2005-05-12

    申请号:US10804104

    申请日:2004-03-19

    CPC分类号: G10L19/012 G10L25/78

    摘要: A method of encoding speech in a communications system includes the steps of receiving a speech signal including voice signals and background signals, and detecting voice activity and providing an indicator when no voice activity is detected. The speech signal is encoded to generate a plurality of parameters representing the signal. When the indicator is not present, a first parametric representation of the speech signal is output, including the plurality of parameters. When the indicator is present, at least one of the plurality of parameters is modified and a second parametric representation of the speech signal, including the modified parameter is output.

    摘要翻译: 一种在通信系统中编码语音的方法包括以下步骤:接收包括语音信号和背景信号的语音信号,并且在没有检测到语音活动时检测语音活动并提供指示符。 语音信号被编码以产生表示该信号的多个参数。 当指示符不存在时,输出语音信号的第一参数表示,包括多个参数。 当指示符存在时,多个参数中的至少一个被修改,并且输出包括修改参数的语音信号的第二参数表示。

    Decoding method, speech coding processing unit and a network element
    46.
    发明授权
    Decoding method, speech coding processing unit and a network element 失效
    解码方法,语音编码处理单元和网元

    公开(公告)号:US06850883B1

    公开(公告)日:2005-02-01

    申请号:US09601827

    申请日:1998-02-09

    摘要: This invention is related to tandem free operation (TFO) in mobile cellular systems. The present invention implements a tandem free operation by using a special feedback loop which makes the decoded parameters available, performs the comfort noise insertion and bad frame handling operations, produces the parameter quantisation indices corresponding to the output of these operations, and synchronises the speech encoders and the speech decoders in the transmission path from the uplink mobile station to the downlink mobile station. This functionality is realized by partly decoding and re-encoding the parameters and synchronising and resetting the quantiser prediction memories in a specific manner. A basic idea of the invention is, that during BFH and CNI processes, a re-encoding block produces models of encoded speech parameters from the BFH/CNI processed speech parameters. These models of encoded speech parameters are then transmitted to the receiving end. The present invention provides a solution to the problem created by predictive, more generally non-stateless encoders in TFO operation.

    摘要翻译: 本发明涉及移动蜂窝系统中的串联自由操作(TFO)。 本发明通过使用使得解码参数可用,执行舒适噪声插入和不良帧处理操作的特殊反馈回路实现串联自由操作,产生与这些操作的输出对应的参数量化指标,并且使语音编码器 以及从上行链路移动台到下行移动站的传输路径中的语音解码器。 该功能通过部分解码和重新编码参数并以特定方式同步和重置量化器预测存储器来实现。 本发明的基本思想是,在BFH和CNI过程中,重新编码块从BFH / CNI处理的语音参数产生编码语音参数的模型。 然后将这些编码语音参数的模型发送到接收端。 本发明提供了在TFO操作中由预测的,更普遍的非无状态编码器产生的问题的解决方案。

    Method for transmitting background noise information in data transmission in data frames
    47.
    发明授权
    Method for transmitting background noise information in data transmission in data frames 有权
    在数据帧中数据传输中发送背景噪声信息的方法

    公开(公告)号:US06658064B1

    公开(公告)日:2003-12-02

    申请号:US09387369

    申请日:1999-08-31

    IPC分类号: G10L1106

    CPC分类号: H04L1/0072 H04L1/0071

    摘要: The invention relates to a method for transmitting background noise information including a silence descriptor identifier and background noise parameters in a communication system in which the information to be transmitted is formed into data frames. The data frames are subjected to channel coding to form channel-coded frames. The channel-coded frames are interleaved to be transmitted in two or more data transmission frames, and information of two channel-coded frames is transmitted in each data transmission frame. A first silence descriptor frame is formed provided with the silence descriptor identifier. The first silence descriptor frame is subjected to channel coding to form a channel-coded silence descriptor frame. The channel-coded silence descriptor frame is transmitted in two or more data transmission frames, and at least one data transmission frame transmitting part of the channel-coded silence descriptor frame is also used to transmit at least the background noise parameters.

    摘要翻译: 本发明涉及一种用于在其中要发送的信息被形成为数据帧的通信系统中发送包括静音描述符标识符和背景噪声参数的背景噪声信息的方法。 对数据帧进行信道编码以形成信道编码帧。 信道编码帧被交织以在两个或更多个数据传输帧中发送,并且在每个数据传输帧中发送两个信道编码帧的信息。 形成了具有静默描述符标识符的第一个静默描述符帧。 对第一个静默描述符帧进行信道编码以形成信道编码的静默描述符帧。 信道编码静音描述符帧在两个或多个数据传输帧中发送,并且至少一个数据传输帧发送部分信道编码静音描述符帧也用于至少发送背景噪声参数。

    High frequency enhancement layer coding in wideband speech codec
    48.
    发明授权
    High frequency enhancement layer coding in wideband speech codec 有权
    宽带语音编解码器中的高频增强层编码

    公开(公告)号:US06615169B1

    公开(公告)日:2003-09-02

    申请号:US09691440

    申请日:2000-10-18

    IPC分类号: G10L1902

    摘要: A speech coding method and device for encoding and decoding an input signal and providing synthesized speech, wherein the higher frequency components of the synthesized speech are achieved by high-pass filtering and coloring an artificial signal to provide a processed artificial signal. The processed artificial signal is scaled by a first scaling factor during the active speech periods of the input signal and a second scaling factor during the non-active speech periods, wherein the first scaling factor is characteristic of the higher frequency band of the input signal and the second scaling factor is characteristic of the lower frequency band of the input signal. In particular, the second scaling factor is estimated based on the lower frequency components of the synthesized speech and the coloring of the artificial signal is based on the linear predictive coding coefficients characteristic of the lower frequency of the input signal.

    摘要翻译: 一种用于对输入信号进行编码和解码并提供合成语音的语音编码方法和装置,其中通过对人造信号进行高通滤波和着色以提供经处理的人造信号来实现合成语音的较高频率分量。 处理的人造信号在输入信号的有效语音周期期间由第一缩放因子缩放,在非活动语音周期期间被缩放,其中第一缩放因子是输入信号的较高频带的特征, 第二比例因子是输入信号的较低频带的特征。 特别地,基于合成语音的较低频率分量来估计第二缩放因子,并且人造信号的着色基于输入信号的较低频率的线性预测编码系数特性。

    Pitch-lag estimation in speech coding
    49.
    发明授权
    Pitch-lag estimation in speech coding 失效
    语音编码中的音调滞后估计

    公开(公告)号:US06199035B1

    公开(公告)日:2001-03-06

    申请号:US09073697

    申请日:1998-05-06

    IPC分类号: G10L1104

    摘要: A method of speech coding a sampled speech signal using long term prediction (LTP). A LTP pitch-lag parameter is determined for each frame of the speech signal by first determining the autocorrelation function for the frame within the signal, between predefined maximum and minimum delays. The autocorrelation function is then weighted to emphasize the function for delays in the neighborhood of the pitch-lag parameter determined for the most recent voiced frame. The maximum value for the weighted autocorrelation function is then found and identified as the pitch-lag parameter for the frame.

    摘要翻译: 使用长期预测(LTP)语音编码采样语音信号的方法。 通过首先确定信号内的帧,在预定义的最大和最小延迟之间的自相关函数,确定语音信号的每个帧的LTP音调滞后参数。 然后对自相关函数进行加权,以强调为最近的有声帧确定的音调滞后参数的邻域中的延迟的功能。 然后找到加权自相关函数的最大值,并将其识别为帧的音调滞后参数。