Reducing memory requirements of a codebook vector search
    1.
    发明授权
    Reducing memory requirements of a codebook vector search 有权
    减少码本向量搜索的内存要求

    公开(公告)号:US06789059B2

    公开(公告)日:2004-09-07

    申请号:US09876352

    申请日:2001-06-06

    IPC分类号: G10L1910

    CPC分类号: G10L19/10 G10L2019/0013

    摘要: Methods and apparatus for quickly selecting an optimal excitation waveform from a codebook are presented herein. To reduce the number of computations required to choose the optimal codebook vector, a subset of codevectors are selected based upon optimal pulse locations, wherein the subset of codevectors form a subcodebook. Rather than searching the entire codebook, only the entries of the subcodebook are searched.

    摘要翻译: 本文给出了从码本快速选择最佳激励波形的方法和装置。 为了减少选择最佳码本向量所需的计算次数,基于最佳脉冲位置选择码矢量的子集,其中码矢量子集形成子码本。 而不是搜索整个码本,只搜索子码本的条目。

    SELECTION OF ENCODING MODES AND/OR ENCODING RATES FOR SPEECH COMPRESSION WITH OPEN LOOP RE-DECISION
    2.
    发明申请
    SELECTION OF ENCODING MODES AND/OR ENCODING RATES FOR SPEECH COMPRESSION WITH OPEN LOOP RE-DECISION 有权
    编码模式和/或编码速率用于语音压缩与开环重新决定

    公开(公告)号:US20070219787A1

    公开(公告)日:2007-09-20

    申请号:US11625797

    申请日:2007-01-22

    IPC分类号: G10L11/04

    CPC分类号: G10L19/22

    摘要: In a device configurable to encode speech performing an open loop re-decision may comprise representing a speech signal by amplitude components and phase components for a current frame and a past frame. During the current frame, there may be an extraction of uncompressed amplitude components and uncompressed phase components. The amplitude components and the phase components from the past frame may then be retrieved. A set of features may be generated based on the uncompressed amplitude components from the current frame, the uncompressed phase components from the current frame, the amplitude components from the past frame, and the phase components from the past frame. The set of features may be checked as part of the open loop re-decision, and determining a final encoding decision based on the checking may be performed. The final encoding decision may be an encoding mode and/or encoding rate.

    摘要翻译: 在可配置为对执行开环重新判定的语音进行编码的装置中的装置可以包括通过当前帧和过去帧的幅度分量和相位分量表示语音信号。 在当前帧中,可以提取未压缩幅度分量和未压缩相位分量。 然后可以检索来自过去帧的幅度分量和相位分量。 可以基于来自当前帧的未压缩幅度分量,来自当前帧的未压缩相位分量,来自过去帧的幅度分量和来自过去帧的相位分量来生成一组特征。 可以将这组特征作为开环重新判定的一部分进行检查,并且可以执行基于检查来确定最终编码决定。 最终编码决定可以是编码模式和/或编码速率。

    ARBITRARY AVERAGE DATA RATES FOR VARIABLE RATE CODERS
    4.
    发明申请
    ARBITRARY AVERAGE DATA RATES FOR VARIABLE RATE CODERS 有权
    用于可变速率编码器的仲裁平均数据速率

    公开(公告)号:US20070171931A1

    公开(公告)日:2007-07-26

    申请号:US11625788

    申请日:2007-01-22

    IPC分类号: H04J3/17

    CPC分类号: G10L19/22 G10L19/24

    摘要: Methods and apparatus are provided for achieving an arbitrary average data rate for a variable rate coder. One method includes selecting a set (e.g., a pair) of initial composite rates surrounding the arbitrary average data rate. A reallocation fraction is then calculated based on the initial composite rates. The reallocation fraction is used to reassign a number of frames from one component rate of an initial composite rate to another in order to achieve the arbitrary average data rate. Such a method may be configured such that selecting an initial composite rate on one side of (e.g., less than) the arbitrary average data rate implicitly selects the initial composite rate on the other side of the arbitrary average data rate.

    摘要翻译: 提供了用于实现可变速率编码器的任意平均数据速率的方法和装置。 一种方法包括选择围绕任意平均数据速率的初始复合速率的集合(例如,一对)。 然后基于初始复合速率计算重新分配分数。 重新分配部分用于将多个帧从初始复合速率的一个分量速率重新分配给另一个,以便实现任意的平均数据速率。 这样的方法可以被配置为使得在任意平均数据速率的一侧(例如小于)选择初始复合速率隐含地选择任意平均数据速率的另一侧上的初始复合速率。

    Fast code-vector searching
    6.
    发明授权
    Fast code-vector searching 有权
    快速码矢量搜索

    公开(公告)号:US06766289B2

    公开(公告)日:2004-07-20

    申请号:US09874657

    申请日:2001-06-04

    IPC分类号: G10L1910

    CPC分类号: G10L19/10 G10L2019/0013

    摘要: Methods and apparatus for quickly selecting an optimal excitation waveform from a codebook are presented herein. In encoding schemes that use forward and backward pitch enhancement, storage and processor load is reduced by approximating a two-dimensional autocorrelation matrix with a one-dimensional autocorrelation vector. The approximation is possible when a cross-correlation element is configured to determine the autocorrelation matrix of an impulse response and a pulse energy determination element is configured to determine the energy of a pulse code vector that incorporates secondary pulse positions.

    摘要翻译: 本文给出了从码本快速选择最佳激励波形的方法和装置。 在使用前向和后向间距增强的编码方案中,通过用一维自相关向量逼近二维自相关矩阵来减少存储和处理器负载。 当互相关元件被配置为确定脉冲响应的自相关矩阵并且脉冲能量确定元件被配置为确定包含次级脉冲位置的脉冲码矢量的能量时,近似是可能的。

    Voice modifier for speech processing systems
    7.
    发明申请
    Voice modifier for speech processing systems 有权
    语音处理系统的语音修改器

    公开(公告)号:US20070233472A1

    公开(公告)日:2007-10-04

    申请号:US11398364

    申请日:2006-04-04

    IPC分类号: G10L19/00

    CPC分类号: G10L21/00 G10L21/003

    摘要: A speech converter in a speech processing system modifies various aspects of input speech. The speech converter receives a formants signal representing an input speech signal. The speech converter may also receive a formant scaling command or a user selection of one of multiple control signals, each specifying a manner of modifying one or more of the received signals (i.e., formants, voicing, pitch, gain). The speech converter modifies at least one of the formants, voicing, pitch, and/or gain signals as specified by the selected voice font.

    摘要翻译: 语音处理系统中的语音转换器修改输入语音的各个方面。 语音转换器接收表示输入语音信号的共振峰信号。 语音转换器还可以接收共振峰缩放命令或多个控制信号之一的用户选择,每个控制信号指定修改接收信号中的一个或多个(即,共振峰,发音,音高,增益)的方式。 语音转换器修改由所选择的语音字体指定的共振峰,浊音,音调和/或增益信号中的至少一个。

    Automatic white balance method and apparatus
    8.
    发明申请
    Automatic white balance method and apparatus 有权
    自动白平衡方法和装置

    公开(公告)号:US20050286097A1

    公开(公告)日:2005-12-29

    申请号:US11043572

    申请日:2005-01-25

    IPC分类号: H04N1/46 H04N1/60 H04N9/73

    摘要: Automatic white balance of captured images can be performed based on a gray world assumption. Initially, a flat field gray image is captured for one or more reference illuminations. The statistics of the captured gray image are determined and stored for each reference illumination during a calibration process. For each subsequent captured image, the image is filtered to determine a subset of gray pixels. The gray pixels are further divided into a one or more gray clusters. The average weight of the one or more gray clusters is determined and a distance from the average weights to the reference illuminants is determined. An estimate of the illuminant is determined depending on the distances. White balance gains are applied to the image based on the estimated illuminant.

    摘要翻译: 可以基于灰色世界假设执行拍摄图像的自动白平衡。 最初,对于一个或多个参考照明捕获平坦场灰色图像。 在校准过程中,为每个参考照明确定捕获的灰度图像的统计数据并存储。 对于每个后续捕获的图像,滤波图像以确定灰色像素的子集。 灰色像素被进一步分成一个或多个灰色簇。 确定一个或多个灰色簇的平均重量,并确定从平均重量到参考光源的距离。 根据距离确定光源的估计。 基于估计的光源将白平衡增益应用于图像。

    Re-formatting variable-rate vocoder frames for inter-system transmissions
    9.
    发明申请
    Re-formatting variable-rate vocoder frames for inter-system transmissions 有权
    重新格式化用于系统间传输的可变速率声码器帧

    公开(公告)号:US20050265399A1

    公开(公告)日:2005-12-01

    申请号:US11197178

    申请日:2005-08-03

    摘要: Methods and apparatus are presented for supporting the transmission of variable-rate vocoder frames over non-compatible communication channels. Variable-rate vocoder frames are re-formatted as cargo in multi-rate vocoder frames. At the receiver, a determination is made as to whether a received multi-rate vocoder frame carries a variable-rate vocoder frame cargo. If a variable-rate vocoder frame is cargo, then a determination of the frame type is made. Various embodiments for conveying cargo information are presented.

    摘要翻译: 提出了用于支持通过不兼容通信信道传输可变速率声码器帧的方法和装置。 可变速率声码器帧被重新格式化为多速率声码器帧中的货物。 在接收机处,确定接收的多速率声码器帧是否携带可变速率声码器帧货物。 如果可变速率声码器帧是货物,则确定帧类型。 呈现了用于传送货物信息的各种实施例。

    Systems, methods, and apparatus for wideband speech coding
    10.
    发明申请
    Systems, methods, and apparatus for wideband speech coding 有权
    用于宽带语音编码的系统,方法和装置

    公开(公告)号:US20070088542A1

    公开(公告)日:2007-04-19

    申请号:US11397794

    申请日:2006-04-03

    IPC分类号: G10L19/00

    摘要: A wideband speech encoder according to one embodiment includes a narrowband encoder and a highband encoder. The narrowband encoder is configured to encode a narrowband portion of a wideband speech signal into a set of filter parameters and a corresponding encoded excitation signal. The highband encoder is configured to encode, according to a highband excitation signal, a highband portion of the wideband speech signal into a set of filter parameters. The highband encoder is configured to generate the highband excitation signal by applying a nonlinear function to a signal based on the encoded narrowband excitation signal to generate a spectrally extended signal.

    摘要翻译: 根据一个实施例的宽带语音编码器包括窄带编码器和高带编码器。 窄带编码器被配置为将宽带语音信号的窄带部分编码成一组滤波器参数和相应编码的激励信号。 高带编码器被配置为根据高频激励信号将宽带语音信号的高频部分编码成一组滤波器参数。 高带编码器被配置为通过基于编码的窄带激励信号对信号应用非线性函数来产生高频激励信号,以产生频谱扩展信号。