Early detection of zeros in the transform domain
    1.
    Patent application
    Legal status: in force

    Publication No.: US20040264575A1

    Publication date: 2004-12-30

    Application No.: US10831158

    Filing date: 2004-04-26

    Inventor: Gisle Bjontegaard

    IPC classification: H04N007/12

    Abstract: A method detects blocks that are to be indicated as skipped at an earlier stage of the encoding process than would be the case with other implementations of the ITU H.263 and H.264 standards. The method includes transforming the 4×4 blocks in macro blocks having a skip vector of zero with a binary-transform function. Blocks in which the values of the four uppermost-left binary-transform coefficients are less than a predefined threshold are defined as skipped, thus minimizing the need for computationally demanding block transformation or quantization.
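The skip test described in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes the "binary-transform function" is a 4×4 Hadamard transform (entries of +1 and -1 only), that the "four uppermost-left coefficients" are the top-left 2×2 corner of the coefficient block, and that the input is the difference between the current block and the co-located block of the previous picture. The names `H4`, `is_skippable`, `residual_block` and `threshold` are hypothetical.

```python
import numpy as np

# 4x4 Hadamard matrix in sequency order. Its entries are only +1/-1, so the
# transform needs additions and subtractions but no multiplications.
# Assumption: this stands in for the abstract's "binary-transform function".
H4 = np.array([
    [1,  1,  1,  1],
    [1,  1, -1, -1],
    [1, -1, -1,  1],
    [1, -1,  1, -1],
])


def is_skippable(residual_block, threshold):
    """Return True if the four top-left transform coefficients are all
    smaller in magnitude than `threshold` (assumed skip criterion)."""
    block = np.asarray(residual_block, dtype=np.int64)
    if block.shape != (4, 4):
        raise ValueError("expected a 4x4 block")
    coeffs = H4 @ block @ H4.T          # 2-D separable transform
    low_freq = coeffs[:2, :2]           # four uppermost-left coefficients
    return bool(np.all(np.abs(low_freq) < threshold))
```

For example, `is_skippable(np.zeros((4, 4), dtype=int), threshold=1)` reports a skippable block, so under these assumptions the encoder could bypass the full integer transform and quantization for it.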

    Echo canceller with reduced requirement for processing power
    2.
    Patent application
    Legal status: in force

    Publication No.: US20040218755A1

    Publication date: 2004-11-04

    Application No.: US10724043

    Filing date: 2003-12-01

    IPC classification: H04M009/08

    CPC classification: H04M9/082

    Abstract: An echo canceller processes echo, noise and near-end talk in a narrower, but still intelligible, frequency band in order to reduce the required processing power and complexity. In a preferred embodiment of the present invention, an input audio signal of captured sound in an audio communication system is decimated and then divided into a number of sub bands by an analysis filter. Each sub band is processed as in conventional audio echo cancelling, by subtracting from the signal an echo estimate obtained from a model of the acoustic signal in the respective sub band, except that the signal is also bypassed, adjusted by a filter and subtracted from the processed signal. The resulting signals are then recombined by a synthesis filter and interpolated to the original sampling rate and bandwidth. Finally, the output from the synthesis filter is added to the input audio signal, which has been delayed and adjusted by a filter. The filters are controlled by a control algorithm that detects the presence of near-end sound, far-end sound and noise, so that the filters, and consequently the high-pass filter of the echo canceller, pass high frequencies (above the low-pass frequencies) only when near-end sound alone is detected.

    Method for vector prediction
    3.
    Patent application
    Legal status: in force

    Publication No.: US20040146110A1

    Publication date: 2004-07-29

    Application No.: US10722479

    Filing date: 2003-11-28

    Inventor: Gisle Bjontegaard

    IPC classification: H04N007/12

    CPC classification: H04N19/57 H04N19/52 H04N19/56

    Abstract: A method for predicting the motion vector of a pixel block in a video picture that is to be coded. The actual motion vectors of two adjacent blocks close to the uppermost left corner of the block are selected as candidates for the prediction. One additional block, also adjacent to the block, is selected to decide which of the candidate motion vectors is to be used as the prediction. The vector difference to the motion vector of the decision block is decisive for the final selection.
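The selection step in this abstract can be sketched as below. The abstract does not say in which direction the vector difference decides, nor which distance measure is used, so this sketch assumes that the candidate closer to the decision block's vector (sum of absolute component differences) is chosen, with ties going to the first candidate. The function name `select_prediction` and its parameters are hypothetical.

```python
def select_prediction(cand_a, cand_b, decision):
    """Choose one of two candidate motion vectors as the prediction.

    Each vector is an (x, y) tuple. Assumed rule: pick the candidate with
    the smaller L1 distance to the decision block's motion vector; a tie
    goes to cand_a. The abstract leaves the exact rule unspecified.
    """
    def l1(u, v):
        # Sum of absolute differences of the two vector components.
        return abs(u[0] - v[0]) + abs(u[1] - v[1])

    if l1(cand_a, decision) <= l1(cand_b, decision):
        return cand_a
    return cand_b
```

Under these assumptions the decision block acts purely as a selector: its own vector is never used as the prediction, only to choose between the two candidates.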

    Method and apparatus for video compression
    6.
    Patent application
    Legal status: in force

    Publication No.: US20040233993A1

    Publication date: 2004-11-25

    Application No.: US10844054

    Filing date: 2004-05-12

    IPC classification: H04N007/12

    Abstract: A unified solution to coding/decoding of different video formats such as 4:2:0, 4:2:2 and 4:4:4 is provided. A method of video coding includes transforming a first m×n macro block of residual chrominance pixel values of moving pictures by a first integer-transform function, generating a corresponding second m×n macro block of integer-transform coefficients, and further transforming the DC values of the integer-transform coefficients by a second integer-transform function to generate a third block of integer-transformed DC coefficients. The method further includes generating the second m×n macro block of integer-transform coefficients by utilizing a k×k integer-transform function on each k×k sub-block of the first m×n macro block, wherein n and m are each a multiple of k, and generating the third block of coefficients by utilizing a second i×j integer-transform function on the DC values, resulting in a (m/k)×(n/k) third block of integer-transformed DC coefficients.
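The two-stage transform in this abstract can be sketched as follows for k = 4. This is an illustration under stated assumptions rather than the claimed method: the first stage uses the 4×4 integer core transform matrix of H.264, and the second stage is assumed to be a Hadamard transform of size (m/4)×(n/4), which limits the sketch to blocks with 2 or 4 sub-blocks per dimension. The names `C4`, `HADAMARD` and `two_stage_transform` are hypothetical.

```python
import numpy as np

K = 4  # sub-block size k

# 4x4 integer core transform matrix as used in H.264 (first stage).
C4 = np.array([
    [1,  1,  1,  1],
    [2,  1, -1, -2],
    [1, -1, -1,  1],
    [1, -2,  2, -1],
])

# Assumed second-stage transforms, keyed by the number of sub-blocks
# along one dimension (Hadamard matrices of size 2 and 4).
HADAMARD = {
    2: np.array([[1, 1], [1, -1]]),
    4: np.array([[1, 1, 1, 1], [1, 1, -1, -1], [1, -1, -1, 1], [1, -1, 1, -1]]),
}


def two_stage_transform(residual):
    """Transform an m x n chroma residual block in two stages.

    Returns (coeffs, dc_coeffs): the m x n first-stage coefficients and the
    (m/4) x (n/4) block of second-stage DC coefficients.
    """
    residual = np.asarray(residual, dtype=np.int64)
    m, n = residual.shape
    if m % K or n % K or m // K not in HADAMARD or n // K not in HADAMARD:
        raise ValueError("block dimensions must be 8 or 16 in this sketch")
    coeffs = np.empty_like(residual)
    for r in range(0, m, K):
        for c in range(0, n, K):
            # First stage: k x k integer transform on each sub-block.
            coeffs[r:r + K, c:c + K] = C4 @ residual[r:r + K, c:c + K] @ C4.T
    dc = coeffs[::K, ::K]  # one DC value per sub-block
    # Second stage: transform the (m/k) x (n/k) block of DC values.
    dc_coeffs = HADAMARD[m // K] @ dc @ HADAMARD[n // K].T
    return coeffs, dc_coeffs
```

With this layout an 8×8 chroma block (4:2:0) yields a 2×2 DC block, an 8×16 or 16×8 block (4:2:2) a 2×4 or 4×2 DC block, and a 16×16 block (4:4:4) a 4×4 DC block, which is the sense in which one procedure covers all three formats.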

    Video teleconferencing system with digital transcoding

    Publication No.: US20030231600A1

    Publication date: 2003-12-18

    Application No.: US10426245

    Filing date: 2003-04-29

    Inventor: Mark D. Polomski

    IPC classification: H04L012/16

    CPC classification: H04N7/15 H04N7/152 H04N19/40

    Abstract: A video teleconferencing system uses digital transcoding to obtain algorithm transcoding, transmission rate matching, and spatial mixing. The video teleconferencing system comprises a multipoint control unit (MCU) for allowing multiple audiovisual terminals, which send and receive compressed digital data signals, to communicate with each other in a conference. The MCU has a video processing unit (VPU) that performs algorithm transcoding, rate matching, and spatial mixing among the terminals within a conference. The VPU includes a time division multiplex pixel bus and a plurality of processors. Each processor is assignable to an audiovisual terminal in the conference and is coupled to the pixel bus. In a receive mode, each processor receives and decodes compressed video signals from its assigned terminal and puts the decoded signal onto the pixel bus. In a transmit mode, each processor receives from the pixel bus uncompressed video signals from any terminal in the conference. The uncompressed video signals are processed and encoded for transmission to the respective assigned terminal. Video encoding time due to motion displacement search is reduced by passing displacement information from the compressed video signals to the encoder to be used directly or as a seed for further refinements of the motion displacement field.