摘要:
A method detects blocks that are to be indicated as skipped at an earlier stage of the encoding process, than would be the case with other implementations of the ITU H.263 and H.264 standards. The method includes transforming 4null4 blocks in the macro blocks having a skip vector of zero with a binary-transform function. The blocks having values of the four uppermost left binary-transform coefficients less than a predefined threshold, are defined as skipped, thus, minimizing the need for computationally demanding block transformation or quantization.
摘要:
An echo canceller processing echo, noise and near end talk in a narrower, but still intelligible, frequency band for reducing required processing power and complexity. In a preferred embodiment of the present invention, an input audio signal of captured sound in an audio communication system is decimated and then divided into a number of sub bands by an analyze filter. Each sub band is processed as in background audio echo cancelling by subtracting the signal with an echo estimate from a model of the acoustic signal in the respective sub band, except from that the signal is also bypassed, adjusted by a filter and subtracted from the processed signal. The resulting signals are then recombined by a synthesize filter and interpolated to the original sampling rate and bandwidth. Finally, the output from the synthesize filter is added to the input audio signal, which has been delayed and adjusted by a filter. The filters are controlled by a control algorithm detecting the presence of near end sound, far end sound and noise, so that the filters, and consequently the high pass filter of the echo canceller, only pass high frequency (above low pass frequencies) when only near end sound is detected.
摘要:
A method for prediction of the motion vector of a pixel block in a video picture that is to be coded. The actual motion vectors of two adjacent blocks close to the uppermost left corner of the block are selected as candidates for the prediction. One additional block, also adjacent to the block, is selected to decide which of the motion vectors to be used as the prediction. The vector difference to the motion vector of the decision block is decisive for the final selection.
摘要:
A system, apparatus, method, and computer program product for detecting and correcting packet losses that adversely affect video quality. This packet loss feature may be employed as part of the secure video conference system, method and computer program product, or be employed in a non-secure system, method and computer program product.
摘要:
A system, apparatus, method, and computer program product for establishing secure videoconferences between multiple nodes communicating via two or more transport protocols in which at least one encryption protocol is used.
摘要:
A unified solution to coding/decoding of different video formats such as 4:2:0, 4:2:2 and 4:4:4 is provided. A method of video coding includes transforming a first mnulln macro block of residual chrominance pixel values of moving pictures by a first integer-transform function generating a corresponding second mnulln macro block of integer-transform coefficients, further transforming DC values of the integer-transform coefficients by a second integer-transform function to generate a third block of integer-transformed DC coefficients. The method further includes generating the second mnulln macro block of integer-transform coefficients by utilizing a knullk integer-transform function on each knullk sub-block of the first mnulln macro block, wherein n and m are each a multiple of k, and generating the third block of coefficients by utilizing a second inullj integer-transform function on the DC values resulting in a (m/k)null(n/k) third block of integer-transformed DC coefficients.
摘要:
A video teleconferencing system uses digital transcoding to obtain algorithm transcoding, transmission rate matching, and spatial mixing. The video teleconferencing system comprises a multipoint control unit (MCU) for allowing multiple audiovisual terminals, which send and receive compressed digital data signals, to communicate with each other in a conference. The MCU has a video processing unit (VPU) that performs algorithm transcoding, rate matching, and spatial mixing among the terminals within a conference. The VPU includes a time division multiplex pixel bus and a plurality of processors. Each processor is assignable to an audiovisual terminal in the conference and is coupled to the pixel bus. In a receive mode, each processor receives and decodes compressed video signals from its assigned terminal and puts the decoded signal onto the pixel bus. In a transmit mode, each processor receives from the pixel bus uncompressed video signals from any terminal in the conference. The uncompressed video signals are processed and encoded for transmission to the respective assigned terminal. Video encoding time due to motion displacement search is reduced by passing displacement information from the compressed video signals to the encoder to be used directly or as a seed for further refinements of the motion displacement field.