摘要:
Embodiments of the present invention provide a control system for video processes that selectively control the operation of motion stabilization processes. According to the present invention, motion sensor data indicative of motion of a mobile device may be received and processed. A determination may be made by comparing processed motion sensor data to a threshold. Based on the determination, motion stabilization may be suspended on select portions of a captured video sequence.
摘要:
A rate control system is disclosed for video coding applications. The rate controller assigns a quantization parameter for video data in a picture in response to complexity indicators indicative of spatial complexity, motion complexity and/or bits per pel of the picture. A virtual buffer based quantizer parameter is proposed based on a virtual buffer fullness analysis and a target rate estimate, which is derived from the complexity indicators. A second quantizer parameter is proposed from a linear regression analysis of quantizer parameters used to code previously coded pictures of similar type (e.g., I pictures, P pictures or B pictures). A coding policy decision unit defines a final quantizer parameter from a comparison of the two proposed quantizer parameters.
摘要:
A video coding system and method for increasing a transmitted output bit rate of a video encoding system by altering the content of the bit stream. A video encoder may receive a coding mode signal from a computer application for coding source video data, the coding mode signal indicating a target bit rate having a risk factor related to transmission error associated to the target bit rate. The coded bitstream may be modified based on the risk factor indicated in the coding mode signal. A modified coded bitstream may be outputted at the target bit rate and at a reduced coding efficiency, and the channel may be tested for transmission errors. Based on the test results, a revised coding mode signal indicating the same target bit rate, but a revised risk factor may be provided. The coded bitstream may be revised by removing the modifications previously made to the coded bitstream and a revised coded bitstream having greater coding efficiency may be output at the target bit rate.
摘要:
Methods, systems, and apparatus are presented for reducing distortion in an image, such as a video image. A video image can be captured by an image capture device, e.g. during a video conferencing session. Distortion correction processing, such as the application of one or more warping techniques, can be applied to the captured image to produce a distortion corrected image, which can be transmitted to one or more participants. The warping techniques can be performed in accordance with one or more warp parameters specifying a transformation of the captured image. Further, the warp parameters can be generated in accordance with an orientation of the image capture device, which can be determined based on sensor data or can be a fixed value. Additionally or alternatively, the warp parameters can be determined in accordance with a reference image or model to which the captured image should be warped.
摘要:
A method and system are provided to encode a video sequence into a compressed bitstream. An encoder receives a video sequence from an image-capture device, together with metadata associated with the video sequence, and codes the video sequence into a first compressed bitstream using the metadata to select or revise a coding parameter associated with a coding operation. Optionally, the video sequence may be conditioned for coding by a preprocessor, which also may use the metadata to select or revise a preprocessing parameter associated with a preprocessing operation. The encoder may itself generate metadata associated with the first compressed bitstream, which may be used together with any metadata received by the encoder, to transcode the first compressed bitstream into a second compressed bitstream. The compressed bitstreams may be decoded by a decoder to generate recovered video data, and the recovered video data may be conditioned for viewing by a postprocessor, which may use the metadata to select or revise a postprocessing parameter associated with a postprocessing operation.
摘要:
Embodiments of the present invention provides a method and device for processing a source video. The method and device may provide computing an artifact estimation from differences among pixels selected from spatially-distributed sampling patterns in the source video; filtering the source video to produce a filtered version of the source video, computing a blending factor based on the artifact estimation in the source video, and computing an output video by blending the source video and the filtered version of the source video based on the blending factor.
摘要:
A method and device for dithering video from a decoder. The method and device provide: selecting a segment of video frames from the video; computing a noise map for the segment of the video, the noise map computed from differences among pixels selected from spatially-distributed sampling patterns in the segment; determining contours of quantization artifacts within the video frame based on the noise map and a histogram of pixel values in the video frame; computing a gradient measurement of pixel values along the contours; identifying regions along the contours of quantization artifacts to apply dithering based on the noise map and the gradient measurement; determining a dithering strength based on the noise map and the gradient measurement; and applying dithering noise to the identified regions at the determined dithering strength.
摘要:
Method For Implementing A Quantizer In A Multimedia Compression And Encoding System is disclosed. In the Quantizer system of the present invention, several new quantization ideas are disclosed. In one embodiment, adjacent macroblocks are grouped together into macroblock groups. The macroblock groups are then assigned a common quantizer value. The common quantizer value may be selected based upon how the macroblocks are encoded, the type of macroblocks within the macroblock group (intra-blocks or inter-blocks), the history of the motion vectors associated with the macroblocks in the macroblock group, the residuals of the macroblocks in the macroblock group, and the energy of the macroblocks in the macroblock group. The quantizer value may be adjusted in a manner that is dependent on the current quantizer value. Specifically, if the quantizer value is at the low end of the quantizer scale, then only small adjustments are made. If the quantizer value is at the high end then larger adjustments may be made to the quantizer. Finally, in one embodiment, the quantizer is implemented along with an inverse quantizer for efficient operation.
摘要:
Some embodiments provide an architecture for establishing multi-participant video conferences. This architecture has a central distributor that receives video images from two or more participants. From the received images, the central distributor generates composite images that the central distributor transmits back to the participants. Each composite image includes a set of sub images, where each sub image belongs to one participant. In some embodiments, the central distributor saves network bandwidth by removing each particular participant's image from the composite image that the central distributor sends to the particular participant. In some embodiments, images received from each participant are arranged in the composite in a non-interleaved manner. For instance, in some embodiments, the composite image includes at most one sub-image for each participant, and no two sub-images are interleaved.
摘要:
A coding technique is disclosed in which frames of a video sequence are assigned to one of a plurality of sub-channels to be transmitted to a decoder. The frames are coded according to predictive coding techniques such that ordinarily prediction references of the frames in each sub-channel only reach the reference frames that occur within the same sub-channel. Thus, if transmission errors arise with respect to one sub-channel, decoding may occur for other sub-channels until the transmission error is detected and corrected. The decoder may also try to reconstruct the frames in the failed sub-channel by interpolating from the frames in other channels. Furthermore, when feedback scheme is available between the encoder and decoder, the encoder may restart the failed sub-channel by coding the next frames in the sub-channel by predicting from correctly received frames in other sub-channels. And the encoder and decoder may resume normal encoding and decoding once the restart frame is sent and received, respectively. Additionally, the encoder and decoder can maintain an identical and correctly received long-term reference frame that can be used to restart all sub-channels in cases all sub-channels are corrupted at one point. The long-term reference frame can be refreshed periodically.