摘要:
Methods and systems for determining a surface color of a target surface under an environment with an environmental light source. A plurality of images of the target surface are captured as the target surface is illuminated with a variable intensity, constant color light source and a constant intensity, constant color environmental light source, wherein the intensity of the light source on the target surface is varied by a known amount between the capturing of the images. A color feature tensor, independent of the environmental light source, is extracted from the image data, and used to infer a surface color of the target surface.
摘要:
A method, device and computer-readable medium for generating a super-resolution version of a compressed video stream. By leveraging the motion information and residual information in compressed video streams, described examples are able to skip the time-consuming motion-estimation step for most frames and make the most use of the SR results of key frames. A key frame SR module generates SR versions of I-frames and other key frames of a compressed video stream using techniques similar to existing multi-frame approaches to VSR. A non-key frame SR module generates SR version of the non-key inter frames between these key frames by making use of motion information and residual information used to encode the inter frames in the compressed video stream.
摘要:
Systems and methods for multi-frame video frame interpolation. Higher-order motion modeling, such as cubic motion modeling, achieves predictions of intermediate optical flow between multiple interpolated frames, assisted by relaxation of the constraints imposed by the loss function used in initial optical flow estimation. A temporal pyramidal optical flow refinement module performs coarse-to-fine refinement of the optical flow maps used to generate the intermediate frames, focusing a proportionally greater amount of refinement attention to the optical flow maps for the high-error middle frames. A temporal pyramidal pixel refinement module performs coarse-to-fine refinement of the generated intermediate frames, focusing a proportionally greater amount of refinement attention to the high-error middle frames. A generative adversarial network (GAN) module calculates a loss function for training the neural networks used in the optical flow estimation module, temporal pyramidal optical flow refinement module, and/or temporal pyramidal pixel refinement module.
摘要:
Methods, devices and computer-readable media for processing a compressed video to perform an inference task are disclosed. Processing the compressed video may include selecting a subset of frame encodings of the compressed video, or zero or more modalities (RGB, motion vectors, residuals) of a frame encoding, for further processing to perform the inference task. Pre-existing motion vector and/or residual information in frame encodings of the compressed video are leveraged to adaptively and efficiently perform the inference task. In some embodiments, the inference task is an action recognition task, such as a human action recognition task.
摘要:
Systems, methods and computer-readable medium for predicting a depth for a video frame are disclosed. An example method may include steps of: receiving a plurality of training data, each comprising a set of consecutive video frames and a depth representation of a subsequent video frame to the consecutive video frames; receiving a pre-trained neural network model fθ having a plurality of weights θ; while the pre-trained neural network model fθ has not converged: computing a plurality of second weights θi′, based on each set of consecutive video frames, and updating the plurality of weights θ, based on the plurality of training data and the plurality of second weights θi′; receiving a plurality of new consecutive video frames with consecutive timestamps; and predicting a depth representation of video frame immediately subsequent to the new consecutive video frames based on the updated plurality of weights θ.
摘要:
Methods and systems for image deblurring are described. The weights of a deblurring network are first trained by meta-training the deblurring network on both a primary deblurring task and an auxiliary reconstruction task. Application-time training of the deblurring network is then performed, using an application-time blurry input image, to obtain values of application-time trained weights. Application-time training includes performing the auxiliary reconstruction task on the application-time blurry input image, and updating the weights of the deblurring network based on an auxiliary loss computed from the auxiliary reconstruction task. A deblurred output image is generated from the application-time blurry input image, using the application-time trained weights in the deblurring network.
摘要:
Methods and systems for image deblurring are described. The weights of a deblurring network are first trained by meta-training the deblurring network on both a primary deblurring task and an auxiliary reconstruction task. Application-time training of the deblurring network is then performed, using an application-time blurry input image, to obtain values of application-time trained weights. Application-time training includes performing the auxiliary reconstruction task on the application-time blurry input image, and updating the weights of the deblurring network based on an auxiliary loss computed from the auxiliary reconstruction task. A deblurred output image is generated from the application-time blurry input image, using the application-time trained weights in the deblurring network.
摘要:
Devices and methods for extracting motion vector data during decoding compressed of video data are described. At a video decoder, an encoded video data for a frame of video from an input buffer is obtained. The encoded video data is decoded to obtain decoded image data for a decoded frame, where the decoding includes extracting corresponding motion vector data for the decoded frame. The decoded image data is stored in a temporary storage indexed with a given index, and the corresponding motion vector data is stored in a same or different temporary storage indexed with the given index. An output buffer indexed with the given index is filled with the decoded image data and the corresponding motion vector data stored in the respective temporary storage indexed with the given index.
摘要:
The present invention relates to a method for positioning a mobile station and a repeater thereof. Said method comprises: the mobile communication network, when receiving a positioning request from a mobile station, instructing repeaters to send auxiliary positioning signals, and the mobile station performs measurement according to the received auxiliary positioning signals sent from the repeaters and downlink signals sent from the base station, and then estimating the position of the mobile station according to the measurement results, and thereby implementing the positioning of the mobile station. Said repeater is implemented through adding an auxiliary positioning unit in the downlink processing channel of a traditional repeater; said auxiliary positioning unit comprises a communication module, a frame timing recovery module, a timing control module, and a pilot modulating module. The present invention achieves the object of improving the accuracy of positioning the mobile station within the coverage area of repeaters; in addition, the repeater with auxiliary positioning function according to the present invention is simple to be implemented and doesn't affect the structure and signaling flow of the traditional mobile station.
摘要:
The invention provides a novel calcium-independent cytosolic phospholipase A2/B enzyme, polynucleotides encoding such enzyme antibodies to such enzyme, and methods for screening unknown compounds for anti-inflammatory activity mediated by the arachidonic acid cascade.