Abstract:
A method for enhancing at least a section of lower-quality visual data using a hierarchical algorithm, the method comprising receiving at least a plurality of neighbouring sections of lower-quality visual data. A plurality of input sections from the received plurality of neighbouring sections of lower quality visual data are selected and features are extracted from those plurality of input sections of lower-quality visual data. A target section based on the extracted features from the plurality of input sections of lower-quality visual data is then enhanced.
Abstract:
An artificial neural network system for image classification, formed of multiple independent individual convolutional neural networks (CNNs), each CNN being configured to process an input image patch to calculate a classification for the center pixel of the patch. The multiple CNNs have different receptive field of views for processing image patches of different sizes centered at the same pixel. A final classification for the center pixel is calculated by combining the classification results from the multiple CNNs. An image patch generator is provided to generate the multiple input image patches of different sizes by cropping them from the original input image. The multiple CNNs have similar configurations, and when training the artificial neural network system, one CNN is trained first, and the learned parameters are transferred to another CNN as initial parameters and the other CNN is further trained. The classification includes three classes, namely background, foreground, and edge.
Abstract:
The present invention relates to the use of machine learning to improve motion estimation in video encoding. According to a first aspect, there is provided a method for estimating the motion between pictures of video data using a hierarchical algorithm, the method comprising steps of: receiving one or more input pictures of video data; identifying, using a hierarchical algorithm, one or more reference elements in one or more reference pictures of video data that are similar to one or more input elements in the one or more input pictures of video data; determining an estimated motion vector relating the identified one or more reference elements to the one or more input elements; and outputting an estimated motion vector.
Abstract:
A method for enhancing at least a section of lower-quality visual data using a hierarchical algorithm, the method comprises receiving at least one section of lower- quality visual data; and extracting a subset of features, from the at least one section of lower-quality visual data. A plurality of layers of reduced-dimension visual data from the extracted features are formed and enhanced to form at least one section of higher- quality visual data. The at least one section of higher-quality visual data corresponds to the at least one section of lower-quality visual data received.
Abstract:
The present disclosure provides systems and methods to increase resolution of imagery. In one example embodiment, a computer-implemented method includes obtaining a current low-resolution image frame. The method includes obtaining a previous estimated high-resolution image frame, the previous estimated high-resolution frame being a high-resolution estimate of a previous low-resolution image frame. The method includes warping the previous estimated high-resolution image frame based on the current low-resolution image frame. The method includes inputting the warped previous estimated high-resolution image frame and the current low-resolution image frame into a machine-learned frame estimation model. The method includes receiving a current estimated high-resolution image frame as an output of the machine-learned frame estimation model, the current estimated high-resolution image frame being a high-resolution estimate of the current low-resolution image frame.
Abstract:
A disclosed facial recognition system (and method) includes face parsing. In one approach, the face parsing is based on hierarchical interlinked multiscale convolutional neural network (HIM) to identify locations and/or footprints of components of a face image. The HIM generates multiple levels of image patches from different resolution images of the face image, where image patches for different levels have different resolutions. Moreover, the HIM integrates the image patches for different levels to generate interlinked image patches for different levels, where interlinked image patches for different levels have different resolutions. Furthermore, the HIM combines the interlinked image patches to identify refined locations and/or footprints of components.
Abstract:
Substantial reduction of the radiation dose in computed tomography (CT) imaging is shown using a machine-learning dose-reduction technique. Techniques are provided that (1) enhance low-radiation dosage images, beyond just reducing noise, and (2) may be combined with other approaches, such as adaptive exposure techniques and iterative reconstruction, for radiation dose reduction.
Abstract:
A mechanism is described for facilitating cinematic space-time view synthesis in computing environments according to one embodiment. A method of embodiments, as described herein, includes capturing, by one or more cameras, multiple images at multiple positions or multiple points in times, where the multiple images represent multiple views of an object or a scene, where the one or more cameras are coupled to one or more processors of a computing device. The method further includes synthesizing, by a neural network, the multiple images into a single image including a middle image of the multiple images and representing an intermediary view of the multiple views.
Abstract:
A method of reducing image resolution in a deep convolutional network (DCN) includes dynamically selecting a reduction factor to be applied to an input image. The reduction factor can be selected at each layer of the DCN. The method also includes adjusting the DCN based on the reduction factor selected for each layer.
Abstract:
The present invention relates to a method for training a plurality of visual processing algorithms for processing visual data. Thee method comprising the steps of using a pre-processing hierarchical algorithm to process the visual data prior to encoding the visual data in visual data processing, and using a post-processing hierarchical algorithm to further process the visual data following decoding visual data in visual data processing. The steps of steps of encoding and decoding are performed with respect to a predetermined visual data codec and in some embodiments may be content specific.