Abstract:
Systems and methods are provided for providing patch size adaptation for patch-based image enhancement operations. In one embodiment, an image manipulation application receives an input image. The image manipulation application compares a value for an attribute of at least one input patch of the input image to a threshold value. Based on comparing the value for the to the threshold value, the image manipulation application adjusts a first patch size of the input patch to a second patch size that improves performance of a patch-based image enhancement operation as compared to the first patch size. The image manipulation application performs the patch-based image enhancement operation based on one or more input patches of the input image having the second patch size.
Abstract:
In techniques for fast dense patch search and quantization, partition center patches are determined for partitions of example image patches. Patch groups of an image each include similar image patches and a reference image patch that represents a respective patch group. A partition center patch of the partitions is determined as a nearest neighbor to the reference image patch of a patch group. The partition center patch can be determined based on a single-nearest neighbor (1-NN) distance determination, and the determined partition center patch is allocated as the nearest neighbor to the similar image patches in the patch group. Alternatively, a group of nearby partition center patches are determined as the nearest neighbors to the reference image patch based on a k-nearest neighbor (k-NN) distance determination, and the nearest neighbor to each of the similar image patches in the patch group is determined from the nearby partition center patches.
Abstract:
In techniques for object detection with boosted exemplars, weak classifiers of a real-adaboost technique can be learned as exemplars that are collected from example images. The exemplars are examples of an object that is detectable in image patches of an image, such as faces that are detectable in images. The weak classifiers of the real-adaboost technique can be applied to the image patches of the image, and a confidence score is determined for each of the weak classifiers as applied to an image patch of the image. The confidence score of a weak classifier is an indication of whether the object is detected in the image patch of the image based on the weak classifier. All of the confidence scores of the weak classifiers can then be summed to generate an overall object detection score that indicates whether the image patch of the image includes the object.
Abstract:
Image upscaling techniques are described. These techniques may include use of iterative and adjustment upscaling techniques to upscale an input image. A variety of functionality may be incorporated as part of these techniques, examples of which include content-adaptive patch finding techniques that may be employed to give preference to an in-place patch to minimize structure distortion. In another example, content metric techniques may be employed to assign weights for combining patches. In a further example, algorithm parameters may be adapted with respect to algorithm iterations, which may be performed to increase efficiency of computing device resource utilization and speed of performance. For instance, algorithm parameters may be adapted to enforce a minimum and/or maximum number to iterations, cease iterations for image sizes over a threshold amount, set sampling step sizes for patches, employ techniques based on color channels (which may include independence and joint processing techniques), and so on.
Abstract:
A method and systems of enhancing a video using a related image are provided. One or more patches are identified in the video, with each patch identifying a region that is present in one of the frames of the video that can be mapped to a similar region in at least one other frame of the video. For each identified patch in the video, a best matching patch in the related image is found. The video is enhanced using the best matching patch in the related image for each identified patch in the video.
Abstract:
Systems and methods are discussed to localize facial landmarks using a test facial image and a set of training images. The landmarks can be localized on a test facial image using training facial images. A plurality of candidate landmark locations on the test facial image can be determined. A subset of the training facial images with facial features similar to the facial features in the test facial image can be identified. A plurality of shape constraints can be determined for each test facial image in the subset of test facial images. These shape constraints graphically relate to one landmark location from a linear combination of the other landmark locations in the test facial image. Shape constraints can be determined for every landmark within each test facial image. A candidate landmark can be chosen from the plurality of candidate landmarks using the shape constraints.
Abstract:
Various embodiments of methods and apparatus for feature point localization are disclosed. A profile model and a shape model may be applied to an object in an image to determine locations of feature points for each object component. Input may be received to move one of the feature points to a fixed location. Other ones of the feature points may be automatically adjusted to different locations based on the moved feature point.
Abstract:
Various embodiments describe view switching of video on a computing device. In an example, a video processing application receives a stream of video data. The video processing application renders a major view on a display of the computing device. The major view presents a video from the stream of video data. The video processing application inputs the stream of video data to a deep learning system and receives back information that identifies a cropped video from the video based on a composition score of the cropped video, while the video is presented in the major view. The composition score is generated by the deep learning system. The video processing application renders a sub-view on a display of the device, the sub-view presenting the cropped video. The video processing application renders the cropped video in the major view based on a user interaction with the sub-view.
Abstract:
Text region detection techniques and systems for digital images using image tag filtering are described. These techniques and systems support numerous advantages over conventional techniques through use of image tags to filter text region candidates. A computing device, for instance, may first generate text region candidates through use of a variety of different techniques, such as text line detection. The computing device then assigns image tags to the text region candidates. The assigned image tags are then used by the computing device to filter the text region candidates based on whether image tags assigned to respective candidates are indicative of text.
Abstract:
Embodiments of the present disclosure relate to a sky editing system and related processes for sky editing. The sky editing system includes a composition detector to determine the composition of a target image. A sky search engine in the sky editing system is configured to find a reference image with similar composition with the target image. Subsequently, a sky editor replaces content of the sky in the target image with content of the sky in the reference image. As such, the sky editing system transforms the target image into a new image with a preferred sky background.