Abstract:
Techniques to identify and track a pre-identified region-of-interest (ROI) through a temporal sequence of frames/images are described. In general, a down-sampled color gradient (edge map) of an arbitrary sized ROI from a prior frame may be used to generate a small template. This initial template may be used to identify a region of a new or current frame that may be overscan and used to create a current frame's edge map. By comparing the prior frame's template to the current frame's edge map, a cost value or image may be found and used to identify the current frame's ROI center. The size of the current frame's ROI may be found by varying the size of putative new ROIs and testing for their congruence with the prior frame's template. Subsequent ROI's for subsequent frames may be identified to, effectively, track an arbitrarily sized ROI through a sequence of video frames.
Abstract:
Systems, methods, and computer readable media to improve image stabilization operations are described. Novel approaches for fusing non-reference images with a pre-selected reference frame in a set of commonly captured images are disclosed. The fusing approach may use a soft transition by using a weighted average for ghost/non-ghost pixels to avoid sudden transition between neighborhood and almost similar pixels. Additionally, the ghost/non-ghost decision can be made based on a set of neighboring pixels rather than independently for each pixel. An alternative approach may involve performing a multi-resolution decomposition of all the captured images, using temporal fusion, spatio-temporal fusion, or combinations thereof, at each level and combining the different levels to generate an output image.
Abstract:
In some embodiments, a method for compensating for lens motion includes estimating a starting position of a lens assembly associated with captured pixel data. The captured pixel data is captured from an image sensor. In some embodiments, the method further includes calculating from the starting position and position data received from the one or more position sensors lens movement associated with the captured pixel data. The lens movement is mapped into pixel movement associated with the captured pixel data. A transform matrix is adjusted to reflect at least the pixel movement. A limit factor associated with the position data is calculated. The captured pixel data is recalculated using the transform matrix and the limit factor.
Abstract:
A method for reducing noise in a sequence of frames may include generating a transformed frame from an input frame according to a perspective transform of a transform matrix, wherein the transform matrix corrects for motion associated with input frame. A determination may be made to identify pixels in the transformed frame that have a difference with corresponding pixels in a neighboring frame below a threshold. An output frame may be generated by adjusting pixels in the transformed frame that are identified to have the difference with the corresponding pixels in the neighboring frame below the threshold.
Abstract:
Systems, methods, and computer readable media to improve image stabilization operations are described. A novel combination of image quality and commonality metrics are used to identify a reference frame from a set of commonly captured images which, when the set's other images are combined with it, results in a quality stabilized image. The disclosed image quality and commonality metrics may also be used to optimize the use of a limited amount of image buffer memory during image capture sequences that return more images that the memory may accommodate at one time. Image quality and commonality metrics may also be used to effect the combination of multiple relatively long-exposure images which, when combined with a one or more final (relatively) short-exposure images, yields images exhibiting motion-induced blurring in interesting and visually pleasing ways.
Abstract:
Devices, methods, and non-transitory computer readable media are disclosed herein to repair or mitigate the appearance of unwanted reflection artifacts in captured video image streams. These unwanted reflection artifacts often present themselves as brightly-colored spots, circles, rings, or halos that reflect the shape of a bright light source in the captured image. These artifacts, also referred to herein as “ghosts” or “green ghosts” (due to often having a greenish tint), are typically located in regions of the captured images where there is not actually a bright light source located in the image. In fact, such unwanted reflection artifacts often present themselves on the image sensor across the principal point of the lens from where the actual bright light source in the captured image is located. Such devices, methods and computer readable media may be configured to detect, track, and repair such unwanted reflection artifacts in an intelligent and efficient fashion.
Abstract:
Devices, methods, and non-transitory program storage devices are disclosed to provide enhanced images in multi-camera systems, e.g., by using information from images captured by cameras with different properties in terms of optics and/or sensors. In one embodiment, the techniques comprise: obtaining a first image from a first image capture device, wherein the first image has a first field of view (FOV) and a first set of quality characteristics; obtaining a second image from a second image capture device, wherein the second image has a second FOV and a second set of quality characteristics, and wherein the second FOV partially overlaps the first FOV; obtaining a neural network that produces a modified second image having a modified second set of quality characteristics determined by the neural network attempting to match the first set of quality characteristics; and generating an output image based, at least in part, on the modified second image.
Abstract:
Systems and methods for stitching videos are disclosed. Image-based registration between frames from a first video source and frames from a second video source is performed at a first rate. Calibration-based registration between frames from the first video source and frames from the second video source are performed at a second rate higher than the first rate. Then, for a first frame from the first video source for which calibration-based registration data and image-based registration data have been generated, a stitching transform that maps the first frame to a counterpart frame from the second video source based on image-based registration data is generated. A delta transform from the image-based registration data and the calibration-based registration data at the first frame is also derived. For a subsequent frame from the first video source for which calibration-based registration data have been generated, but no image-based registration data have been generated, a stitching transform that maps the subsequent frame to a counterpart frame from the second video source based on the calibration-based registration data and the delta transform is generated. Frames from the first video source and frames from the second video source are stitched according to their respective generated stitching transforms.
Abstract:
Techniques are described for automatically selecting between multiple image capture subsystems with overlapping fields of view but different optical properties. A selection may be made by estimating a plurality of operational characteristics of an image capture event, and, based on those estimates, selecting a primary image capture subsystem for the image capture event. For example, in a device such as a cellphone comprising two capture subsystems, each subsystem including a lens system and sensor system where each subsystem has a different fixed optical zoom parameter, a subsystem can be chosen based on a combination of desired zoom value, estimated focus distance, and estimated scene brightness.
Abstract:
Systems, methods, and computer readable media to rapidly identify and track an arbitrary sized object through a temporal sequence of frames is described. The object being tracked may initially be identified via a specified or otherwise known region-of-interest (ROI). A portion of that ROI can be used to generate an initial or reference histogram and luminosity measure, metrics that may be used to identify the ROI in a subsequent frame. For a frame subsequent to the initial or reference frame, a series of putative ROIs (each having its own location and size) may be identified and the “best” of the identified ROIs selected. As used here, the term “best” simply means that the more similar two frames' histograms and luminosity measures are, the better one is with respect to the other.