摘要:
A method and system to improve the performance of phase correlation motion estimation for low-bit-precision implementation are described herein. Phase correlation uses the Fast Fourier Transform (FFT) with operations with infinite-precision constants. Since physical implementations use finite-precision arithmetic, there is some loss in precision relative to the ideal infinite-precision case. In low-complexity implementations, it is desirable to use as few bits as possible, and if the precision is too low, the performance of traditional phase correlation suffers. A pre-processing technique is applied to the data prior to taking the FFT, which minimizes the negative effects of finite precision in the FFT and allows high quality results from phase correlation. The pre-processing step is a content-dependent contrast adjustment that maps the range of the input images' pixel values to the range of input values for the FFT. There is no post-processing required after the FFT to compensate for the pre-processing step.
摘要:
An image alignment method includes computationally efficient methods of achieving high-accuracy local motion estimates by using phase correlation. The method also estimates motion reliability that allows a generic robust model fitting algorithm to produce more accurate results while operating much more efficiently. One of three methods are used to determine sub-pel motion estimation with improved accuracy. Each of the sub-pel motion estimation methods uses phase correlation, and are based on fitting computationally efficient 2-D quadratic surfaces to a phase correlation surface. A pre-filter is applied which shapes the phase correlation surface to enable appropriate fitting to the quadratic surface. Bias is also compensated for prior to applying a sub-pel motion estimation method. The method also estimates the reliability of the sub-pel motion estimates determined using phase correlation.
摘要:
A method to estimate segmented motion uses phase correlation to identify local motion candidates and a region-growing algorithm to group small picture units into few distinct regions, each of which has its own motion according to optimal matching and grouping criteria. Phase correlation and region growing are combined which allows sharing of information. Using phase correlation to identify a small number of motion candidates allows the space of possible motions to be narrowed. The region growing uses efficient management of lists of matching criteria to avoid repetitively evaluating matching criteria.
摘要:
Content adaptive detection of images having stand-out objects involves block variance-based detection and determining if an object includes a stand-out object. The images with a stand-out object are further processed to isolate an object of interest. The images without a detected stand-out object are further processed with a transition map-based detection method which includes generating a transition map. If an object portrait is determined from the transition map, then the image is further processed to isolate the object of interest.
摘要:
Apparatus and method for automatically generating panoramic still photographs from a sequence of images collected during panning. Programming within the camera allows creating the panoramic image output from multiple captured stills and/or video frames without laborious user “stitching”. A sequence of images are captured under control of the camera which span a desired subject area being panned (in any direction) by the user. As the images are being captured, the programming assures that the edges of adjacent images in the sequence sufficiently overlap one another as the desired subject area is being panned, as well as controlling other necessary camera adjustments (e.g., maintaining fixed focus). A set of sequential overlapping image frames is collected and combined to create at least one panoramic still photograph. The user can preferably change settings to control how the images are put together into the panoramic image output.
摘要:
A method and apparatus for forming a demosaiced image from a color-filter-array (“CFA”) image is provided. The CFA image comprises a first set of pixels colored according to a first (e.g., a green) color channel, a second set of pixels colored according to a second (e.g., a red) color channel and a third set of pixels colored according to a third (e.g., blue) color channel. The method may include obtaining an orientation map, which includes, for each pixel of the color-filter-array image, an indicator of orientation of an edge bounding such pixel. The method may further include interpolating the first color channel at the second and third sets of pixels as a function of the orientation map so as to form a fourth set of pixels. The method may also include interpolating the second color channel at the first and third sets of pixels as a function of the orientation map and the fourth set of pixels; and interpolating the third color channel at the first and second sets of pixels as a function of the orientation map and the fourth set of pixels.
摘要:
Brightness values of image frames are collected and counted to form a brightness distribution. The brightness distribution of the image frames is compared with predetermined brightness distributions. Brightness values of all pixels of successive image frames are adjusted according to a result of the comparison, so as to improve image contrast.
摘要:
A video system includes: analyzing video data, having a block; performing a transition change detection for determining a spatial intensity transition within the block; performing a block-wise similarity measurement on the block in the video data for identifying a blocking artifact; and filtering with a two dimensional cross filter every pixel in the block for removing the blocking artifact.
摘要:
Low complexity edge detection and DCT type selection method to improve the visual quality of H.264/AVC encoded video sequence is described. Encoding-generated information is reused to detect an edge macroblock. Variance and Mean Absolute Difference (MAD) of one macroblock shows a certain relationship that is able to be used to differentiate the edge macroblock and the non-edge macroblock. Also, the variance difference of neighbor macroblocks provides a hint for edge existence. Then, a block-based edge detection method uses this information. To determine the DCT type for each block, the detected edges are differentiated as visual obvious edge, texture-like edge, soft edge and strong edge. 8×8 DCT is used for texture-like edges and the 4×4 DCT is used for all the other edges. The result is an efficient and accurate edge detection and transform selection method.
摘要:
In one embodiment, a coding mode selection method is provided to improve the visual quality of an encoded video sequence. The coding mode is selected based on a human visual tolerance level. Picture data may be received for a video coding process. The picture data is then analyzed to determine human visual tolerance adjustment information. For example, parameters of a cost equation may be adjusted based on the human visual tolerance level, which may be a tolerance that is based on a distortion bound that the human visual system can tolerate. The picture data may be analyzed in places that are considered visually sensitive areas, such as trailing suspicious areas, stripping suspicious areas, picture boundary areas, and/or blocking suspicious areas. Depending on what kind of visually sensitive area is found in the picture data, a parameter in a cost equation may be adjusted based on different visual tolerance thresholds. The coding mode is then determined based on the cost.