Abstract:
A method (600) of generating a composite image from multiple images captured for a subject is disclosed. In some embodiments, the method (600) may include receiving, via an image capturing device (104), a plurality of sets of images of at least a portion of a subject. The images within a set of images may be captured at a plurality of vertical positions with respect to an associated fixed section of a horizontal plane (202). The method (600) may further include generating a plurality of focus-stacked images corresponding to the plurality of sets of images, for example, by combining the images in the associated set of images. The method (600) may further include aligning the plurality of focus-stacked images in the horizontal plane (202) based on a horizontal coordinate transformation model to generate a composite image representing the subject.
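The final alignment step can be illustrated with a minimal sketch. It assumes the horizontal coordinate transformation model is a pure translation, so each focus-stacked tile maps to a (row, column) offset in the composite plane. That is a simplification chosen for illustration, not something the abstract states. The function name `stitch` and the tile/offset representation are hypothetical.

```python
def stitch(tiles, offsets, fill=0):
    """Place focus-stacked tiles on one canvas.

    tiles   : list of 2D lists (one focus-stacked image per fixed section)
    offsets : list of (row, col) positions of each tile in the composite;
              a translation-only model is assumed here for illustration
    """
    height = max(r + len(t) for t, (r, c) in zip(tiles, offsets))
    width = max(c + len(t[0]) for t, (r, c) in zip(tiles, offsets))
    canvas = [[fill] * width for _ in range(height)]
    for tile, (r, c) in zip(tiles, offsets):
        for y, row in enumerate(tile):
            for x, value in enumerate(row):
                # Later tiles overwrite earlier ones where they overlap.
                canvas[r + y][c + x] = value
    return canvas


tile_a = [[1, 1], [1, 1]]
tile_b = [[2, 2], [2, 2]]
composite = stitch([tile_a, tile_b], [(0, 0), (0, 2)])
```

A real system would likely blend overlapping regions and estimate the offsets from stage coordinates or image features. Here the offsets are simply given.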
Abstract:
A method and system of determining quality of a document image is disclosed that includes segmenting, by one or more processors, a document image into a plurality of regions, each of which comprises text data. Each of the plurality of regions is classified into one of a plurality of image quality classes based on a determination of a highest prediction value from one of a plurality of machine learning models. Each of the plurality of machine learning models is trained for a corresponding one of the plurality of image quality classes. A cumulative quality score for the image is computed based on a weighted average of the number of regions classified into each of the plurality of image quality classes. The quality of the image is determined based on the cumulative quality score.
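The scoring step can be sketched as follows. The class names and weights are hypothetical, since the abstract does not specify them, and each region's per-class prediction values are assumed to be already available as a dictionary.

```python
from collections import Counter

# Hypothetical class weights; the abstract does not give concrete values.
CLASS_WEIGHTS = {"good": 1.0, "medium": 0.5, "bad": 0.0}


def classify_region(predictions):
    """Return the class whose model produced the highest prediction value."""
    return max(predictions, key=predictions.get)


def cumulative_quality_score(region_predictions, weights=CLASS_WEIGHTS):
    """Weighted average of the number of regions assigned to each class."""
    labels = [classify_region(p) for p in region_predictions]
    if not labels:
        raise ValueError("at least one region is required")
    counts = Counter(labels)
    total = sum(counts.values())
    return sum(weights[cls] * n for cls, n in counts.items()) / total


regions = [
    {"good": 0.9, "medium": 0.3, "bad": 0.1},
    {"good": 0.2, "medium": 0.7, "bad": 0.4},
    {"good": 0.1, "medium": 0.2, "bad": 0.8},
    {"good": 0.8, "medium": 0.1, "bad": 0.1},
]
score = cumulative_quality_score(regions)
```

With these assumed weights, two "good" regions, one "medium" region, and one "bad" region give a score of (1.0 + 1.0 + 0.5 + 0.0) / 4 = 0.625.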
Abstract:
The invention relates to an image processing method and system for constructing a composite image with extended depth of field. The composite image may be constructed from a plurality of source images of a scene stored in an image stack. The method includes aligning the images in the image stack such that every image in the image stack is aligned with the other images in the stack, performing illumination and color correction on the aligned images in the image stack, generating an energy matrix for each illumination and color corrected image in the image stack by computing the energy content of each pixel, generating a raw index map that contains the location of every pixel having the maximum energy level among all the images in the image stack, generating a degree of defocus map, and constructing the composite image.
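The energy matrix and raw index map steps can be sketched in a few lines. The abstract does not define the energy measure, so the squared 4-neighbour Laplacian used here is an assumption. Alignment, illumination and color correction, and the degree of defocus map are omitted. All function names are hypothetical.

```python
def laplacian_energy(img):
    """Per-pixel energy as the squared 4-neighbour Laplacian (assumed measure)."""
    h, w = len(img), len(img[0])

    def px(y, x):
        # Replicate edge pixels outside the image border.
        return img[min(max(y, 0), h - 1)][min(max(x, 0), w - 1)]

    return [
        [(px(y - 1, x) + px(y + 1, x) + px(y, x - 1) + px(y, x + 1)
          - 4 * px(y, x)) ** 2 for x in range(w)]
        for y in range(h)
    ]


def raw_index_map(stack):
    """For each pixel, the index of the stack image with maximum energy."""
    energies = [laplacian_energy(img) for img in stack]
    h, w = len(stack[0]), len(stack[0][0])
    return [
        [max(range(len(stack)), key=lambda k: energies[k][y][x])
         for x in range(w)]
        for y in range(h)
    ]


def build_composite(stack, index_map):
    """Copy each pixel from the stack image selected by the index map."""
    return [[stack[k][y][x] for x, k in enumerate(row)]
            for y, row in enumerate(index_map)]


flat = [[5, 5, 5], [5, 5, 5], [5, 5, 5]]    # no detail anywhere
sharp = [[0, 0, 0], [0, 9, 0], [0, 0, 0]]   # strong detail at the centre
index_map = raw_index_map([flat, sharp])
result = build_composite([flat, sharp], index_map)
```

Ties in energy resolve to the earliest image in the stack, so the corner pixels (zero energy in both images) come from `flat` while the centre comes from `sharp`.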
Abstract:
This disclosure relates to a method and system for detecting orientation. The method includes detecting a plurality of regions in a document image, each region including text data, and determining positional information of each of the regions; for each of the plurality of regions, determining a region orientation to be one of a first orientation or a second orientation based on the height and width of the region; determining a ratio of the number of regions having the first orientation to the number of regions having the second orientation; determining the page orientation of the image as a third orientation or the second orientation, or rotating the image by 90° in the counter-clockwise direction, based on the ratio; determining first optical character recognition (OCR) data and second OCR data corresponding to the image and the image rotated by 180°, respectively; and determining the number of correct words in the first OCR data and the second OCR data based on a comparison with dictionary data.
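Two of the steps above, the region orientation ratio and the dictionary-based comparison of OCR results, can be sketched as follows. The labels "horizontal" and "vertical", the width-versus-height rule, and the tie-breaking choice are assumptions made for illustration. The OCR text is assumed to be already available as plain strings.

```python
def region_orientation(width, height):
    """Label a text region by its aspect (assumed rule: wider means horizontal)."""
    return "horizontal" if width >= height else "vertical"


def orientation_ratio(regions):
    """Ratio of horizontal to vertical regions; regions are (width, height) pairs."""
    labels = [region_orientation(w, h) for w, h in regions]
    n_horizontal = labels.count("horizontal")
    n_vertical = labels.count("vertical")
    return n_horizontal / n_vertical if n_vertical else float("inf")


def count_correct_words(ocr_text, dictionary):
    """Count OCR tokens that appear in the dictionary after light cleanup."""
    return sum(
        1 for word in ocr_text.lower().split()
        if word.strip(".,;:!?") in dictionary
    )


def pick_upright(ocr_0, ocr_180, dictionary):
    """Return 0 or 180 depending on which OCR pass yields more dictionary words."""
    if count_correct_words(ocr_0, dictionary) >= count_correct_words(ocr_180, dictionary):
        return 0
    return 180


words = {"the", "quick", "brown", "fox"}
ratio = orientation_ratio([(100, 20), (80, 15), (10, 60)])
rotation = pick_upright("The quick brown fox.", "xof nworq kciuq eht", words)
```

Here two of three regions are wider than tall, so the ratio is 2.0, and the unrotated OCR pass matches all four dictionary words, so no 180° rotation is chosen.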