摘要:
Systems and methods of extracting from an input image a graphical bar code containing graphically encoded information are described. In one aspect, a document template is matched to the input image. The document template is selected from a set of document templates each having a respective predetermined page layout corresponding to a respective document type and including a predetermined graphical bar code location. The input image is cropped based on information relating to the graphical bar code location in the page layout of the document template matched to the input image to produce a cropped graphical bar code candidate for decoding.
摘要:
Systems and methods of extracting from an input image a graphical bar code containing graphically encoded information are described. In one aspect, a document template is matched to the input image. The document template is selected from a set of document templates each having a respective predetermined page layout corresponding to a respective document type and including a predetermined graphical bar code location. The input image is cropped based on information relating to the graphical bar code location in the page layout of the document template matched to the input image to produce a cropped graphical bar code candidate for decoding.
摘要:
A method and system for halftoning images that uses error diffusion with partial dots is provided. First, an input picture element (input pixel) that has a picture level (e.g., gray level) is received. Next, a reproducible gray level is generated based on the gray level of an input pixel. Then, a corrected gray level is generated based on the gray level of an input pixel and an error amount (e.g., error propagated or diffused from adjacent areas or pixels). A determination is made whether the corrected gray level is in a predetermined relationship with a threshold. When the corrected gray level is in a predetermined relationship with the threshold, the reproducible gray level (i.e., partial dot size) is provided as output. When the corrected gray level is not in a predetermined relationship with the threshold, a zero value is provided as output. It is noted that the output gray level and the corrected gray level are provided to an error distribution module for calculating an error and for propagating or diffusing the error to future adjacent areas or pixels.
摘要:
A method and system for capturing images. First, a preview image of a scene is captured. Next, an automatic determination is made whether the scene is a document. When it is determined that the scene is a document, at least one camera control is set to a value that is tailored for document capture. The scene is then captured using the set camera controls. Image processing that is tailored for documents is then performed on the captured scene.
摘要:
A system and method for enhancing scanned document images utilizes an estimated background luminance of a given digital image to remove or reduce visual “see-through” noise. The estimated background luminance is dependent on the luminance values of the edges pixels of detected text edges of the image. In one embodiment, the estimated background luminance is generated using only the edge pixels that are on the lighter side of the detected edges. In addition to the see-through removal, the system and method may further enhance the scanned document images by removing color fringes and sharpening and/or darkening edges of text contained in the images.
摘要:
Semantically ranking content in a website (110) with a computerized ranking device (105) includes: parsing content from the website (110) into multiple autonomous content blocks (415-1 to 415-17) with the computerized ranking device (105) and assigning an importance ranking with said computerized ranking device (105) to each of the content blocks (415-1 to 415-17) based on a degree to which a substance of the content block (415-1 to 415-17) is relevant to one of a plurality of predefined categories.
摘要:
A method and system for extracting Web content is disclosed. In one embodiment, Web content in a Webpage is extracted by identifying paragraphs in the Web content based on line-break node determination. A range of text-body associated with the identified paragraphs is then identified using a maximum scoring subsequence. Further, the identified text-body is refined using a heuristic rule of substantially horizontal alignment. Furthermore, one or more titles and one or more images associated with the Web content are extracted. Moreover, the Web content including the identified paragraphs, the one or more titles and the one or more images are outputted.
摘要:
Disclosed is a computer-implemented method of determining smarty between first and second elements of an electronic document. The method uses a computer to calculate a plurality of measures of similarity between the first and second elements in at least two representations of the electronic document. A computer program product and system implementing this method are also disclosed.
摘要:
A method for generating a panoramic image that enables a user to obtain panoramic photographs with a digital camera without the aid of a computer system or specialized lenses. A digital camera according to the present techniques captures a series of image frames as a user pans the digital camera through a panoramic image scene and combines the captured image frames while the image frames are being captured.
摘要:
A method performed by a processing system is provided. The method comprises detecting an artifact in a first frame of a digital video using a plurality of edges identified in the first frame and replacing a region that encompasses the artifact in the first frame with a corresponding region from a second frame.