Abstract:
Techniques for binarization and extraction of information from image data are disclosed. The inventive concepts include independently binarizing portions of the image data on the basis of individual features, e.g. per connected component, and using multiple different binarization thresholds to obtain the best possible binarization result for each portion of the image data. Determining the quality of each binarization result may be based on attempted recognition and/or extraction of information therefrom. Independently binarized portions may be assembled into a contiguous result. In one embodiment, a method includes: identifying a region of interest within a digital image; generating a plurality of binarized images based on the region of interest using different binarization thresholds; and extracting data from some or all of the plurality of binarized images. The extracted data includes connected components that overlap and/or are obscured by unique background. Corresponding systems and computer program products are disclosed.
Abstract:
Systems, computer program products, and techniques for detecting and/or reconstructing objects depicted in digital image data within a three-dimensional space are disclosed. The concepts utilize internal features for detection and reconstruction, avoiding reliance on information derived from location of edges. The inventive concepts provide an improvement over conventional techniques since objects may be detected and/or reconstructed even when edges are obscured or not depicted in the digital image data. In one aspect, detecting a document depicted in a digital image includes: detecting a plurality of identifying features of the document, wherein the plurality of identifying features are located internally with respect to the object; projecting a location of one or more edges of the document based at least in part on the plurality of identifying features; and outputting the projected location of the one or more edges of the document to a display of a computer, and/or a memory.
Abstract:
Systems, computer program products, and techniques for detecting and/or reconstructing objects depicted in digital image data within a three-dimensional space are disclosed, according to various exemplary embodiments. The inventive concepts uniquely utilize internal features to accomplish reconstruction, thereby avoiding reliance on reconstructing objects based on information derived from location of edges. The inventive concepts thus provide an improvement over conventional object reconstruction since objects may be detected and/or reconstructed even when edges are obscured or not depicted in the digital image data. In one aspect, reconstructing an object depicted in a digital image includes using a processor to: detect a plurality of identifying features of the object, where the identifying features are located internally with respect to the object; and reconstruct the digital image of the object within a three dimensional coordinate space based at least in part on some or all of the identifying features.
Abstract:
Techniques for binarization and extraction of information from image data are disclosed. The inventive concepts include independently binarizing portions of the image data on the basis of individual features, e.g. per connected component, and using multiple different binarization thresholds to obtain the best possible binarization result for each portion of the image data. Determining the quality of each binarization result may be based on attempted recognition and/or extraction of information therefrom. Independently binarized portions may be assembled into a contiguous result. In one embodiment, a method includes: identifying a region of interest within a digital image; generating a plurality of binarized images based on the region of interest using different binarization thresholds; subjecting the region of interest within a digital image to a plurality of thresholding and extraction iterations; and extracting data from some or all of the plurality of binarized images. Corresponding systems and computer program products are disclosed.
Abstract:
Systems, computer program products, and techniques for reconstructing objects depicted in digital image data within a three-dimensional space are disclosed, according to various exemplary embodiments. The inventive concepts uniquely utilize internal features to accomplish reconstruction, thereby avoiding reliance on reconstructing objects based on information derived from location of edges. The inventive concepts thus provide an improvement over conventional object reconstruction since objects may be reconstructed even when edges are obscured or not depicted in the digital image data. In one aspect, a computer-implemented method of reconstructing an object depicted in a digital image includes: detecting a plurality of identifying features of the object, wherein the plurality of identifying features are located internally with respect to the object; and reconstructing the digital image of the object within a three dimensional coordinate space based at least in part on some or all of the plurality of identifying features.
Abstract:
A computer program product includes program instructions configured to cause a processor, to: perform optical character recognition (OCR) on an image of a document; extract an identifier of the document from the image based at least in part on the OCR; compare at least portions of the identifier with content from one or more reference data sources; and determine whether the identifier is valid based at least in part on the comparison. The content comprises global address information; while the content from the reference is derived from geographic information. Deriving the content from the geographic information includes: obtaining the geographic information; and parsing the geographic information according to a set of predefined heuristic rules, where the heuristic rules are configured to normalize the global address information obtained from the one or more sources according to a single convention for representing address information.
Abstract:
According to one embodiment, a computer-implemented method for cleaning up a data set having a possible incorrect label includes: selecting a plurality of training documents; estimating a quality of an organization of a plurality of categories; and determining whether the quality of the organization is greater than a predetermined quality threshold. Corresponding system and computer program product embodiments are also presented. Other aspects and advantages of the present invention will become apparent from the following detailed description, which, when taken in conjunction with the drawings, illustrate by way of example the principles of the invention.
Abstract:
A method includes receiving or capturing a digital image using a mobile device, and using a processor of the mobile device to: determine whether an object depicted in the digital image belongs to a particular object class among a plurality of object classes; determine one or more object features of the object based at least in part on the particular object class at least partially in response to determining the object belongs to the particular object class; build or select an extraction model based at least in part on the one or more determined object features; and extract data from the digital image using the extraction model. The extraction model excludes, and/or the extraction process does not utilize, optical character recognition (OCR) techniques. Related systems and computer program products are also disclosed.
Abstract:
A method is provided for organizing data sets. In use, an automatic decision system is created or updated for determining whether data elements fit a predefined organization or not, where the decision system is based on a set of preorganized data elements. A plurality of data elements is organized using the decision system. At least one organized data element is selected for output to a user based on a score or confidence from the decision system for the at least one organized data element. Additionally, at least a portion of the at least one organized data element is output to the user. A response is received from the user comprising at least one of a confirmation, modification, and a negation of the organization of the at least one organized data element. The automatic decision system is recreated or updated based on the user response. Other embodiments are also presented.
Abstract:
According to one embodiment, a method includes: capturing an image of a financial document using a camera of a mobile device; performing optical character recognition (OCR) on the image of the financial document; extracting an identifier of the financial document from the image based at least in part on the OCR; associating the image of the financial document with metadata descriptive of one or more of the financial document and financial information relating to the financial document; and storing the image of the financial document and the associated metadata to a memory of the mobile device. Exemplary systems and computer program products are also disclosed.