Abstract:
Techniques for binarization and extraction of information from image data are disclosed. The inventive concepts include independently binarizing portions of the image data on the basis of individual features, e.g. per connected component, and using multiple different binarization thresholds to obtain the best possible binarization result for each portion of the image data. Determining the quality of each binarization result may be based on attempted recognition and/or extraction of information therefrom. Independently binarized portions may be assembled into a contiguous result. In one embodiment, a method includes: identifying a region of interest within a digital image; generating a plurality of binarized images based on the region of interest using different binarization thresholds; and extracting data from some or all of the plurality of binarized images. The extracted data includes connected components that overlap and/or are obscured by unique background. Corresponding systems and computer program products are disclosed.
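The multi-threshold idea described above can be illustrated with a minimal sketch. It assumes a region of interest given as rows of grayscale values and a caller-supplied scoring function. The names `binarize`, `best_binarization`, and `foreground_balance` are hypothetical, and the score used here is only a stand-in for the recognition-based quality measure the abstract mentions.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Image = List[List[int]]  # rows of grayscale values in the range 0-255


def binarize(region: Image, threshold: int) -> Image:
    """Map pixels darker than `threshold` to 1 (foreground), all others to 0."""
    return [[1 if px < threshold else 0 for px in row] for row in region]


def best_binarization(
    region: Image,
    thresholds: Sequence[int],
    score: Callable[[Image], float],
) -> Tuple[Optional[int], Optional[Image]]:
    """Binarize the region once per threshold and keep the highest-scoring result."""
    best_t, best_img, best_score = None, None, float("-inf")
    for t in thresholds:
        candidate = binarize(region, t)
        s = score(candidate)
        if s > best_score:
            best_t, best_img, best_score = t, candidate, s
    return best_t, best_img


def foreground_balance(img: Image) -> float:
    """Hypothetical stand-in score: prefers about 25% foreground pixels.

    A real system would score by attempted recognition or extraction instead.
    """
    total = sum(len(row) for row in img)
    foreground = sum(sum(row) for row in img)
    return -abs(foreground / total - 0.25)


# Toy region: a dark stroke (90) with a fainter pixel (130) on a light background.
region = [
    [200, 200, 200, 200],
    [200, 90, 200, 200],
    [200, 90, 130, 200],
    [200, 90, 200, 200],
]
threshold, result = best_binarization(region, [50, 100, 150, 250], foreground_balance)
```

Running the same selection once per connected component, rather than once per image, would give each component its own threshold, in the spirit of the per-feature binarization described above.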
Abstract:
Techniques for binarization and extraction of information from image data are disclosed. The inventive concepts include independently binarizing portions of the image data on the basis of individual features, e.g. per connected component, and using multiple different binarization thresholds to obtain the best possible binarization result for each portion of the image data. Determining the quality of each binarization result may be based on attempted recognition and/or extraction of information therefrom. Independently binarized portions may be assembled into a contiguous result. In one embodiment, a method includes: identifying a region of interest within a digital image; generating a plurality of binarized images based on the region of interest using different binarization thresholds; subjecting the region of interest within a digital image to a plurality of thresholding and extraction iterations; and extracting data from some or all of the plurality of binarized images. Corresponding systems and computer program products are disclosed.
Abstract:
In various embodiments, methods, systems, and computer program products for processing digital images captured by a mobile device are disclosed. The exemplary image processing techniques are coupled with inbound and outbound communications protocols and workflows configured to facilitate closed-loop processing, such that a method includes initiating a workflow; providing one or more of case information and raw data to the workflow; processing one or more of the case information and the raw data to generate a processing result; storing at least some of the case information in association with the processing result, wherein the associated case information acts as an identifier of the processing result; transmitting at least the processing result and the identifier; receiving, in response to the transmitting, a reply comprising the identifier; and retrieving at least the processing result using the identifier.
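The closed-loop flow described above can be sketched as follows. This is an illustration under stated assumptions, not the disclosed implementation: `ResultStore`, `process`, and `transmit` are hypothetical names, the processing step is a placeholder, and the "remote party" is simulated by a function that echoes the identifier back in its reply.

```python
from typing import Any, Dict


class ResultStore:
    """In-memory store that keys each processing result by case information."""

    def __init__(self) -> None:
        self._results: Dict[str, Any] = {}

    def store(self, case_id: str, result: Any) -> str:
        # The case information doubles as the identifier of the result.
        self._results[case_id] = result
        return case_id

    def retrieve(self, identifier: str) -> Any:
        return self._results[identifier]


def process(raw_data: str) -> Dict[str, int]:
    """Placeholder processing step (assumption): summarize the raw data."""
    return {"length": len(raw_data)}


def transmit(result: Dict[str, int], identifier: str) -> Dict[str, str]:
    """Stand-in for outbound communication; the reply carries the identifier."""
    return {"identifier": identifier, "status": "received"}


# Workflow: process, store under the case identifier, transmit, then
# use the identifier in the reply to retrieve the stored result.
store = ResultStore()
case_info = {"case_id": "CASE-001"}
result = process("scanned invoice text")
identifier = store.store(case_info["case_id"], result)
reply = transmit(result, identifier)
retrieved = store.retrieve(reply["identifier"])
```

Because the reply carries only the identifier, the original result never needs to make the round trip in order to be recovered.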
Abstract:
According to one embodiment, a system includes a processor and logic in and/or executable by the processor to cause the processor to: initiate a capture operation using an image capture component of a mobile device, the capture operation comprising: capturing video data; and estimating a plurality of motion vectors corresponding to motion of the image capture component during the capture operation; detect a document depicted in the video data; track a position of the detected document throughout the video data; select a plurality of images using the image capture component of the mobile device, wherein the selection is based at least in part on: the tracked position of the detected document; and the estimated motion vectors; and generate a composite image based on at least some of the selected plurality of images.
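The selection step alone might look like the sketch below, which assumes each frame already carries an estimated motion vector and a tracked document center in normalized coordinates. The `Frame` type, the `select_frames` function, and both thresholds are hypothetical; compositing the selected images is not shown.

```python
from dataclasses import dataclass
from math import hypot
from typing import List, Sequence, Tuple


@dataclass
class Frame:
    index: int
    motion: Tuple[float, float]      # estimated camera motion vector (dx, dy)
    doc_center: Tuple[float, float]  # tracked document center, normalized to 0..1


def select_frames(
    frames: Sequence[Frame],
    max_motion: float = 2.0,
    max_offset: float = 0.25,
) -> List[Frame]:
    """Keep frames where the camera is steady and the document is near the center."""
    selected = []
    for frame in frames:
        steady = hypot(*frame.motion) <= max_motion
        offset = hypot(frame.doc_center[0] - 0.5, frame.doc_center[1] - 0.5)
        if steady and offset <= max_offset:
            selected.append(frame)
    return selected


frames = [
    Frame(0, (0.5, 0.2), (0.5, 0.5)),
    Frame(1, (6.0, 1.0), (0.5, 0.5)),    # rejected: too much camera motion
    Frame(2, (0.1, 0.1), (0.95, 0.9)),   # rejected: document far from center
    Frame(3, (1.0, 1.0), (0.6, 0.45)),
]
selected = select_frames(frames)
```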
Abstract:
A computer program product includes program instructions configured to cause a processor to: perform optical character recognition (OCR) on an image of a document; extract an identifier of the document from the image based at least in part on the OCR; compare at least portions of the identifier with content from one or more reference data sources; and determine whether the identifier is valid based at least in part on the comparison. The content comprises global address information, and the content from the one or more reference data sources is derived from geographic information. Deriving the content from the geographic information includes: obtaining the geographic information; and parsing the geographic information according to a set of predefined heuristic rules, where the heuristic rules are configured to normalize the global address information obtained from the one or more sources according to a single convention for representing address information.
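As a rough illustration of rule-based normalization followed by comparison, the sketch below lowercases addresses, strips punctuation, and abbreviates a few street-type words before checking membership in a reference set. The rule list and the function names are assumptions chosen for the example; the abstract does not specify the actual heuristics, and the OCR step is omitted.

```python
import re
from typing import Iterable, List, Set, Tuple

# Hypothetical heuristic rules: map spelled-out street types to one convention.
RULES: List[Tuple[str, str]] = [
    (r"\bstreet\b", "st"),
    (r"\bavenue\b", "ave"),
    (r"\broad\b", "rd"),
]


def normalize_address(text: str) -> str:
    """Lowercase, drop periods and commas, apply the rules, collapse whitespace."""
    text = re.sub(r"[.,]", " ", text.lower())
    for pattern, replacement in RULES:
        text = re.sub(pattern, replacement, text)
    return " ".join(text.split())


def build_reference(addresses: Iterable[str]) -> Set[str]:
    """Derive normalized reference content from raw address strings."""
    return {normalize_address(address) for address in addresses}


def is_valid(identifier: str, reference: Set[str]) -> bool:
    """Treat the extracted identifier as valid if its normalized form is known."""
    return normalize_address(identifier) in reference


reference = build_reference(
    ["12 Main Street, Springfield", "4 Oak Avenue, Shelbyville"]
)
```

Normalizing both the reference content and the extracted identifier with the same rules is what lets `12 MAIN ST. Springfield` match `12 Main Street, Springfield`.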
Abstract:
In one embodiment, a system includes a processor and logic executable by the processor. The logic is configured to cause the processor to: capture video data using a mobile device, the video data comprising a plurality of frames; determine whether one or more of the frames depict a document exhibiting one or more defining characteristics; determine whether one or more of the frame(s) determined to depict the document also satisfy one or more predetermined quality control criteria; and in response to determining one or more of the frames depict the document and also satisfy the one or more predetermined quality control criteria, automatically capture an image of the document. Corresponding computer program products are also disclosed.
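The two-stage check described above (document present, then quality criteria met) can be sketched as below. The sketch assumes per-frame measurements are already available; `FrameStats`, its fields, and the threshold values are hypothetical and stand in for whatever detector and quality metrics a real system would use.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Tuple


@dataclass
class FrameStats:
    index: int
    has_document: bool  # hypothetical output of a document detector
    sharpness: float    # assumed normalized to 0..1
    brightness: float   # assumed normalized to 0..1


def passes_quality(
    frame: FrameStats,
    min_sharpness: float = 0.6,
    brightness_range: Tuple[float, float] = (0.3, 0.9),
) -> bool:
    """Apply the (assumed) predetermined quality control criteria."""
    low, high = brightness_range
    return frame.sharpness >= min_sharpness and low <= frame.brightness <= high


def first_capturable(frames: Iterable[FrameStats]) -> Optional[int]:
    """Return the index of the first frame that should trigger automatic capture."""
    for frame in frames:
        if frame.has_document and passes_quality(frame):
            return frame.index
    return None


frames = [
    FrameStats(0, False, 0.9, 0.5),  # sharp, but no document detected
    FrameStats(1, True, 0.4, 0.5),   # document present, but too blurry
    FrameStats(2, True, 0.8, 0.6),   # document present and quality is acceptable
    FrameStats(3, True, 0.9, 0.6),
]
capture_index = first_capturable(frames)
```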
Abstract:
In one approach, a method includes: capturing an image of a document using a camera of a mobile device; performing optical character recognition (OCR) on the image of the document; extracting data of interest from the image based at least in part on the OCR; and validating the extracted data of interest against reference information stored on the mobile device. In another embodiment, a method includes: capturing an image of a document using a camera of a mobile device; performing optical character recognition (OCR) on the image of the document; extracting data of interest from the image based at least in part on the OCR; and validating authenticity of the document based on comparing some or all of the extracted data of interest to reference information stored on the mobile device.
Abstract:
Systems, methods, and computer program products for smart, automated capture of textual information using optical sensors of a mobile device are disclosed. The capture and provision are context-aware: the techniques determine the context of the optical input and invoke a contextually appropriate workflow based thereon. The techniques also provide the capability to normalize, correct, and/or validate the captured optical input and to provide the corrected, normalized, validated, etc. information to the contextually appropriate workflow. Other information needed by the workflow and available to the mobile device optical sensors may also be captured and provided in a single automatic process. As a result, the overall process of capturing information from optical input using a mobile device, invoking an appropriate workflow, and providing captured information to the workflow is significantly simplified and improved in terms of accuracy of data transfer/entry, speed and efficiency of workflows, and user experience.
Abstract:
In several embodiments, methods, systems, and computer program products for processing digital images captured by a mobile device are disclosed. The techniques include detecting medical documents and/or documents relevant to an insurance claim by defining candidate edge points based on the captured image data and defining four sides of a tetragon based on at least some of the candidate edge points. In the case of an insurance claim process, the techniques also include determining whether the document is relevant to an insurance claim; and in response to determining the document is relevant to the insurance claim, submitting the image data, information extracted from the image data, or both to a remote server for claims processing. The image capture and processing techniques further facilitate processing of medical documents and/or insurance claims with a plurality of additional features that may be used individually or in combination in various embodiments.
Abstract:
A method includes: capturing or receiving at least one image of one or more identity documents (IDs) using a mobile device; determining identifying information from one or more of the IDs; building an ID profile based on the identifying information; storing the ID profile to a memory of the mobile device; invoking a workflow configured to facilitate a business transaction; detecting a predetermined stimulus in the workflow, the stimulus relating to the business transaction; providing at least a portion of the ID profile to the workflow in response to detecting the predetermined stimulus; and driving at least a portion of the workflow using the provided portion of the ID profile. Related systems and computer program products are also disclosed.
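A minimal sketch of the profile-and-stimulus idea follows. It assumes identifying information has already been extracted from each ID as key-value pairs, and it models a "predetermined stimulus" as a workflow step of the form `request:<field>`. Both the step format and the function names are hypothetical choices made for illustration.

```python
from typing import Dict, List, Sequence


def build_profile(identifying_info: Sequence[Dict[str, str]]) -> Dict[str, str]:
    """Merge information from several IDs; the first value seen for a field wins."""
    profile: Dict[str, str] = {}
    for info in identifying_info:
        for field, value in info.items():
            profile.setdefault(field, value)
    return profile


def run_workflow(steps: Sequence[str], profile: Dict[str, str]) -> Dict[str, str]:
    """Fill requested fields from the profile whenever a stimulus is detected."""
    form: Dict[str, str] = {}
    for step in steps:
        if step.startswith("request:"):  # the assumed stimulus format
            field = step.split(":", 1)[1]
            if field in profile:
                form[field] = profile[field]
    return form


profile = build_profile(
    [
        {"name": "Jane Doe", "dob": "1990-01-01"},
        {"name": "J. Doe", "license_no": "X123"},
    ]
)
steps: List[str] = ["open_form", "request:name", "request:license_no", "request:phone"]
form = run_workflow(steps, profile)
```

Fields the profile cannot supply, such as `phone` here, are simply left for the user to provide.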