Abstract:
Systems, methods, and computer program products are disclosed and include: initiating a capture operation using an image capture component of a mobile device, the capture operation comprising: capturing video data; and estimating a plurality of motion vectors corresponding to motion of the image capture component during the capture operation. The systems, methods, and computer program products also include: detecting a document depicted in the video data; tracking a position of the detected document throughout the video data; selecting a plurality of images using the image capture component of the mobile device, wherein the selection is based at least in part on the tracked position of the detected document and the estimated motion vectors; and generating a composite image based on at least some of the selected plurality of images.
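The selection step can be illustrated with a minimal sketch. The abstract states only that selection depends on the tracked document position and the estimated motion vectors; the specific criteria below (document center inside a central margin of the frame, motion magnitude under a limit) and all names such as `Frame`, `select_frames`, `margin`, and `max_motion` are assumptions made for illustration, not the disclosed method.

```python
from dataclasses import dataclass
from math import hypot
from typing import List, Optional, Tuple


@dataclass
class Frame:
    index: int
    # Tracked document center in pixels; None when the document was not detected.
    doc_center: Optional[Tuple[float, float]]
    # Estimated camera motion for this frame as (dx, dy) in pixels.
    motion: Tuple[float, float]


def select_frames(frames: List[Frame], width: float, height: float,
                  margin: float = 0.1, max_motion: float = 2.0) -> List[Frame]:
    """Keep frames whose document is tracked, lies inside the central
    region of the frame, and whose camera motion magnitude is small.
    Both criteria are hypothetical stand-ins."""
    x_min, x_max = margin * width, (1 - margin) * width
    y_min, y_max = margin * height, (1 - margin) * height
    selected = []
    for frame in frames:
        if frame.doc_center is None:
            continue  # document lost in this frame
        if hypot(*frame.motion) > max_motion:
            continue  # too much camera motion, likely blurred
        x, y = frame.doc_center
        if x_min <= x <= x_max and y_min <= y <= y_max:
            selected.append(frame)
    return selected


frames = [
    Frame(0, (50, 50), (0.5, 0.5)),
    Frame(1, None, (0.0, 0.0)),
    Frame(2, (50, 50), (5.0, 0.0)),
    Frame(3, (95, 50), (0.0, 0.0)),
    Frame(4, (40, 60), (1.0, 1.0)),
]
kept = select_frames(frames, width=100, height=100)
print([f.index for f in kept])
```

The frames that survive this filter would then be candidates for the composite image; how the composite is generated is not specified in the abstract and is not sketched here.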
Abstract:
In various embodiments, methods, systems, and computer program products for processing digital images captured by a mobile device are disclosed. Myriad features enable and/or facilitate processing of such digital images using a mobile device that would otherwise be technically impossible or impractical, and furthermore address unique challenges presented by images captured using a camera rather than a traditional flat-bed scanner, paper-feed scanner, or multifunction peripheral. Particularly advantageous features include robustly detecting edges of one or more documents depicted in the digital image data, and defining/locating document pages at least partially on this basis. The statistical approaches employed enable robust yet computationally efficient techniques to accomplish page detection, and associated functions, using hardware typically included in mobile devices and within practical (especially temporal) limits imposed by device manufacturers, users, and associated and/or downstream computational and/or business processes.
Abstract:
In various embodiments, methods, systems, and computer program products for processing digital images captured by a mobile device are disclosed. Myriad features enable and/or facilitate processing of such digital images using a mobile device that would otherwise be technically impossible or impractical, and furthermore address unique challenges presented by images captured using a camera rather than a traditional flat-bed scanner, paper-feed scanner, or multifunction peripheral.
Abstract:
Techniques for binarization and extraction of information from image data are disclosed. The inventive concepts include independently binarizing portions of the image data on the basis of individual features, e.g., per connected component, and using multiple different binarization thresholds to obtain the best possible binarization result for each portion of the image data. Determining the quality of each binarization result may be based on attempted recognition and/or extraction of information therefrom. Independently binarized portions may be assembled into a contiguous result. In one embodiment, a method includes: identifying a region of interest within a digital image; generating a plurality of binarized images based on the region of interest using different binarization thresholds; subjecting the region of interest within the digital image to a plurality of thresholding and extraction iterations; and extracting data from some or all of the plurality of binarized images. Corresponding systems and computer program products are disclosed.
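A minimal sketch of the multi-threshold idea follows: one region is binarized at several thresholds and the result with the highest quality score is kept. The abstract says quality may be judged by attempted recognition or extraction; the `score` callback here is a hypothetical stand-in for that step (it compares the foreground fraction against an assumed ink ratio), and the function names and sample values are illustrative only.

```python
from typing import Callable, List, Sequence, Tuple

Image = List[List[int]]


def binarize(region: Image, threshold: int) -> Image:
    """Mark pixels darker than the threshold as foreground (1)."""
    return [[1 if pixel < threshold else 0 for pixel in row] for row in region]


def best_binarization(region: Image, thresholds: Sequence[int],
                      score: Callable[[Image], float]) -> Tuple[int, Image]:
    """Binarize the region once per threshold and return the
    (threshold, binarized image) pair with the highest score."""
    candidates = [(t, binarize(region, t)) for t in thresholds]
    return max(candidates, key=lambda candidate: score(candidate[1]))


def ink_ratio_score(binary: Image, expected: float = 1 / 3) -> float:
    """Hypothetical quality measure: closeness of the foreground
    fraction to an assumed ink ratio. A real system would instead
    attempt recognition or extraction on each candidate."""
    total = sum(len(row) for row in binary)
    foreground = sum(sum(row) for row in binary)
    return -abs(foreground / total - expected)


# Grayscale region: a dark stroke in the first column, light paper elsewhere.
region = [
    [20, 200, 210],
    [30, 190, 220],
    [25, 120, 205],
]
threshold, binary = best_binarization(region, (64, 128, 192), ink_ratio_score)
print(threshold, binary)
```

Running the same selection independently per region (for example per connected component) and stitching the winners together would correspond to the "assembled into a contiguous result" step described above.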
Abstract:
In several embodiments, methods, systems, and computer program products for processing digital images captured by a mobile device are disclosed. The techniques include capturing image data depicting a document; defining a plurality of candidate edge points within the image data; and defining four sides of a tetragon based on at least some of the plurality of candidate edge points; wherein each side of the tetragon corresponds to a different side of the document; wherein an area of the tetragon comprises at least a threshold percentage of a total area of the digital image; and wherein the tetragon bounds the digital representation of the document.
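The area condition on the tetragon can be checked with the shoelace formula. The sketch below assumes the four corners are given in order around the tetragon; the function names and the `min_fraction` default are illustrative assumptions, since the abstract does not state the threshold percentage.

```python
from typing import Sequence, Tuple

Point = Tuple[float, float]


def polygon_area(points: Sequence[Point]) -> float:
    """Area of a simple polygon via the shoelace formula.
    Points must be listed in order around the boundary."""
    total = 0.0
    n = len(points)
    for i in range(n):
        x1, y1 = points[i]
        x2, y2 = points[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def tetragon_meets_area_threshold(corners: Sequence[Point],
                                  image_width: float, image_height: float,
                                  min_fraction: float = 0.25) -> bool:
    """Return True when the tetragon covers at least min_fraction of the
    image area. The 0.25 default is an assumed value for illustration."""
    if len(corners) != 4:
        raise ValueError("a tetragon needs exactly four corners")
    return polygon_area(corners) >= min_fraction * image_width * image_height


corners = [(10, 10), (90, 10), (90, 60), (10, 60)]
print(polygon_area(corners))
print(tetragon_meets_area_threshold(corners, image_width=100, image_height=80))
```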
Abstract:
In various embodiments, computer program products for detecting, estimating, calculating, etc. characteristics of a document based on reference objects depicted on the document are disclosed. In one approach, a computer program product for processing a digital image depicting a document includes instructions executable by a computer for analyzing the digital image to determine one or more of a presence and a location of one or more reference objects; determining one or more geometric characteristics of at least one of the reference objects; defining one or more region(s) of interest based at least in part on one or more of the determined geometric characteristics; and detecting a presence or absence of an edge of the document within each defined region of interest. Additional embodiments leverage the type of document depicted in the image, multiple frames of image data, and/or calculate or extrapolate document edges rather than locating edges in the image.
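As a rough illustration of defining a region of interest from a reference object and then testing it for an edge, consider the sketch below. Everything specific in it is an assumption: the region is placed below the reference object's bounding box at a distance scaled by the box height, and an edge is declared when adjacent row means differ by at least `min_step`. The abstract does not prescribe either rule, and the function names are hypothetical.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom), bottom/right exclusive


def roi_below(ref_box: Box, offset_factor: float, band_factor: float,
              image_height: int) -> Box:
    """Define a horizontal band below the reference object. Its distance
    and thickness are multiples of the reference object's height
    (a hypothetical geometric rule)."""
    left, top, right, bottom = ref_box
    ref_height = bottom - top
    roi_top = min(image_height, bottom + int(offset_factor * ref_height))
    roi_bottom = min(image_height, roi_top + int(band_factor * ref_height))
    return (left, roi_top, right, roi_bottom)


def has_horizontal_edge(image: List[List[int]], roi: Box, min_step: int = 50) -> bool:
    """Report an edge when mean intensity jumps by at least min_step
    between two adjacent rows inside the region of interest."""
    left, top, right, bottom = roi
    if right <= left or bottom - top < 2:
        return False  # region too small to contain a row-to-row transition
    means = [sum(image[y][left:right]) / (right - left) for y in range(top, bottom)]
    return any(abs(b - a) >= min_step for a, b in zip(means, means[1:]))


# 10x8 image: bright document (200) in rows 0-6, dark background (30) below.
image = [[200] * 8 for _ in range(7)] + [[30] * 8 for _ in range(3)]
roi = roi_below((2, 1, 6, 3), offset_factor=1.0, band_factor=2.0, image_height=10)
print(roi, has_horizontal_edge(image, roi))
```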
Abstract:
Systems, methods, and computer program products for smart, automated capture of textual information using optical sensors of a mobile device, and selective provision of such textual information to a user interface for facilitating performance of downstream workflows are disclosed. The capture and provision are context-aware: they determine the context of the optical input and optionally invoke a contextually appropriate workflow based thereon. The techniques also provide capability to normalize, correct, and/or validate the captured optical input and provide the corrected, normalized, validated, etc. information to the contextually appropriate workflow. As a result, the overall process of capturing information from optical input using a mobile device, invoking an appropriate workflow, and providing captured information to the workflow is significantly simplified and improved in terms of accuracy of data transfer/entry, speed and efficiency of workflows, and user experience.
Abstract:
A method includes: receiving user input defining a workflow comprising one or more activities and one or more rules; receiving user input defining a user interface (UI) configured to facilitate a user performing the workflow at least in part using a processor of a mobile device; and generating a mobile software application based on the workflow and the UI. In another embodiment, a method includes: instantiating a mobile application on a mobile device; launching a workflow within the mobile application, the workflow comprising one or more activities and one or more rules; rendering one or more user interfaces based at least in part on the workflow; displaying at least one of the user interfaces on a display of the mobile device; receiving user input via at least one of the user interfaces; and modifying the workflow based at least partially on user input. Systems and computer program products are also disclosed.
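One way to picture a workflow made of activities and rules that is modified by user input is the small data structure below. It is only a sketch under stated assumptions: the `Rule` and `Workflow` classes, the idea that a rule appends an extra activity, and the activity names are all hypothetical and do not come from the abstract.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Rule:
    # Predicate evaluated against the user input collected from the UI.
    condition: Callable[[Dict[str, str]], bool]
    # Activity added to the workflow when the condition holds (assumed behavior).
    extra_activity: str


@dataclass
class Workflow:
    activities: List[str]
    rules: List[Rule] = field(default_factory=list)

    def apply_input(self, user_input: Dict[str, str]) -> List[str]:
        """Modify the workflow based on user input by appending the
        activity of every rule whose condition is satisfied."""
        for rule in self.rules:
            if rule.condition(user_input) and rule.extra_activity not in self.activities:
                self.activities.append(rule.extra_activity)
        return self.activities


workflow = Workflow(
    activities=["capture_document", "review"],
    rules=[Rule(lambda data: data.get("doc_type") == "check", "enter_amount")],
)
print(workflow.apply_input({"doc_type": "invoice"}))
print(workflow.apply_input({"doc_type": "check"}))
```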