Iterative recognition-guided thresholding and data extraction

    Publication number: US10242285B2

    Publication date: 2019-03-26

    Application number: US15214351

    Filing date: 2016-07-19

    Applicant: Kofax, Inc.

    Abstract: Techniques for improved binarization and extraction of information from digital image data are disclosed in accordance with various embodiments. The inventive concepts include independently binarizing portions of the image data on the basis of individual features, e.g. per connected component, and using multiple different binarization thresholds to obtain the best possible binarization result for each portion of the image data independently binarized. Determining the quality of each binarization result may be based on attempted recognition and/or extraction of information therefrom. Independently binarized portions may be assembled into a contiguous result. In one embodiment, a method includes: identifying a region of interest within a digital image; generating a plurality of binarized images based on the region of interest using different binarization thresholds; and extracting data from some or all of the plurality of binarized images. Corresponding systems and computer program products are also disclosed.
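The selection step described in this abstract (binarize one region with several thresholds, then keep the result that scores best) can be sketched as follows. This is a minimal illustration, not the patented method: `binarize`, `best_binarization`, and `toy_score` are hypothetical names, and `toy_score` is only a stand-in for the recognition or extraction confidence that the abstract uses to judge quality.

```python
from typing import Callable, List, Sequence, Tuple

Image = List[List[int]]  # grayscale rows, pixel values 0-255


def binarize(region: Image, threshold: int) -> Image:
    """Map pixels darker than the threshold to 1 (foreground), others to 0."""
    return [[1 if px < threshold else 0 for px in row] for row in region]


def toy_score(binary: Image, expected_fraction: float = 0.5) -> float:
    """Hypothetical quality measure (assumption): prefers results whose
    foreground fraction is close to an expected value. A real system would
    use recognition confidence here instead."""
    total = sum(len(row) for row in binary)
    foreground = sum(sum(row) for row in binary)
    return 1.0 - abs(foreground / total - expected_fraction)


def best_binarization(
    region: Image,
    thresholds: Sequence[int],
    score: Callable[[Image], float],
) -> Tuple[int, Image, float]:
    """Try every threshold on one region and keep the highest-scoring result."""
    best = None
    for threshold in thresholds:
        candidate = binarize(region, threshold)
        quality = score(candidate)
        if best is None or quality > best[2]:
            best = (threshold, candidate, quality)
    if best is None:
        raise ValueError("no thresholds supplied")
    return best


# Example region and thresholds are arbitrary illustrative values.
region = [[10, 200], [90, 250]]
chosen_threshold, chosen_image, chosen_quality = best_binarization(
    region, [50, 100, 220], toy_score
)
```

Applying `best_binarization` separately to each region or connected component, and then pasting the winning results back into place, would correspond to the per-portion assembly that the abstract mentions.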

    Selective, user-mediated content recognition using mobile devices

    Publication number: US10049268B2

    Publication date: 2018-08-14

    Application number: US15059242

    Filing date: 2016-03-02

    Applicant: Kofax, Inc.

    Abstract: A method includes: displaying a digital image on a first portion of a display of a mobile device; receiving user feedback via the display of the mobile device; analyzing the user feedback to determine a meaning of the user feedback; based on the determined meaning of the user feedback, analyzing a portion of the digital image corresponding to either the point of interest or the region of interest to detect one or more connected components depicted within the portion of the digital image; classifying each detected connected component depicted within the portion of the digital image; estimating an identity of each detected connected component based on the classification of the detected connected component; and one or more of: displaying the identity of each detected connected component on a second portion of the display of the mobile device; and providing the identity of each detected connected component to a workflow.

    MACHINE PRINT, HAND PRINT, AND SIGNATURE DISCRIMINATION

    Publication number: US20180189558A1

    Publication date: 2018-07-05

    Application number: US15910797

    Filing date: 2018-03-02

    Applicant: Kofax, Inc.

    CPC classification number: G06K9/00422 G06K9/00187 G06K9/346

    Abstract: Computer program products for discriminating hand and machine print from each other, and from signatures, are disclosed and include program code readable and/or executable by a processor to: receive an image; determine a color depth of the image; reduce the color depth of non-bi-tonal images to generate a bi-tonal representation of the image; identify a set of one or more graphical line candidates in either the bi-tonal image or the bi-tonal representation, the graphical line candidates including true graphical lines and/or false positives; discriminate any of the true graphical lines from any of the false positives; remove the true graphical lines from the bi-tonal image or the bi-tonal representation without removing the false positives to generate a component map comprising connected components and excluding graphical lines; identify one or more of the connected components in the component map; and output and/or display an indicator of each of the connected components.
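The first two steps of this abstract (reducing color depth to a bi-tonal representation and collecting graphical line candidates) might look roughly like the sketch below. The fixed threshold of 128, the run-length test, and the names `to_bitonal` and `horizontal_line_candidates` are assumptions made for illustration. The sketch only finds candidates; it does not attempt the later step of discriminating true graphical lines from false positives.

```python
from typing import List, Tuple

Image = List[List[int]]


def to_bitonal(gray: Image, threshold: int = 128) -> Image:
    """Reduce a grayscale image (0-255) to 1 = dark foreground, 0 = background.
    The threshold value is an arbitrary assumption."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def horizontal_line_candidates(
    binary: Image, min_length: int
) -> List[Tuple[int, int, int]]:
    """Return (row, start_col, end_col) for each horizontal foreground run
    that is at least min_length pixels long."""
    candidates = []
    for y, row in enumerate(binary):
        start = None
        # A trailing 0 acts as a sentinel so a run touching the right edge closes.
        for x, px in enumerate(list(row) + [0]):
            if px and start is None:
                start = x
            elif not px and start is not None:
                if x - start >= min_length:
                    candidates.append((y, start, x - 1))
                start = None
    return candidates


# Illustrative input: row 0 holds a 4-pixel dark run, row 1 only isolated pixels.
gray = [
    [0, 0, 0, 0, 255],
    [0, 255, 0, 255, 0],
]
candidates = horizontal_line_candidates(to_bitonal(gray), min_length=3)
```

Long runs can also belong to text (for example, the crossbar of a large letter), which is why the abstract treats these results as candidates that still include false positives.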

    Range and/or polarity-based thresholding for improved data extraction

    Publication number: US11302109B2

    Publication date: 2022-04-12

    Application number: US16569247

    Filing date: 2019-09-12

    Applicant: Kofax, Inc.

    Abstract: Computerized techniques for improved binarization and extraction of information from digital image data are disclosed in accordance with various embodiments. The inventive concepts include rendering a digital image using a plurality of binarization thresholds to generate a plurality of binarized digital images, wherein at least some of the binarized digital images are generated using one or more binarization thresholds that are determined based on a priori knowledge regarding an object depicted in the digital image; identifying one or more connected components within the plurality of binarized digital images; and identifying one or more text regions within the digital image based on some or all of the connected components. Systems and computer program products are also disclosed.
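As a rough sketch of the pipeline in this abstract, the code below renders one grayscale image at several thresholds and collects the connected components found in each rendering. The 4-connectivity rule, the bounding-box output, the threshold values, and the function names are all assumptions for illustration. The abstract derives thresholds from a priori knowledge of the depicted object, which is not modeled here, and grouping the components into text regions is left out.

```python
from collections import deque
from typing import Dict, List, Sequence, Tuple

Image = List[List[int]]
Box = Tuple[int, int, int, int]  # (top, left, bottom, right), inclusive


def connected_components(binary: Image) -> List[Box]:
    """Return bounding boxes of 4-connected foreground blobs, in scan order."""
    height = len(binary)
    width = len(binary[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            if not binary[y][x] or seen[y][x]:
                continue
            seen[y][x] = True
            queue = deque([(y, x)])
            top, left, bottom, right = y, x, y, x
            while queue:  # breadth-first flood fill of one component
                cy, cx = queue.popleft()
                top, bottom = min(top, cy), max(bottom, cy)
                left, right = min(left, cx), max(right, cx)
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and binary[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def components_per_threshold(
    gray: Image, thresholds: Sequence[int]
) -> Dict[int, List[Box]]:
    """Binarize at each threshold and list the components found at that level."""
    result = {}
    for threshold in thresholds:
        binary = [[1 if px < threshold else 0 for px in row] for row in gray]
        result[threshold] = connected_components(binary)
    return result


# Illustrative image: a dark stroke in column 0 and a mid-gray stroke in column 2.
gray = [
    [0, 255, 120],
    [0, 255, 120],
    [255, 255, 255],
]
found = components_per_threshold(gray, [100, 200])
```

In this example the mid-gray stroke appears only at the higher threshold, which illustrates why rendering at several thresholds can surface components that any single threshold would miss.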

    MOBILE DOCUMENT DETECTION AND ORIENTATION BASED ON REFERENCE OBJECT CHARACTERISTICS

    Publication number: US20170357869A1

    Publication date: 2017-12-14

    Application number: US15672200

    Filing date: 2017-08-08

    Applicant: Kofax, Inc.

    Abstract: In various embodiments, computer program products for detecting, estimating, calculating, etc. characteristics of a document based on reference objects depicted on the document are disclosed. In one approach, a computer program product for processing a digital image depicting a document includes instructions executable by a computer for analyzing the digital image to determine one or more of a presence and a location of one or more reference objects; determining one or more geometric characteristics of at least one of the reference objects; defining one or more region(s) of interest based at least in part on one or more of the determined geometric characteristics; and detecting a presence or absence of an edge of the document within each defined region of interest. Additional embodiments leverage the type of document depicted in the image, multiple frames of image data, and/or calculate or extrapolate document edges rather than locating edges in the image.
