ITERATIVE RECOGNITION-GUIDED THRESHOLDING AND DATA EXTRACTION

    公开(公告)号:US20210383150A1

    公开(公告)日:2021-12-09

    申请号:US17348584

    申请日:2021-06-15

    Applicant: Kofax, Inc.

    Abstract: Techniques for binarization and extraction of information from image data are disclosed. The inventive concepts include independently binarizing portions of the image data on the basis of individual features, e.g. per connected component, and using multiple different binarization thresholds to obtain the best possible binarization result for each portion of the image data. Determining the quality of each binarization result may be based on attempted recognition and/or extraction of information therefrom. Independently binarized portions may be assembled into a contiguous result. In one embodiment, a method includes: identifying a region of interest within a digital image; generating a plurality of binarized images based on the region of interest using different binarization thresholds; and extracting data from some or all of the plurality of binarized images. The extracted data includes connected components that overlap and/or are obscured by unique background. Corresponding systems and computer program products are disclosed.

    ITERATIVE RECOGNITION-GUIDED THRESHOLDING AND DATA EXTRACTION

    公开(公告)号:US20190171900A1

    公开(公告)日:2019-06-06

    申请号:US16267205

    申请日:2019-02-04

    Applicant: Kofax, Inc.

    Abstract: Techniques for binarization and extraction of information from image data are disclosed. The inventive concepts include independently binarizing portions of the image data on the basis of individual features, e.g. per connected component, and using multiple different binarization thresholds to obtain the best possible binarization result for each portion of the image data. Determining the quality of each binarization result may be based on attempted recognition and/or extraction of information therefrom. Independently binarized portions may be assembled into a contiguous result. In one embodiment, a method includes: identifying a region of interest within a digital image; generating a plurality of binarized images based on the region of interest using different binarization thresholds; subjecting the region of interest within a digital image to a plurality of thresholding and extraction iterations; and extracting data from some or all of the plurality of binarized images. Corresponding systems and computer program products are disclosed.

    Range and/or polarity-based thresholding for improved data extraction

    公开(公告)号:US10467465B2

    公开(公告)日:2019-11-05

    申请号:US15396327

    申请日:2016-12-30

    Applicant: Kofax, Inc.

    Abstract: Computerized techniques for improved binarization and extraction of information from digital image data are disclosed in accordance with various embodiments. The inventive concepts include: rendering, using a processor of the mobile device, a digital image using a plurality of binarization thresholds to generate a plurality of range-binarized digital images, wherein each rendering of the digital image is generated using a different combination of the plurality of binarization thresholds; identifying, using the processor of the mobile device, one or more range connected components within the plurality of range-binarized digital images; and identifying, using the processor of the mobile device, a plurality of text regions within the digital image based on some or all of the range connected components. Corresponding systems and computer program products are also disclosed.

Patent Agency Ranking