Range and/or polarity-based thresholding for improved data extraction

    公开(公告)号:US11302109B2

    公开(公告)日:2022-04-12

    申请号:US16569247

    申请日:2019-09-12

    Applicant: Kofax, Inc.

    Abstract: Computerized techniques for improved binarization and extraction of information from digital image data are disclosed in accordance with various embodiments. The inventive concepts include rendering a digital image using a plurality of binarization thresholds to generate a plurality of binarized digital images, wherein at least some of the binarized digital images are generated using one or more binarization thresholds that are determined based on a priori knowledge regarding an object depicted in the digital image; identifying one or more connected components within the plurality of binarized digital images; and identifying one or more text regions within the digital image based on some or all of the connected components. Systems and computer program products are also disclosed.

    MOBILE DOCUMENT DETECTION AND ORIENTATION BASED ON REFERENCE OBJECT CHARACTERISTICS

    公开(公告)号:US20170357869A1

    公开(公告)日:2017-12-14

    申请号:US15672200

    申请日:2017-08-08

    Applicant: Kofax, Inc.

    Abstract: In various embodiments, computer program products for detecting, estimating, calculating, etc. characteristics of a document based on reference objects depicted on the document are disclosed. In one approach, a computer program product for processing a digital image depicting a document includes instructions executable by a computer for analyzing the digital image to determine one or more of a presence and a location of one or more reference objects; determining one or more geometric characteristics of at least one of the reference objects; defining one or more region(s) of interest based at least in part on one or more of the determined geometric characteristics; and detecting a presence or absence of an edge of the document within each defined region of interest. Additional embodiments leverage the type of document depicted in the image, multiple frames of image data, and/or calculate or extrapolate document edges rather than locating edges in the image.

    SYSTEMS AND METHODS FOR MOBILE IMAGE CAPTURE AND PROCESSING

    公开(公告)号:US20170109830A1

    公开(公告)日:2017-04-20

    申请号:US15394726

    申请日:2016-12-29

    Applicant: Kofax, Inc.

    Abstract: In several embodiments, methods, systems, and computer program products for processing digital images captured by a mobile device are disclosed. The techniques include detecting medical documents and/or documents relevant to an insurance claim by defining candidate edge points based on the captured image data and defining four sides of a tetragon based on at least some of the candidate edge points. In the case of an insurance claim process, the techniques also include determining whether the document is relevant to an insurance claim; and in response to determining the document is relevant to the insurance claim, submitting the image data, information extracted from the image data, or both to a remote server for claims processing. The image capture and processing techniques further facilitate processing of medical documents and/or insurance claims with a plurality of additional features that may be used individually or in combination in various embodiments.

    ITERATIVE RECOGNITION-GUIDED THRESHOLDING AND DATA EXTRACTION
    37.
    发明申请
    ITERATIVE RECOGNITION-GUIDED THRESHOLDING AND DATA EXTRACTION 审中-公开
    迭代识别引导和数据提取

    公开(公告)号:US20170024629A1

    公开(公告)日:2017-01-26

    申请号:US15214351

    申请日:2016-07-19

    Applicant: Kofax, Inc.

    Abstract: Techniques for improved binarization and extraction of information from digital image data are disclosed in accordance with various embodiments. The inventive concepts include independently binarizing portions of the image data on the basis of individual features, e.g. per connected component, and using multiple different binarization thresholds to obtain the best possible binarization result for each portion of the image data independently binarized. Determining the quality of each binarization result may be based on attempted recognition and/or extraction of information therefrom. Independently binarized portions may be assembled into a contiguous result. In one embodiment, a method includes: identifying a region of interest within a digital image; generating a plurality of binarized images based on the region of interest using different binarization thresholds; and extracting data from some or all of the plurality of binarized images. Corresponding systems and computer program products are also disclosed.

    Abstract translation: 根据各种实施例公开了用于从数字图像数据改进二值化和提取信息的技术。 本发明的概念包括基于各个特征来独立地二值化图像数据的部分。 并且使用多个不同的二值化阈值来获得独立二进制化的图像数据的每个部分的最佳二值化结果。 确定每个二值化结果的质量可以基于从其中的信息的尝试识别和/或提取。 独立的二值化部分可以组装成连续的结果。 在一个实施例中,一种方法包括:识别数字图像内的感兴趣区域; 使用不同的二值化阈值,基于感兴趣区域生成多个二值化图像; 以及从所述多个二值化图像中的一些或全部提取数据。 还公开了相应的系统和计算机程序产品。

    SYSTEMS AND METHODS FOR CLASSIFYING OBJECTS IN DIGITAL IMAGES CAPTURED USING MOBILE DEVICES
    38.
    发明申请
    SYSTEMS AND METHODS FOR CLASSIFYING OBJECTS IN DIGITAL IMAGES CAPTURED USING MOBILE DEVICES 有权
    使用移动设备捕获的数字图像中的对象进行分类的系统和方法

    公开(公告)号:US20160259973A1

    公开(公告)日:2016-09-08

    申请号:US15157325

    申请日:2016-05-17

    Applicant: Kofax, Inc.

    Abstract: In one embodiment, a method includes receiving a digital image captured by a mobile device; and using a processor of the mobile device: generating a first representation of the digital image, the first representation being characterized by a reduced resolution; generating a first feature vector based on the first representation; comparing the first feature vector to a plurality of reference feature matrices; classifying an object depicted in the digital image as a member of a particular object class based at least in part on the comparing; and determining one or more object features of the object based at least in part on the particular object class. Corresponding systems and computer program products are also disclosed.

    Abstract translation: 在一个实施例中,一种方法包括接收由移动设备捕获的数字图像; 以及使用所述移动设备的处理器:产生所述数字图像的第一表示,所述第一表示的特征在于降低的分辨率; 基于所述第一表示生成第一特征向量; 将所述第一特征向量与多个参考特征矩阵进行比较; 至少部分地基于比较将数字图像中描绘的对象分类为特定对象类的成员; 以及至少部分地基于所述特定对象类来确定所述对象的一个​​或多个对象特征。 还公开了相应的系统和计算机程序产品。

    MOBILE DOCUMENT DETECTION AND ORIENTATION BASED ON REFERENCE OBJECT CHARACTERISTICS
    39.
    发明申请
    MOBILE DOCUMENT DETECTION AND ORIENTATION BASED ON REFERENCE OBJECT CHARACTERISTICS 有权
    基于参考对象特征的移动文档检测和定向

    公开(公告)号:US20160125613A1

    公开(公告)日:2016-05-05

    申请号:US14927359

    申请日:2015-10-29

    Applicant: Kofax, Inc.

    CPC classification number: G06K9/3208 G06K9/00463 G06K9/186

    Abstract: In various embodiments, methods, systems, and computer program products for detecting, estimating, calculating, etc. characteristics of a document based on reference objects depicted on the document are disclosed. In one approach, a computer-implemented method for processing a digital image depicting a document includes analyzing the digital image to determine one or more of a presence and a location of one or more reference objects; determining one or more geometric characteristics of at least one of the reference objects; defining one or more region(s) of interest based at least in part on one or more of the determined geometric characteristics; and detecting a presence or an absence of an edge of the document within each defined region of interest. Additional embodiments leverage the type of document depicted in the image, multiple frames of image data, and/or calculate or extrapolate document edges rather than locating edges in the image.

    Abstract translation: 在各种实施例中,公开了用于基于文档上描绘的参考对象检测,估计,计算等特征的方法,系统和计算机程序产品。 在一种方法中,用于处理描绘文档的数字图像的计算机实现的方法包括分析数字图像以确定一个或多个参考对象的存在和位置中的一个或多个; 确定所述参考对象中的至少一个的一个或多个几何特征; 至少部分地基于所确定的几何特征中的一个或多个来定义感兴趣的一个或多个区域; 以及在每个确定的感兴趣区域内检测文档的边缘的存在或不存在。 附加实施例利用图像中描绘的文档类型,多帧图像数据,和/或计算或外推文档边缘,而不是定位图像中的边缘。

Patent Agency Ranking