CONTENT-BASED DOCUMENT IMAGE CLASSIFICATION
    1.
    发明申请
    CONTENT-BASED DOCUMENT IMAGE CLASSIFICATION 有权
    基于内容的文档图像分类

    公开(公告)号:US20160092730A1

    公开(公告)日:2016-03-31

    申请号:US14571766

    申请日:2014-12-16

    IPC分类号: G06K9/00

    摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for classifying one or more document images based on its content by determining blocks layout of the document image; recognizing the document image to obtain digital content data representing text content or the potential graphical content of the image; calculating feature values of the document image for features based on the digital content data and the blocks layout; and classifying the document image as belonging to one of document classes based on the calculated feature values.

    摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于通过确定文件图像的块布局来基于其内容对一个或多个文档图像进行分类; 识别文档图像以获得表示文本内容的数字内容数据或图像的潜在图形内容; 基于数字内容数据和块布局来计算特征的文档图像的特征值; 并且基于所计算的特征值将文档图像分类为属于文档类别之一。

    Comparing documents using a trusted source

    公开(公告)号:US09922247B2

    公开(公告)日:2018-03-20

    申请号:US14588670

    申请日:2015-01-02

    IPC分类号: G06K9/18 G06K9/00

    CPC分类号: G06K9/00483 G06K2209/01

    摘要: Systems and methods for enhancing and comparing documents. An example method comprises: comparing document images to identify a first document image of a reference document that corresponds with a second document image of a related document; transforming the second document image based on a layout of the first document image; and performing character recognition of the second document image.

    Document scanning method, system, and device having sets of parallel lines as background
    3.
    发明授权
    Document scanning method, system, and device having sets of parallel lines as background 有权
    文档扫描方法,系统和具有平行线组的设备作为背景

    公开(公告)号:US09319547B2

    公开(公告)日:2016-04-19

    申请号:US14501658

    申请日:2014-09-30

    发明人: Andrey Isaev

    摘要: Methods and devices are described for detecting boundaries of documents on flatbed and multi-function scanners on a first pass of a carriage assembly, and then performing a high resolution scan on a second pass. High resolution images of documents can then be obtained with little or no interaction normally necessary to identify areas of interest on the scanner bed. Patterns on the scanner cover or lid facilitate not only edge determination, but orientation of text and other objects, and straightening of images in preparation for OCR and related functions. Electronic images and files derived from paper documents may be automatically cropped, deskewed, subjected to OCR, and named consistent with content or other information derived from them.

    摘要翻译: 描述了用于在托架组件的第一次通过中检测平板和多功能扫描仪上的文档的边界,然后在第二遍上进行高分辨率扫描的方法和装置。 那么文件的高分辨率图像然后可以很少或根本不需要进行交互来获得,以识别扫描仪床上的感兴趣区域。 扫描仪盖子或盖子上的图案不仅可以方便边缘确定,还可以方便文本和其他物体,以及校正图像以准备OCR和相关功能。 从纸质文档衍生的电子图像和文件可能会自动裁剪,进行偏斜校正,受到OCR的影响,并与来自它们的内容或其他信息命名一致。

    VIDEO CAPTURE IN DATA CAPTURE SCENARIO
    5.
    发明申请

    公开(公告)号:US20170286796A1

    公开(公告)日:2017-10-05

    申请号:US15627334

    申请日:2017-06-19

    发明人: Andrey Isaev

    IPC分类号: G06K9/34 G06K9/62 G06K9/00

    摘要: A data capture component of a mobile device receives information for an identification of a data field in a physical document. The data capture component receives a video stream comprising a plurality of frames, wherein each frame comprises a portion of the physical document. A frame is selected from the plurality of frames in the video stream. One or more text regions in the frame are identified. Each of the identified text region(s) in the frame is processed to identify data of each of the identified text region(s) and to select data of one of the identified text region(s) that corresponds to a set of attributes associated with the data field. The selected data is then compared with data of text regions of a subsequent frame. If the data of the text regions of the subsequent frame is a closer match to the set of attributes, the selected data is updated. A display field is then provided with the selected data for presentation with the frame in a user interface.

    Video capture in data capture scenario

    公开(公告)号:US09684843B2

    公开(公告)日:2017-06-20

    申请号:US14967645

    申请日:2015-12-14

    发明人: Andrey Isaev

    IPC分类号: G06K9/34 G06K9/00 G06K9/62

    摘要: A data capture component of a mobile device receives information for an identification of a data field in a physical document. The data capture component receives a video stream comprising a plurality of frames, wherein each frame comprises a portion of the physical document. A frame is selected from the plurality of frames in the video stream. One or more text regions in the frame are identified. Each of the identified text region(s) in the frame is processed to identify data of each of the identified text region(s) and to select data of one of the identified text region(s) that corresponds to a set of attributes associated with the data field. The selected data is then compared with data of text regions of a subsequent frame. If the data of the text regions of the subsequent frame is a closer match to the set of attributes, the selected data is updated. A display field is then provided with the selected data for presentation in a user interface.

    SYSTEM AND METHOD FOR USING PRIOR FRAME DATA FOR OCR PROCESSING OF FRAMES IN VIDEO SOURCES
    7.
    发明申请
    SYSTEM AND METHOD FOR USING PRIOR FRAME DATA FOR OCR PROCESSING OF FRAMES IN VIDEO SOURCES 有权
    使用先前帧数据的系统和方法用于在视频源中的帧的OCR处理

    公开(公告)号:US20160171329A1

    公开(公告)日:2016-06-16

    申请号:US14863512

    申请日:2015-09-24

    摘要: Disclosed are systems, methods and computer program products for using prior frame data for OCR processing of frames in video sources to detect natural language text therein. An example includes receiving a frame from a video source and retrieving prior frame data associated with the video source. The OCR-processing includes using prior frame data to detect blobs similar to blobs described in the prior frame data; using detected similar blobs to detect in the frame character candidates similar to character candidates described in the prior frame data; using detected similar character candidates to detect in the frame text candidates similar to text candidates described in the prior frame data; and using detected similar text candidates to detect in the frame text strings similar to text strings described in the prior frame data.

    摘要翻译: 公开了用于使用先前帧数据进行视频源中的帧的OCR处理以检测其中的自然语言文本的系统,方法和计算机程序产品。 一个例子包括从视频源接收帧并检索与视频源相关联的先前帧数据。 OCR处理包括使用先前帧数据来检测类似于先前帧数据中描述的斑点的斑点; 使用检测到的类似斑点来检测与在先前帧数据中描述的字符候选相似的帧字符候选; 使用检测到的相似字符候选来检测与在先前帧数据中描述的文本候选相似的帧文本候选; 并且使用检测到的类似文本候选来检测与在先前帧数据中描述的文本串类似的帧文本串。

    USING SCANNING IMPLEMENTED SOFTWARE FOR TIME ECONOMY WITHOUT RESACANNING (S.I.S.T.E.R.)
    8.
    发明申请
    USING SCANNING IMPLEMENTED SOFTWARE FOR TIME ECONOMY WITHOUT RESACANNING (S.I.S.T.E.R.) 有权
    使用扫描实施的软件进行时间经济,无需重新定义(S.I.S.T.E.R.)

    公开(公告)号:US20150015926A1

    公开(公告)日:2015-01-15

    申请号:US14501658

    申请日:2014-09-30

    发明人: Andrey Isaev

    IPC分类号: H04N1/00 H04N1/10

    摘要: Methods and devices are described for detecting boundaries of documents on flatbed and multi-function scanners on a first pass of a carriage assembly, and then performing a high resolution scan on a second pass. High resolution images of documents can then be obtained with little or no interaction normally necessary to identify areas of interest on the scanner bed. Patterns on the scanner cover or lid facilitate not only edge determination, but orientation of text and other objects, and straightening of images in preparation for OCR and related functions. Electronic images and files derived from paper documents may be automatically cropped, deskewed, subjected to OCR, and named consistent with content or other information derived from them.

    摘要翻译: 描述了用于在托架组件的第一次通过中检测平板和多功能扫描仪上的文档的边界,然后在第二遍上进行高分辨率扫描的方法和装置。 那么文件的高分辨率图像然后可以很少或根本不需要进行交互来获得,以识别扫描仪床上的感兴趣区域。 扫描仪盖子或盖子上的图案不仅可以方便边缘确定,还可以方便文本和其他物体,以及校正图像以准备OCR和相关功能。 从纸质文档衍生的电子图像和文件可能会自动裁剪,进行偏斜校正,受到OCR的影响,并与来自它们的内容或其他信息命名一致。

    VIDEO CAPTURE IN DATA CAPTURE SCENARIO

    公开(公告)号:US20170116494A1

    公开(公告)日:2017-04-27

    申请号:US14967645

    申请日:2015-12-14

    发明人: Andrey Isaev

    IPC分类号: G06K9/34 G06K9/62 G06K9/00

    摘要: A data capture component of a mobile device receives information for an identification of a data field in a physical document. The data capture component receives a video stream comprising a plurality of frames, wherein each frame comprises a portion of the physical document. A frame is selected from the plurality of frames in the video stream. One or more text regions in the frame are identified. Each of the identified text region(s) in the frame is processed to identify data of each of the identified text region(s) and to select data of one of the identified text region(s) that corresponds to a set of attributes associated with the data field. The selected data is then compared with data of text regions of a subsequent frame. If the data of the text regions of the subsequent frame is a closer match to the set of attributes, the selected data is updated. A display field is then provided with the selected data for presentation in a user interface.