Method and apparatus for determining a document suitability for server-based optical character recognition (OCR) processing

    Publication No.: US10198628B2

    Publication date: 2019-02-05

    Application No.: US15377170

    Filing date: 2016-12-13

    IPC classification: G06K9/00

    Abstract: There is disclosed a method of analyzing a digital image of a document (to determine, as an example, a document's suitability for server-based OCR processing) in a computer system that includes a user electronic device (for acquiring or storing a digital image of a document) connectable to a server (for executing the server-based OCR processing of the digital image to create a recognized-text document). The method is executable by the user electronic device and comprises: acquiring the digital image of the document; analyzing an OCR quality parameter associated with a compressed digital image to be created from the digital image using a compression algorithm and a compression parameter; and, in response to the OCR quality parameter being above or equal to a pre-determined threshold, transmitting the compressed digital image to the server. Optionally, the method further comprises compressing the digital image using the compression algorithm and the compression parameter to create the compressed digital image before transmission thereof.
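The device-side decision described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function name `send_if_ocr_suitable` and the three callables (`estimate_quality`, `compress`, `send`) are hypothetical placeholders, since the abstract does not say how the OCR quality parameter is computed or which compression algorithm is used.

```python
from typing import Callable


def send_if_ocr_suitable(
    image: bytes,
    compression_param: int,
    threshold: float,
    estimate_quality: Callable[[bytes, int], float],  # hypothetical estimator
    compress: Callable[[bytes, int], bytes],          # hypothetical compressor
    send: Callable[[bytes], None],                    # hypothetical uploader
) -> bool:
    """Transmit the compressed image only if its predicted OCR quality
    is above or equal to the threshold. Returns True if it was sent."""
    # The quality parameter is analyzed for the compressed image that
    # *would* be created, so nothing is compressed before this check.
    quality = estimate_quality(image, compression_param)
    if quality >= threshold:
        # Compression happens only once the image is known to be suitable.
        send(compress(image, compression_param))
        return True
    return False
```

Passing the estimator and compressor in as parameters keeps the sketch independent of any particular image library.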

    Verification of optical character recognition results

    Publication No.: US10068155B2

    Publication date: 2018-09-04

    Application No.: US15275990

    Filing date: 2016-09-26

    Abstract: A method of verifying optical character recognition (OCR) results may involve: performing OCR on one or more initial images of a document and displaying initial OCR results of the document to a user; receiving feedback from the user regarding an error location in the initial OCR results, the error location being a location of a misspelled character sequence; receiving an additional image of the document, which corresponds to the error location, and performing OCR of the additional image to produce additional OCR results; identifying a cluster of character sequences, which correspond to the error location, using the initial OCR results and the additional OCR results; identifying an order of character sequences in the cluster of character sequences based on their respective probability values; and displaying to the user modified optical character recognition results, which contain in the error location a corrected character sequence.
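As a rough illustration of the clustering and ordering step, the sketch below pools candidate character sequences from the initial and additional OCR passes and orders them by probability. The function name `rank_candidates` and the rule of summing probabilities across passes are assumptions made for this example; the abstract does not specify how the probability values are combined.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

Candidate = Tuple[str, float]  # (character sequence, probability)


def rank_candidates(
    initial: Iterable[Candidate], additional: Iterable[Candidate]
) -> List[Candidate]:
    """Pool candidates for one error location and order them, best first."""
    scores: Dict[str, float] = defaultdict(float)
    for text, prob in list(initial) + list(additional):
        # Assumption: evidence for the same sequence accumulates by summation.
        scores[text] += prob
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


initial_pass = [("c1ock", 0.40), ("clock", 0.35)]
additional_pass = [("clock", 0.50), ("cl0ck", 0.20)]
ranked = rank_candidates(initial_pass, additional_pass)
corrected = ranked[0][0]  # the sequence shown to the user at the error location
```

Here the second image raises the pooled score of "clock" above the misrecognized "c1ock", so it becomes the corrected sequence.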

    Video capture in data capture scenario

    Publication No.: US09684843B2

    Publication date: 2017-06-20

    Application No.: US14967645

    Filing date: 2015-12-14

    Inventor: Andrey Isaev

    IPC classification: G06K9/34 G06K9/00 G06K9/62

    Abstract: A data capture component of a mobile device receives information for an identification of a data field in a physical document. The data capture component receives a video stream comprising a plurality of frames, wherein each frame comprises a portion of the physical document. A frame is selected from the plurality of frames in the video stream. One or more text regions in the frame are identified. Each of the identified text region(s) in the frame is processed to identify data of each of the identified text region(s) and to select data of one of the identified text region(s) that corresponds to a set of attributes associated with the data field. The selected data is then compared with data of text regions of a subsequent frame. If the data of the text regions of the subsequent frame is a closer match to the set of attributes, the selected data is updated. A display field is then provided with the selected data for presentation in a user interface.
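The frame-by-frame update described above can be sketched as a running best-match search. Everything named here is illustrative: `capture_field` and `date_score` are hypothetical, each frame is modeled simply as a list of already-recognized text regions, and the "set of attributes" is reduced to a toy scoring function for a DD/MM/YYYY date field.

```python
import re
from typing import Callable, Iterable, Optional, Sequence


def capture_field(
    frames: Iterable[Sequence[str]], score: Callable[[str], float]
) -> Optional[str]:
    """Keep the text region that best matches the field attributes so far,
    updating the selection when a later frame offers a closer match."""
    best_text: Optional[str] = None
    best_score = 0.0  # regions scoring 0 never become the selection
    for regions in frames:
        for text in regions:
            s = score(text)
            if s > best_score:
                best_text, best_score = text, s
    return best_text


def date_score(text: str) -> float:
    """Toy attribute match: fraction of the region covered by a date."""
    match = re.search(r"\d{2}/\d{2}/\d{4}", text)
    return len(match.group()) / len(text) if match else 0.0


frames = [
    ["Invoice", "Date 12/0"],         # date only partly visible
    ["Date: 12/03/2015", "Total"],    # date visible, with a label attached
    ["12/03/2015"],                   # clean capture of the field
]
selected = capture_field(frames, date_score)
```

The value in `selected` is what would populate the display field in the user interface.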

    Method and system of pre-analysis and automated classification of documents

    Publication No.: US09633257B2

    Publication date: 2017-04-25

    Application No.: US14314892

    Filing date: 2014-06-25

    IPC classification: G06K9/00 G06K9/46 G06K9/62

    Abstract: Automatic classification of different types of documents is disclosed. An image of a form or document is captured. The document is assigned to one or more type definitions by identifying one or more objects within the image of the document. A matching model is selected via identification of the document image. In the case of multiple identifications, a profound analysis of the document type is performed, either automatically or manually. An automatic classifier may be trained with document samples of each of a plurality of document classes or document types where the types are known in advance, or a system of classes may be formed automatically without a priori information about types of samples. An automatic classifier determines possible features and calculates a range of feature values and possible other feature parameters for each type or class of document. A decision tree, based on rules specified by a user, may be used for classifying documents. Processing, such as optical character recognition (OCR), may be used in the classification process.
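One way to picture the rule-based decision tree mentioned in this abstract is the small sketch below. The `Node` structure, the feature names (`has_table`, `keywords`), and the document types are all invented for illustration; the abstract does not describe how rules or features are represented.

```python
from dataclasses import dataclass
from typing import Callable, Union


@dataclass
class Node:
    """A user-specified rule with a branch for each outcome."""
    rule: Callable[[dict], bool]
    if_true: Union["Node", str]   # subtree or document type
    if_false: Union["Node", str]


def classify(features: dict, node: Union[Node, str]) -> str:
    """Walk the tree until a leaf (a document type name) is reached."""
    while isinstance(node, Node):
        node = node.if_true if node.rule(features) else node.if_false
    return node


# Hypothetical rules over features extracted from the document image.
tree = Node(
    rule=lambda f: f["has_table"],
    if_true=Node(lambda f: "invoice" in f["keywords"], "invoice", "report"),
    if_false="letter",
)
doc_type = classify({"has_table": True, "keywords": {"invoice", "total"}}, tree)
```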

    Translation system and method
    Type: granted invention patent (in force)

    Publication No.: US09483466B2

    Publication date: 2016-11-01

    Application No.: US12464798

    Filing date: 2009-05-12

    Applicant: Ding-Yuan Tang

    Inventor: Ding-Yuan Tang

    IPC classification: G06F17/28 G06Q10/06

    CPC classification: G06F17/289 G06Q10/06

    Abstract: In accordance with a first aspect of the invention, there is provided a method comprising: receiving an input as part of a translation request from a requestor; performing a first translation of the input, wherein the first translation is a machine translation; returning the first translation to the requestor; and, based on feedback on the first translation from the requestor, performing the following: (a) fragmenting the input into multiple translation jobs; (b) distributing the multiple translation jobs to a plurality of human translators; (c) generating a second translation of the input based on translations of the multiple jobs by the human translators; and (d) returning the second translation to the requestor.
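Steps (a) through (c) of this abstract can be sketched as below. This is only an illustration under stated assumptions: the input is fragmented at sentence boundaries, jobs are handed out round-robin, and each human translator is modeled as a plain callable. None of these choices comes from the abstract, and the function names are hypothetical.

```python
import re
from typing import Callable, List, Sequence


def fragment(text: str) -> List[str]:
    """(a) Split the input into jobs; here, one job per sentence."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def second_translation(
    text: str, translators: Sequence[Callable[[str], str]]
) -> str:
    """(b) Distribute jobs round-robin, then (c) reassemble in order."""
    jobs = fragment(text)
    parts = [translators[i % len(translators)](job) for i, job in enumerate(jobs)]
    return " ".join(parts)


# Stand-ins for two human translators, so the flow can be traced.
translators = [str.upper, lambda s: s[::-1]]
result = second_translation("One. Two. Three.", translators)
```

Keeping the jobs in their original order is what allows the second translation to be generated by simple concatenation.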

    Techniques for detecting user-entered check marks
    Type: granted invention patent (in force)

    Publication No.: US09396389B2

    Publication date: 2016-07-19

    Application No.: US14509188

    Filing date: 2014-10-08

    IPC classification: G06K9/46 G06K9/00 G06K9/60

    Abstract: A digital camera associated with a mobile processing apparatus is used to produce a file containing a 2D digitized image of a document having pre-formatted fields for a user's check marks. The image is electronically matched to a digital template of the document to extract digitized images of the pre-formatted fields, which are then analyzed for the presence of user-entered check marks.
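The final analysis step can be illustrated with a simple ink-ratio test on each field. This sketch assumes the image has already been matched to the template and binarized (1 for ink, 0 for background), and that the template supplies each field as a `(top, left, bottom, right)` box. The function name, the box convention, and the 10% ink threshold are assumptions for the example, not details from the abstract.

```python
from typing import List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (top, left, bottom, right), end-exclusive


def field_is_checked(
    image: Sequence[Sequence[int]], box: Box, min_ink_ratio: float = 0.1
) -> bool:
    """Report a check mark when enough pixels inside the field are ink."""
    top, left, bottom, right = box
    pixels: List[int] = [
        image[r][c] for r in range(top, bottom) for c in range(left, right)
    ]
    if not pixels:
        return False  # an empty box cannot contain a mark
    return sum(pixels) / len(pixels) >= min_ink_ratio


# A 4x4 binarized image with a diagonal stroke in the top-left field only.
image = [
    [1, 0, 0, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
checked_first = field_is_checked(image, (0, 0, 2, 2))
checked_second = field_is_checked(image, (2, 2, 4, 4))
```

A production detector would likely also need to ignore the printed outline of the field itself, which this sketch does not attempt.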
