-
公开(公告)号:US10198628B2
公开(公告)日:2019-02-05
申请号:US15377170
申请日:2016-12-13
发明人: Vasily Loginov , Ivan Zagaynov
IPC分类号: G06K9/00
摘要: There is disclosed a method of analyzing a digital image of a document (to determine, as example, a document suitability for server-based OCR processing) in a computer system that includes a user electronic device (for acquiring or storing a digital image of a document) connectable to a server (for executing the server-based OCR processing of the digital image to create a recognized-text document). The method is executable by the user electronic device and comprises: acquiring the digital image of the document; analyzing an OCR quality parameter associated with a compressed digital image to be created from the digital image using a compression algorithm and a compression parameter; in response to the OCR quality parameter being above or equal to a pre-determined threshold: transmitting the compressed digital image to the server. Optionally, the method further comprises compressing the digital image using the compression algorithm and the compression parameter to create the compressed digital image before transmission thereof.
-
公开(公告)号:US10068155B2
公开(公告)日:2018-09-04
申请号:US15275990
申请日:2016-09-26
摘要: A method of verifying optical character recognition (OCR) results may involve: performing OCR on one or more initial images of a document and displaying initial OCR results of the document to a user; receiving a feedback from the user regarding an error location in the initial OCR results, the error location being a location of a misspelled character sequence; receiving an additional image of the document, which corresponds to the error location, and performing OCR of the additional image to produce additional OCR results; identifying a cluster of character sequences, which correspond to the error location, using the initial OCR results and the additional OCR results; identifying an order of character sequences in the cluster of character sequences based on their respective probability values; and displaying to the user modified optical character recognition results, which contain in the error location a corrected character sequence.
-
公开(公告)号:US09911034B2
公开(公告)日:2018-03-06
申请号:US14781656
申请日:2013-06-18
CPC分类号: G06K9/00463 , G06K9/6814 , G06K9/6842 , G06K9/723 , G06K2209/01
摘要: The current application is directed to methods and systems that convert document images, which contain Arabic text and text in other languages in which symbols are joined together to produce continuous words and portions of words, into corresponding electronic documents. In one implementation, a document-image-processing method and system to which the current application is directed employs numerous techniques and features that render efficiently computable an otherwise intractable or impractical document-image-to-electronic-document conversion. These techniques and features include transformation of text-image morphemes and words into feature symbols with associated parameters, efficiently identifying similar morphemes and words in an electronic store of standard-feature-symbol-encoded morphemes and words, and identifying candidate inter-character division points and corresponding traversal paths using the similar morphemes and words identified in the word store.
-
公开(公告)号:US09811726B2
公开(公告)日:2017-11-07
申请号:US15193058
申请日:2016-06-26
CPC分类号: G06K9/00456 , G06F17/2223 , G06F17/275 , G06F17/2775 , G06K9/18 , G06K9/3208 , G06K9/6821 , G06K2209/011
摘要: Disclosed are systems, computer-readable mediums, and methods for determining that text contains Chinese, Japanese, or Korean characters. One method includes determining a language hypothesis for each text fragment in a plurality of text fragments identified from connected components in a document image. The method further includes selecting a first subset of text fragments from the plurality of text fragments based on ratings for the language hypothesis of each text fragment in the plurality of text fragments. The method further includes verifying, by a processor, the language hypothesis of one or more text fragments in the first subset of text fragments based on optical character recognition of the one or more text fragments. The method further includes determining, by the processor, that Chinese, Japanese, or Korean (CJK) characters are present in the document image based on the verification of the language hypothesis of each of the one or more text fragments.
-
公开(公告)号:US09754187B2
公开(公告)日:2017-09-05
申请号:US14571979
申请日:2014-12-16
CPC分类号: G06K9/6255 , G06K9/00483 , G06K2209/01
摘要: For extracting data from a document with fixed structure, we recognize key words in an image of the document; identify reference object based on these key words, create templates based on the identified reference objects; match the created templates against the image of the document while recognizing fields in the image of the document these templates; and select the best template using quality of the recognized field.
-
公开(公告)号:US09684843B2
公开(公告)日:2017-06-20
申请号:US14967645
申请日:2015-12-14
发明人: Andrey Isaev
CPC分类号: G06K9/344 , G06K9/00449 , G06K9/00463 , G06K9/00744 , G06K9/6201 , G06K2209/01
摘要: A data capture component of a mobile device receives information for an identification of a data field in a physical document. The data capture component receives a video stream comprising a plurality of frames, wherein each frame comprises a portion of the physical document. A frame is selected from the plurality of frames in the video stream. One or more text regions in the frame are identified. Each of the identified text region(s) in the frame is processed to identify data of each of the identified text region(s) and to select data of one of the identified text region(s) that corresponds to a set of attributes associated with the data field. The selected data is then compared with data of text regions of a subsequent frame. If the data of the text regions of the subsequent frame is a closer match to the set of attributes, the selected data is updated. A display field is then provided with the selected data for presentation in a user interface.
-
公开(公告)号:US09633257B2
公开(公告)日:2017-04-25
申请号:US14314892
申请日:2014-06-25
申请人: Irina Filimonova , Sergey Zlobin , Andrey Myakutin
发明人: Irina Filimonova , Sergey Zlobin , Andrey Myakutin
CPC分类号: G06K9/00469 , G06K9/00449 , G06K9/46 , G06K9/6202 , G06K9/626 , G06K2209/01
摘要: Automatic classification of different types of documents is disclosed. An image of a form or document is captured. The document is assigned to one or more type definitions by identifying one or more objects within the image of the document. A matching model is selected via identification of the document image. In the case of multiple identifications, a profound analysis of the document type is performed—either automatically or manually. An automatic classifier may be trained with document samples of each of a plurality of document classes or document types where the types are known in advance or a system of classes may be formed automatically without a priori information about types of samples. An automatic classifier determines possible features and calculates a range of feature values and possible other feature parameters for each type or class of document. A decision tree, based on rules specified by a user, may be used for classifying documents. Processing, such as optical character recognition (OCR), may be used in the classification process.
-
公开(公告)号:USD771077S1
公开(公告)日:2016-11-08
申请号:US29438941
申请日:2012-12-05
申请人: ABBYY Software Ltd.
设计人: Anatoly Ryzhkov
-
公开(公告)号:US09483466B2
公开(公告)日:2016-11-01
申请号:US12464798
申请日:2009-05-12
申请人: Ding-Yuan Tang
发明人: Ding-Yuan Tang
CPC分类号: G06F17/289 , G06Q10/06
摘要: In accordance with a first aspect of the invention, there is provided a method comprising receiving an input as part of a translation request from a requestor, performing a first translation of the input; wherein the first translation is a machine translation, returning the first translation to the requestor; and based on feedback on the first translation from the requestor performing the following (a) fragmenting the input into multiple translation jobs, (b) distributing the multiple translation jobs to a plurality of human translators; (c) generating a second translation of the input based on translations of the multiple jobs by the human translators; and (d) returning the second translation to the requestor.
摘要翻译: 根据本发明的第一方面,提供了一种方法,包括从请求者接收作为转换请求的一部分的输入,执行输入的第一翻译; 其中所述第一翻译是机器翻译,将所述第一翻译返回给所述请求者; 并且基于来自所述请求者的对所述第一翻译的反馈,所述第一翻译执行以下步骤(a)将所述输入分段成多个翻译作业;(b)将所述多个翻译作业分发到多个翻译人员; (c)基于人类翻译器对多个作业的翻译生成输入的第二翻译; 及(d)将第二笔译文寄回要求人。
-
公开(公告)号:US09396389B2
公开(公告)日:2016-07-19
申请号:US14509188
申请日:2014-10-08
CPC分类号: G06K9/00456 , G06K9/00449 , G06K9/2063 , G06K9/60
摘要: A digital camera associated with a mobile processing apparatus is used to produce a file containing a 2D digitized image of a document having pre-formatted fields for user's check marks. The image is electronically matched to a digital template of the document for extracting digitized images of the pre-formatted fields, which are thereafter analyzed for presence therein of user-entered check marks.
摘要翻译: 与移动处理装置相关联的数码相机用于产生包含用于用户复选标记的预格式化字段的文档的2D数字化图像的文件。 图像与文档的数字模板电子匹配,用于提取预格式化字段的数字化图像,此后分析其中存在用户输入的复选标记。
-
-
-
-
-
-
-
-
-