SYSTEMS AND METHODS FOR ENABLING MANUAL CLASSIFICATION OF UNRECOGNIZED DOCUMENTS TO COMPLETE WORKFLOW FOR ELECTRONIC JOBS AND TO ASSIST MACHINE LEARNING OF A RECOGNITION SYSTEM USING AUTOMATICALLY EXTRACTED FEATURES OF UNRECOGNIZED DOCUMENTS
    1.
    Invention application
    Status: under examination (published)

    Publication No.: US20090116755A1

    Publication date: 2009-05-07

    Application No.: US12266454

    Filing date: 2008-11-06

    IPC classification: G06K9/62

    CPC classification: G06K9/00442 G06K9/6885

    Abstract: A method in a document analysis system automatically extracts image and text features from each received electronic document and compares the extracted features with feature sets associated with each category of document to determine whether the document is recognizable as belonging to a document category. If an electronic document is recognized as belonging to one of the document categories, the method classifies the electronic document as belonging to that document category. If, however, an electronic document is unrecognized, the method submits the unrecognized document to a learning phase, in which the unrecognized document is presented to a human trainer for manual classification of the unrecognized electronic document into a document category, and automatically modifies at least one of the features and the weights of the feature set of the document category corresponding to the manually-classified electronic document using the automatically extracted features of the manually-classified document.
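
    The recognize-or-learn loop in this abstract can be sketched as follows. This is a minimal illustration, not the patented method: the feature-set representation (a name-to-weight dictionary), the additive score, the threshold, and the fixed update step are all assumptions introduced here, and every function name is hypothetical.

```python
from typing import Callable, Dict, Optional, Set

FeatureSet = Dict[str, float]  # assumed representation: feature name -> weight


def score(features: Set[str], feature_set: FeatureSet) -> float:
    """Sum the weights of the category features that occur in the document."""
    return sum(w for name, w in feature_set.items() if name in features)


def recognize(features: Set[str], categories: Dict[str, FeatureSet],
              threshold: float) -> Optional[str]:
    """Return the best-scoring category, or None if the document is unrecognized."""
    best = max(categories, key=lambda c: score(features, categories[c]), default=None)
    if best is None or score(features, categories[best]) < threshold:
        return None
    return best


def learn(features: Set[str], category: str, categories: Dict[str, FeatureSet],
          step: float = 0.5) -> None:
    """Learning phase: add unseen features and raise the weights of observed ones."""
    feature_set = categories.setdefault(category, {})
    for name in features:
        feature_set[name] = feature_set.get(name, 0.0) + step


def process(features: Set[str], categories: Dict[str, FeatureSet], threshold: float,
            ask_trainer: Callable[[Set[str]], str]) -> str:
    """Classify automatically when possible, otherwise defer to the human trainer."""
    category = recognize(features, categories, threshold)
    if category is None:
        category = ask_trainer(features)      # manual classification
        learn(features, category, categories)  # update that category's feature set
    return category


categories = {
    "invoice": {"word:invoice": 1.0, "layout:table": 0.5},
    "letter": {"word:dear": 1.0},
}
doc = {"word:amount_due", "layout:table"}

# First pass: the score (0.5) is below the threshold, so the trainer is asked.
first = process(doc, categories, threshold=1.0, ask_trainer=lambda f: "invoice")
# Second pass: the updated weights now recognize the same document unaided.
second = process(doc, categories, threshold=1.0, ask_trainer=lambda f: "letter")
```

    In this sketch the second call never reaches the trainer, which is the point of feeding the automatically extracted features back into the category's feature set.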

    Systems and methods for handling and distinguishing binarized, background artifacts in the vicinity of document text and image features indicative of a document category
    2.
    Granted invention patent
    Status: in force

    Publication No.: US08538184B2

    Publication date: 2013-09-17

    Application No.: US12266465

    Filing date: 2008-11-06

    IPC classification: G06K9/40 G06F17/30

    CPC classification: G06K9/00442 G06K9/6885

    Abstract: A method is provided for enhancing electronic documents received from a plurality of users by a document analysis system, in order to improve automatic recognition and classification of the received electronic documents. For each page of a received electronic document, the method filters the page to infer binarized-background artifacts that result from the binarization of the original grayscale or color image source document and that reside in the vicinity of binarized text and binarized image features in the page, so that the binarized text and binarized images may be distinguished from the binarized-background artifacts and extracted from the document. The method then uses the features extracted from the filtered document to automatically recognize and classify the document into a document category.
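
    The abstract does not say which filter is used. As one plausible stand-in, the sketch below removes small connected components (speckle) from a binarized page so that larger text and image strokes remain. The size threshold, 4-connectivity, and the function name are assumptions made here for illustration only.

```python
from collections import deque
from typing import List

Image = List[List[int]]  # 1 = foreground (ink), 0 = background


def remove_small_components(img: Image, min_size: int) -> Image:
    """Return a copy of img with 4-connected foreground blobs smaller than min_size cleared."""
    h = len(img)
    w = len(img[0]) if img else 0
    out = [row[:] for row in img]
    seen = [[False] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if img[y][x] != 1 or seen[y][x]:
                continue
            # Breadth-first search collects one connected component.
            component = []
            queue = deque([(y, x)])
            seen[y][x] = True
            while queue:
                cy, cx = queue.popleft()
                component.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if 0 <= ny < h and 0 <= nx < w and img[ny][nx] == 1 and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            # Treat tiny blobs as binarization artifacts and erase them.
            if len(component) < min_size:
                for cy, cx in component:
                    out[cy][cx] = 0
    return out


page = [
    [1, 0, 0, 0, 0],
    [0, 0, 1, 1, 0],
    [0, 0, 1, 1, 0],
    [0, 0, 0, 0, 1],
]
cleaned = remove_small_components(page, min_size=3)
```

    Here the two isolated pixels are cleared and the 2x2 block survives. A real filter would need to avoid erasing small but meaningful marks such as punctuation, which a pure size threshold cannot distinguish.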

    SYSTEMS AND METHODS FOR HANDLING AND DISTINGUISHING BINARIZED, BACKGROUND ARTIFACTS IN THE VICINITY OF DOCUMENT TEXT AND IMAGE FEATURES INDICATIVE OF A DOCUMENT CATEGORY
    3.
    Invention application
    Status: in force

    Publication No.: US20090119296A1

    Publication date: 2009-05-07

    Application No.: US12266465

    Filing date: 2008-11-06

    IPC classification: G06F17/30

    CPC classification: G06K9/00442 G06K9/6885

    Abstract: A method is provided for enhancing electronic documents received from a plurality of users by a document analysis system, in order to improve automatic recognition and classification of the received electronic documents. For each page of a received electronic document, the method filters the page to infer binarized-background artifacts that result from the binarization of the original grayscale or color image source document and that reside in the vicinity of binarized text and binarized image features in the page, so that the binarized text and binarized images may be distinguished from the binarized-background artifacts and extracted from the document. The method then uses the features extracted from the filtered document to automatically recognize and classify the document into a document category.

    Methods and apparatus for establishing secure communications between client computing devices that use transport and security protocols
    4.
    Granted invention patent
    Status: in force

    Publication No.: US08683053B2

    Publication date: 2014-03-25

    Application No.: US12979850

    Filing date: 2010-12-28

    IPC classification: G06F13/00

    Abstract: Methods and apparatuses, including computer program products, are described for establishing secure communications sessions between computing devices located behind network security devices. The method includes receiving, from a first client computing device, a request for a secure connection with a second client computing device, the request including a first transport protocol role and a first security protocol role associated with the first device. The method includes transmitting the request to the second device. The method includes receiving, from the second device, a response to the request including a second transport protocol role and a second security protocol role associated with the second device, transmitting the response to the first device, and establishing the secure connection between the first device and the second device, where the first and second security protocol roles are determined independently from the first and second transport protocol roles.
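
    The request/response exchange can be sketched as below. The role names (active/passive, client/server), the rule that the second device picks the complementary role on each axis, and all identifiers are assumptions for illustration; the abstract only states that each message carries a transport role and a security role, and that the security roles are determined independently of the transport roles.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Tuple


class TransportRole(Enum):
    ACTIVE = "active"    # opens the transport connection
    PASSIVE = "passive"  # waits for the transport connection


class SecurityRole(Enum):
    CLIENT = "client"  # starts the security handshake
    SERVER = "server"  # answers the security handshake


@dataclass(frozen=True)
class RoleOffer:
    device: str
    transport: TransportRole
    security: SecurityRole


def answer(request: RoleOffer, device: str) -> RoleOffer:
    """Hypothetical second device: choose each role from its own axis only."""
    transport = (TransportRole.PASSIVE if request.transport is TransportRole.ACTIVE
                 else TransportRole.ACTIVE)
    security = (SecurityRole.SERVER if request.security is SecurityRole.CLIENT
                else SecurityRole.CLIENT)
    return RoleOffer(device, transport, security)


def relay(request: RoleOffer,
          second_device: Callable[[RoleOffer], RoleOffer]) -> Tuple[RoleOffer, RoleOffer]:
    """Intermediary: forward the request unchanged and return the response."""
    forwarded = request
    response = second_device(forwarded)
    return forwarded, response


# The first device waits for the transport connection but starts the
# security handshake, so the two roles are not tied to each other.
request = RoleOffer("device-A", TransportRole.PASSIVE, SecurityRole.CLIENT)
forwarded, response = relay(request, lambda req: answer(req, "device-B"))
```

    Keeping the two role fields separate lets a device behind a restrictive network security device accept whichever transport direction can get through while still taking either side of the security handshake.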

    Method and system for secure data entry
    5.
    Granted invention patent
    Status: in force

    Publication No.: US08270720B1

    Publication date: 2012-09-18

    Application No.: US11708201

    Filing date: 2007-02-20

    IPC classification: G06K9/46

    CPC classification: G06F21/6254

    Abstract: The present invention includes a method of secure data entry that enables complex data entry work to be performed by unskilled workers, resulting in data entry with higher productivity, higher quality and higher security than data entry performed by highly skilled workers. The invention identifies data fields on an electronic image of an identified input page, sequences the identified data field images, and individually displays the data field images for manual data entry. The invention also provides for extracting data from a data field image and displaying the extracted data along with the corresponding data field image for approval or correction. Sequenced data field images are optionally reordered or randomized for display and manual entry.
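
    A minimal sketch of the field-by-field workflow follows, assuming each page has already been cut into named field images. The data structures, the shuffling step, and the `key_in` callback that stands in for a human operator are all hypothetical; the operator is shown only the cropped field, never the page or field name.

```python
import random
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class FieldImage:
    page_id: str
    field_name: str
    pixels: bytes  # placeholder for the cropped image of one data field


def sequence_fields(pages: Dict[str, Dict[str, bytes]]) -> List[FieldImage]:
    """Flatten identified fields from all pages into one work queue."""
    return [FieldImage(page_id, name, pixels)
            for page_id, fields in pages.items()
            for name, pixels in fields.items()]


def collect_entries(queue: List[FieldImage], key_in: Callable[[bytes], str],
                    rng: random.Random) -> Dict[str, Dict[str, str]]:
    """Show fields one at a time in random order, then reassemble by page."""
    shuffled = queue[:]
    rng.shuffle(shuffled)  # randomized display order
    results: Dict[str, Dict[str, str]] = {}
    for item in shuffled:
        typed = key_in(item.pixels)  # the operator sees only the field image
        results.setdefault(item.page_id, {})[item.field_name] = typed
    return results


pages = {
    "page-1": {"name": b"ALICE", "id_number": b"123"},
    "page-2": {"name": b"BOB"},
}
# Stand-in for manual typing: decode the placeholder bytes.
entries = collect_entries(sequence_fields(pages), lambda px: px.decode(),
                          random.Random(0))
```

    Because the results are keyed by page and field, the reassembled records are the same whatever order the fields were displayed in.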

    Establishing Secure Communications Between Client Computing Devices Located Behind Network Security Devices
    6.
    Invention application
    Status: in force

    Publication No.: US20120166656A1

    Publication date: 2012-06-28

    Application No.: US12979850

    Filing date: 2010-12-28

    IPC classification: G06F15/16

    Abstract: Methods and apparatuses, including computer program products, are described for establishing secure communications sessions between computing devices located behind network security devices. The method includes receiving, from a first client computing device, a request for a secure connection with a second client computing device, the request including a first transport protocol role and a first security protocol role associated with the first device. The method includes transmitting the request to the second device. The method includes receiving, from the second device, a response to the request including a second transport protocol role and a second security protocol role associated with the second device, transmitting the response to the first device, and establishing the secure connection between the first device and the second device, where the first and second security protocol roles are determined independently from the first and second transport protocol roles.

    SYSTEMS AND METHODS FOR TRAINING A DOCUMENT CLASSIFICATION SYSTEM USING DOCUMENTS FROM A PLURALITY OF USERS
    7.
    Invention application
    Status: under examination (published)

    Publication No.: US20090116756A1

    Publication date: 2009-05-07

    Application No.: US12266469

    Filing date: 2008-11-06

    IPC classification: G06K9/62

    CPC classification: G06K9/00442 G06K9/6885

    Abstract: A method is provided for training a document analysis system that automatically extracts image and text features from each received electronic document and compares the extracted features with feature sets associated with each document category. If an electronic document is recognized as belonging to one of the document categories with a predetermined confidence, the method classifies the electronic document as being of that document category. If, however, an electronic document is not recognized as belonging to one of the document categories with the predetermined confidence, the method submits the unrecognized document to a training phase in which the document is recognized as belonging to a document category, and automatically modifies at least one of the features and the weights of the features of the feature set for the document category of the now-recognized document.

    SYSTEMS AND METHODS FOR PARALLEL PROCESSING OF DOCUMENT RECOGNITION AND CLASSIFICATION USING EXTRACTED IMAGE AND TEXT FEATURES
    8.
    Invention application
    Status: under examination (published)

    Publication No.: US20090116746A1

    Publication date: 2009-05-07

    Application No.: US12266468

    Filing date: 2008-11-06

    IPC classification: G06K9/46

    CPC classification: G06K9/00442 G06K9/6885

    Abstract: In a method of parallel processing of jobs received from a plurality of users, a document analysis system that automatically classifies documents to organize each job automatically separates each job into its constituent electronic documents and automatically separates each document into subsets of electronic pages. For each page of each subset, the method automatically extracts image features that are indicative of how the document is laid out or textually organized. For each subset, the method automatically compares the extracted features with feature sets associated with each document category to determine a comparison score for the subset. The method then classifies the electronic document as being one of the categories of documents using the comparison score for each of the subsets, and organizes the job according to the categories of documents the job contains.
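
    The split, score, and combine steps can be sketched as follows. The subset size, the overlap-count score, the summing of subset scores, and all names are assumptions made for illustration; the abstract does not specify how comparison scores are computed or combined. Threads are used only to keep the sketch self-contained, and a CPU-bound system would more likely use separate processes or machines.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, List, Set

Page = Set[str]  # assumed: each page is reduced to a set of extracted feature names


def split_into_subsets(pages: List[Page], size: int) -> List[List[Page]]:
    """Separate a document into consecutive subsets of pages."""
    return [pages[i:i + size] for i in range(0, len(pages), size)]


def score_subset(subset: List[Page], categories: Dict[str, Set[str]]) -> Dict[str, int]:
    """Comparison score per category: count of matching features over the subset."""
    return {cat: sum(len(page & feats) for page in subset)
            for cat, feats in categories.items()}


def classify_document(pages: List[Page], categories: Dict[str, Set[str]],
                      subset_size: int = 2) -> str:
    """Score the subsets concurrently, then pick the category with the highest total."""
    subsets = split_into_subsets(pages, subset_size)
    with ThreadPoolExecutor() as pool:
        scores = list(pool.map(lambda s: score_subset(s, categories), subsets))
    totals = {cat: sum(s[cat] for s in scores) for cat in categories}
    return max(totals, key=totals.get)


categories = {
    "tax_form": {"layout:box_grid", "word:wages"},
    "letter": {"word:dear", "layout:signature"},
}
document = [
    {"layout:box_grid"},
    {"word:wages", "layout:box_grid"},
    {"layout:signature"},
]
label = classify_document(document, categories)
```

    With a subset size of 2 the three pages form two subsets; the first scores 3 for `tax_form` and the second scores 1 for `letter`, so the document is labeled `tax_form`.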

    SYSTEMS AND METHODS TO AUTOMATICALLY CLASSIFY ELECTRONIC DOCUMENTS USING EXTRACTED IMAGE AND TEXT FEATURES AND USING A MACHINE LEARNING SUBSYSTEM
    9.
    Invention application
    Status: under examination (published)

    Publication No.: US20090116736A1

    Publication date: 2009-05-07

    Application No.: US12266462

    Filing date: 2008-11-06

    IPC classification: G06K9/62

    CPC classification: G06K9/00442 G06K9/6885

    Abstract: A document analysis system that automatically classifies documents by recognizing distinctive features in each document comprises a document acquisition system, a document recognition training system, a document classification system, a document recognition system, and a job organization system. The document acquisition system receives jobs, wherein each job contains at least one electronic document. The document feature recognition system automatically extracts image and text features from each received document. The document classification system automatically classifies recognized electronic documents by finding the best match between the extracted features of each document and the feature sets associated with each category of document. The document recognition training system automatically trains the feature set for each corresponding category of documents, wherein the training system, using extracted features of unrecognized documents, automatically modifies the feature set for a document category. The job organization system automatically organizes each job according to the document categories it contains.
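
    The last two subsystems, best-match classification and job organization, can be sketched as below. The Jaccard overlap used as the match measure, the feature names, and the function names are assumptions chosen for illustration; the abstract does not define how the best match is computed.

```python
from collections import defaultdict
from typing import Dict, List, Set


def best_match(features: Set[str], categories: Dict[str, Set[str]]) -> str:
    """Pick the category whose feature set overlaps most with the document (Jaccard)."""
    def overlap(category: str) -> float:
        feats = categories[category]
        union = features | feats
        return len(features & feats) / len(union) if union else 0.0
    return max(categories, key=overlap)


def organize_job(job: Dict[str, Set[str]],
                 categories: Dict[str, Set[str]]) -> Dict[str, List[str]]:
    """Group the documents of one job by their best-matching category."""
    organized: Dict[str, List[str]] = defaultdict(list)
    for doc_id, features in job.items():
        organized[best_match(features, categories)].append(doc_id)
    return dict(organized)


categories = {
    "wage_statement": {"layout:box_grid", "word:wages"},
    "receipt": {"word:total", "image:logo"},
}
job = {
    "doc1": {"word:total"},
    "doc2": {"layout:box_grid", "word:wages", "image:logo"},
    "doc3": {"image:logo", "word:total"},
}
organized = organize_job(job, categories)
```

    Here `doc1` and `doc3` are grouped under `receipt` and `doc2` under `wage_statement`. This sketch always returns some category; a fuller version would route low-scoring documents to the training system instead.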
