SYSTEMS AND METHODS OF PROCESSING SCANNED DATA
    1.
    Patent application
    Status: granted

    Publication number: US20140233068A1

    Publication date: 2014-08-21

    Application number: US14266671

    Filing date: 2014-04-30

    Applicant: Kofax, Inc.

    CPC classification number: G06K15/407 G06K9/3208 G06T5/00 H04N1/40

    Abstract: An efficient method and system to enhance digital acquisition devices for analog data is presented. The enhancements offered by the method and system are available to the user in local as well as in remote deployments, yielding efficiency gains for a large variety of business processes. The quality enhancements of the acquired digital data are achieved efficiently by employing virtual reacquisition. The method of virtual reacquisition renders unnecessary the physical reacquisition of the analog data in case the digital data obtained by the acquisition device are of insufficient quality. The method and system allow multiple users to access the same acquisition device for analog data. In some embodiments, one or more users can virtually reacquire data provided by multiple analog or digital sources. The acquired raw data can be processed by each user according to his personal preferences and/or requirements. The preferred processing settings and attributes are determined interactively in real time or in non-real time, automatically, or by a combination thereof.

    SYSTEMS AND METHODS FOR ORGANIZING DATA SETS

    Publication number: US20170329838A1

    Publication date: 2017-11-16

    Application number: US15666409

    Filing date: 2017-08-01

    Applicant: Kofax, Inc.

    CPC classification number: G06F17/30598 G06F17/30312 G06F17/3053 G06N99/005

    Abstract: According to one embodiment, a computer-implemented method for cleaning up a data set having a possible incorrect label includes: selecting a plurality of training documents; estimating a quality of an organization of a plurality of categories; and determining whether the quality of the organization is greater than a predetermined quality threshold. Corresponding system and computer program product embodiments are also presented. Other aspects and advantages of the present invention will become apparent from the following detailed description, which, when taken in conjunction with the drawings, illustrate by way of example the principles of the invention.
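
The abstract does not say how the quality of an organization is estimated. As a minimal sketch only, assuming a leave-one-out nearest-neighbour agreement score over word sets, the threshold test might look like the following. The function names, the Jaccard similarity, and the 0.8 threshold are illustrative assumptions and are not taken from the application.

```python
def tokens(text):
    """Lower-cased set of whitespace-separated words."""
    return set(text.lower().split())


def jaccard(a, b):
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def organization_quality(docs):
    """Fraction of documents whose most similar *other* document has the same label.

    docs is a list of (text, label) pairs and must hold at least two entries.
    This measure is an assumption made for illustration.
    """
    token_sets = [tokens(text) for text, _ in docs]
    agree = 0
    for i, (_, label) in enumerate(docs):
        nearest = max(
            (j for j in range(len(docs)) if j != i),
            key=lambda j: jaccard(token_sets[i], token_sets[j]),
        )
        if docs[nearest][1] == label:
            agree += 1
    return agree / len(docs)


def is_well_organized(docs, threshold=0.8):
    """True when the estimated quality exceeds the (hypothetical) threshold."""
    return organization_quality(docs) > threshold
```

A low score suggests that some labels disagree with their neighbours, which is one plausible signal that the set contains incorrect labels and needs cleaning.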

    SYSTEMS AND METHODS FOR ORGANIZING DATA SETS
    3.
    Patent application
    Status: granted

    Publication number: US20150269245A1

    Publication date: 2015-09-24

    Application number: US14733742

    Filing date: 2015-06-08

    Applicant: Kofax, Inc.

    CPC classification number: G06F17/30598 G06F17/30312 G06F17/3053 G06N99/005

    Abstract: A method is provided for organizing data sets. In use, an automatic decision system is created or updated for determining whether data elements fit a predefined organization or not, where the decision system is based on a set of preorganized data elements. A plurality of data elements is organized using the decision system. At least one organized data element is selected for output to a user based on a score or confidence from the decision system for the at least one organized data element. Additionally, at least a portion of the at least one organized data element is output to the user. A response is received from the user comprising at least one of a confirmation, modification, and a negation of the organization of the at least one organized data element. The automatic decision system is recreated or updated based on the user response. Other embodiments are also presented.
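
The loop described in this abstract (build a decision system from preorganized elements, organize new elements, show one to the user, update from the response) can be sketched as below. This is a toy illustration under stated assumptions: `DecisionSystem`, `review_round`, the word-count scoring, and the choice to show the user the least confident element are all hypothetical and are not drawn from the application. A bare negation without a replacement category is not modelled.

```python
from collections import Counter, defaultdict


class DecisionSystem:
    """Toy decision system: per-category word counts (illustrative only)."""

    def __init__(self, preorganized):
        self.word_counts = defaultdict(Counter)
        for text, category in preorganized:
            self.update(text, category)

    def update(self, text, category):
        """Fold one organized element into the system."""
        self.word_counts[category].update(text.lower().split())

    def organize(self, text):
        """Return (category, confidence) for one data element."""
        words = text.lower().split()
        scores = {
            category: sum(counts[word] for word in words)
            for category, counts in self.word_counts.items()
        }
        total = sum(scores.values())
        best = max(scores, key=scores.get)
        confidence = scores[best] / total if total else 0.0
        return best, confidence


def review_round(system, elements, ask_user):
    """Organize elements, show the least confident one, update from the reply.

    ask_user(text, proposed_category) returns the category the user settles on:
    the same category for a confirmation, a different one for a modification.
    """
    organized = [(text, *system.organize(text)) for text in elements]
    text, proposed, _ = min(organized, key=lambda item: item[2])
    final = ask_user(text, proposed)
    system.update(text, final)
    return text, proposed, final
```

In practice `ask_user` would be backed by a user interface; here any callable with that shape works, which keeps the loop easy to exercise without one.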

    DATA CLASSIFICATION USING MACHINE LEARNING TECHNIQUES
    5.
    Patent application
    Status: under examination (published)

    Publication number: US20140207717A1

    Publication date: 2014-07-24

    Application number: US14225298

    Filing date: 2014-03-25

    Applicant: Kofax, Inc.

    Abstract: Systems, methods and computer program products for classifying documents are presented. Systems, methods and computer program products for analyzing documents, e.g. for verifying an association of an invoice with an entity are also presented. Systems, methods and computer program products for managing medical records are presented. One exemplary system includes a memory; and a processor in communication with the memory, the processor being configured to process at least some instructions stored in the memory. The memory stores computer executable program code comprising instructions for: training a classifier based on an invoice format associated with a first entity; accessing a plurality of invoices labeled as being associated with at least one of the first entity and other entities; and outputting an identifier of at least one of the invoices having a high probability of not being associated with the first entity.

    SYSTEMS, METHODS AND COMPUTER PROGRAM PRODUCTS FOR DETERMINING DOCUMENT VALIDITY
    6.
    Patent application
    Status: granted

    Publication number: US20140153787A1

    Publication date: 2014-06-05

    Application number: US14176006

    Filing date: 2014-02-07

    Applicant: Kofax, Inc.

    Abstract: In one embodiment, a method includes performing optical character recognition (OCR) on an image of a financial document and at least one of: (a) correcting OCR errors in the financial document using at least one of textual information from a complementary document and predefined business rules; (b) normalizing data from the complementary document using at least one of textual information from the financial document and the predefined business rules; and (c) normalizing data from the financial document using at least one of textual information from the complementary document and the predefined business rules. Exemplary systems and computer program products are also disclosed.

    SYSTEMS AND METHODS OF PROCESSING SCANNED DATA
    7.
    Patent application
    Status: granted

    Publication number: US20140333971A1

    Publication date: 2014-11-13

    Application number: US14340460

    Filing date: 2014-07-24

    Applicant: Kofax, Inc.

    Abstract: A method includes storing raw or normalized video data in a computer accessible storage medium; analyzing portions of the video data with a first analytic engine to: determine whether the raw video data is within a first set of parameters; and generate with the first analytic engine a first set of processor settings; processing the raw or normalized video data with the first set of processor settings; and analyzing portions of the processed data with a second analytic engine to determine whether the processed data is within a second set of parameters; generating with the second analytic engine a second set of processor settings to reprocess the raw or normalized video data, sending the second set of processor settings to the first analytic engine; and reprocessing the raw or normalized video data with the first analytic engine using the second set of processor settings.
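
The two-engine flow in this abstract can be illustrated with a small sketch. It assumes the stored raw data is a list of 8-bit grayscale values and that "processing" means binarizing against a threshold. The function names, the parameter ranges, the mean and median heuristics, and the second engine's access to the stored raw data are all assumptions made for illustration, not details from the application.

```python
from statistics import mean, median


def process(raw, settings):
    """Binarize the raw data: 1 (black) where a pixel is below the threshold."""
    return [1 if pixel < settings["threshold"] else 0 for pixel in raw]


def first_engine(raw):
    """Check the raw data against a first parameter set and propose settings."""
    ok = len(raw) > 0 and all(0 <= pixel <= 255 for pixel in raw)
    settings = {"threshold": mean(raw)} if ok else None
    return ok, settings


def second_engine(processed, raw):
    """Check the processed data against a second parameter set.

    Here the black-pixel fraction must lie in [0.2, 0.8]; otherwise a second
    set of settings (median threshold) is proposed for reprocessing.
    """
    black_fraction = sum(processed) / len(processed)
    ok = 0.2 <= black_fraction <= 0.8
    new_settings = None if ok else {"threshold": median(raw)}
    return ok, new_settings


def virtual_reacquire(raw):
    """Process stored raw data, reprocessing it once if the result is rejected."""
    ok, settings = first_engine(raw)
    if not ok:
        raise ValueError("raw data is outside the first parameter set")
    processed = process(raw, settings)
    ok, new_settings = second_engine(processed, raw)
    if not ok:
        # The stored raw data is reused, so no physical rescan is needed.
        settings = new_settings
        processed = process(raw, settings)
    return processed, settings
```

The point of the design is that the raw data stays in storage, so a rejected result triggers a reprocessing pass with new settings rather than a new scan.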

    SYSTEMS AND METHODS FOR ORGANIZING DATA SETS
    8.
    Patent application
    Status: under examination (published)

    Publication number: US20130041863A1

    Publication date: 2013-02-14

    Application number: US13655267

    Filing date: 2012-10-18

    Applicant: KOFAX, INC.

    CPC classification number: G06F16/285 G06F16/22 G06F16/24578 G06N20/00

    Abstract: A method is provided for organizing data sets. In use, an automatic decision system is created or updated for determining whether data elements fit a predefined organization or not, where the decision system is based on a set of preorganized data elements. A plurality of data elements is organized using the decision system. At least one organized data element is selected for output to a user based on a score or confidence from the decision system for the at least one organized data element. Additionally, at least a portion of the at least one organized data element is output to the user. A response is received from the user comprising at least one of a confirmation, modification, and a negation of the organization of the at least one organized data element. The automatic decision system is recreated or updated based on the user response. Other embodiments are also presented.

    SYSTEMS AND METHODS FOR ORGANIZING DATA SETS

    Publication number: US20170140030A1

    Publication date: 2017-05-18

    Application number: US15422435

    Filing date: 2017-02-01

    Applicant: Kofax, Inc.

    CPC classification number: G06F17/30598 G06F17/30312 G06F17/3053 G06N99/005

    Abstract: According to one embodiment, a computer-implemented method for confirming/rejecting a most relevant example includes: generating a binary decision model by training a binary classifier using a plurality of training documents; classifying one or more test documents into one of a plurality of categories using the binary decision model, wherein the one or more test documents lack a user-defined category label; selecting a most relevant example of the classified test documents from among the classified test documents; displaying, using a display of the computer, the most relevant example of the classified test documents to a user; receiving, via the computer and from the user, a confirmation or a negation of a classification label of the most relevant example of the classified test documents; and storing the confirmation or the negation of the classification label of the most relevant example of the classified test documents to a memory of the computer.

    SYSTEMS, METHODS AND COMPUTER PROGRAM PRODUCTS FOR DETERMINING DOCUMENT VALIDITY
    10.
    Patent application
    Status: granted

    Publication number: US20130308832A1

    Publication date: 2013-11-21

    Application number: US13948046

    Filing date: 2013-07-22

    Applicant: Kofax, Inc.

    CPC classification number: G06K9/00442 G06K9/00469 H04N1/40

    Abstract: A method according to one embodiment includes performing optical character recognition (OCR) on an image of a first document; and at least one of: correcting OCR errors in the first document using at least one of textual information from a complementary document and predefined business rules; normalizing data from the complementary document using at least one of textual information from the first document and the predefined business rules; and normalizing data from the first document using at least one of textual information from the complementary document and the predefined business rules. Additional systems, methods and computer program products are also presented.
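
The first option in this abstract, correcting OCR errors with text from a complementary document, can be sketched with fuzzy string matching. The example below is a simplified illustration that uses Python's standard `difflib.get_close_matches`. The function name `correct_with_complement`, the token-level matching, and the 0.8 similarity cutoff are assumptions and do not come from the application, which does not specify a matching technique.

```python
import difflib


def correct_with_complement(ocr_tokens, complementary_tokens, cutoff=0.8):
    """Replace each OCR token with its closest complementary-document token.

    A token is replaced only when a candidate reaches the similarity cutoff;
    otherwise the OCR output is kept unchanged.
    """
    corrected = []
    for token in ocr_tokens:
        matches = difflib.get_close_matches(
            token, complementary_tokens, n=1, cutoff=cutoff
        )
        corrected.append(matches[0] if matches else token)
    return corrected


# Hypothetical sample data: OCR output of a first document and the text of a
# complementary document such as a purchase order.
ocr_output = ["lnvoice", "Purchasc", "Total"]
purchase_order = ["Invoice", "Purchase", "Order"]
print(correct_with_complement(ocr_output, purchase_order))
# → ['Invoice', 'Purchase', 'Total']
```

A high cutoff keeps the correction conservative: tokens with no close counterpart in the complementary document, such as "Total" here, pass through untouched.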
