System for enhancing expert-based computerized analysis of a set of digital documents and methods useful in conjunction therewith
    Granted invention patent
    Legal status: in force

    Publication No.: US08533194B1

    Publication date: 2013-09-10

    Application No.: US13161087

    Filing date: 2011-06-15

    IPC classification: G06F7/00

    CPC classification: G06N99/005

    Abstract: An electronic document analysis method using a processor for analyzing N electronic documents, the method comprising: providing a set of control electronic documents from among the N electronic documents; and using the set of control electronic documents and a processor to evaluate at least one aspect of a computerized text-classifier based electronic document categorization process performed on the N documents, including computation of at least one statistic; wherein the providing includes: providing an initial set of control electronic documents; computing, using a processor, an estimated validation level of the at least one statistic assuming the initial set is used; comparing the estimated validation level to a desired validation level, using a processor; and enlarging the initial set of control electronic documents if the estimated validation level falls below the desired validation level.
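
    The loop described in the abstract (start with an initial control set, estimate a validation level for a statistic, compare it with a desired level, and enlarge the set if it falls short) can be illustrated with a minimal sketch. The abstract does not define the statistic or the validation level, so everything specific here is an assumption made for illustration: the statistic is taken to be the proportion of control documents labeled relevant, the validation level is taken to be 1 minus the Agresti-Coull margin of error at roughly 95% confidence, and the names `estimated_validation_level`, `build_control_set`, `label_fn`, and `step` are hypothetical. It is not the patented method.

```python
import math
import random


def estimated_validation_level(labels, z=1.96):
    """Assumed stand-in for the 'validation level' of a proportion statistic.

    Returns 1 minus the Agresti-Coull margin of error of the proportion
    of positive labels. The abstract does not define this quantity.
    """
    n = len(labels)
    if n == 0:
        return 0.0
    positives = sum(1 for x in labels if x)
    n_adj = n + z * z                          # adjusted sample size
    p_adj = (positives + z * z / 2) / n_adj    # adjusted proportion
    margin = z * math.sqrt(p_adj * (1 - p_adj) / n_adj)
    return 1.0 - margin


def build_control_set(doc_ids, label_fn, initial_size, desired_level,
                      step, seed=0):
    """Draw an initial control set, then enlarge it in increments of
    `step` until the estimated validation level reaches `desired_level`
    or the N documents are exhausted. `label_fn` is a hypothetical
    stand-in for the expert's relevance decision on one document."""
    rng = random.Random(seed)
    pool = list(doc_ids)
    rng.shuffle(pool)                          # random sample without replacement
    control = pool[:min(initial_size, len(pool))]
    labels = [label_fn(d) for d in control]
    while (estimated_validation_level(labels) < desired_level
           and len(control) < len(pool)):
        extra = pool[len(control):len(control) + step]
        control.extend(extra)
        labels.extend(label_fn(d) for d in extra)
    return control, estimated_validation_level(labels)


if __name__ == "__main__":
    # Toy corpus: 10,000 document ids, 10% of which are "relevant".
    control, level = build_control_set(
        range(10_000), lambda d: d % 10 == 0,
        initial_size=50, desired_level=0.97, step=50)
    print(len(control), round(level, 4))
```

    In this sketch the control set only ever grows, mirroring the "enlarging the initial set" step; a real system would presumably also account for the cost of expert review when choosing the increment.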
