Systems and methods for identifying and categorizing electronic documents through machine learning
    22.
    发明授权
    Systems and methods for identifying and categorizing electronic documents through machine learning 有权
    通过机器学习识别和分类电子文档的系统和方法

    公开(公告)号:US09514414B1

    公开(公告)日:2016-12-06

    申请号:US15088481

    申请日:2016-04-01

    Abstract: Computer implemented systems and methods are disclosed for identifying and categorizing electronic documents through machine learning. In accordance with some embodiments, a seed set of categorized electronic documents may be used to train a document categorizer based on a machine learning algorithm. The trained document categorizer may categorize electronic documents in a large corpus of electronic documents. Performance metrics associated with performance of the trained document categorizer may be tracked, and additional seed sets of categorized electronic documents may be used to improve the performance of document categorizer by retraining the document categorizer on subsequent seed sets. Additional seed sets may and categorizations may be iterated through until a desired document categorization performance is reached.

    Abstract translation: 公开了计算机实现的系统和方法,用于通过机器学习识别和分类电子文档。 根据一些实施例,可以使用分类电子文档的种子集合来基于机器学习算法来训练文档分类器。 经过培训的文档分类器可以将电子文档分类为大型电子文档语料库。 可以跟踪与经过训练的文档分类器的性能相关联的性能度量,并且可以使用分类电子文档的附加种子集来通过在后续种子集上重新训练文档分类器来提高文档分类器的性能。 可以遍历额外的种子集合和分类,直到达到期望的文档分类表现。

    INTERNAL MALWARE DATA ITEM CLUSTERING AND ANALYSIS
    23.
    发明申请
    INTERNAL MALWARE DATA ITEM CLUSTERING AND ANALYSIS 有权
    内部恶意数据项集合和分析

    公开(公告)号:US20160006749A1

    公开(公告)日:2016-01-07

    申请号:US14486991

    申请日:2014-09-15

    Abstract: Embodiments of the present disclosure relate to a data analysis system that may automatically generate memory-efficient clustered data structures, automatically analyze those clustered data structures, and provide results of the automated analysis in an optimized way to an analyst. The automated analysis of the clustered data structures (also referred to herein as data clusters) may include an automated application of various criteria or rules so as to generate a compact, human-readable analysis of the data clusters. The human-readable analyses (also referred to herein as “summaries” or “conclusions”) of the data clusters may be organized into an interactive user interface so as to enable an analyst to quickly navigate among information associated with various data clusters and efficiently evaluate those data clusters in the context of, for example, a fraud investigation. Embodiments of the present disclosure also relate to automated scoring of the clustered data structures.

    Abstract translation: 本公开的实施例涉及一种数据分析系统,其可以自动生成存储器有效的集群数据结构,自动分析这些集群数据结构,并以优化的方式向分析者提供自动化分析的结果。 集群数据结构(本文中也称为数据集群)的自动化分析可以包括各种标准或规则的自动应用,以便生成数据集群的紧凑的,人类可读的分析。 可以将数据集群的可读分析(也称为“摘要”或“结论”)组织成交互式用户界面,以使分析人员能够在与各种数据集群相关联的信息之间快速导航,并有效地评估 这些数据集群在例如欺诈调查的背景下。 本公开的实施例还涉及聚类数据结构的自动评分。

    Systems and methods for active column filtering
    24.
    发明授权
    Systems and methods for active column filtering 有权
    主动列过滤的系统和方法

    公开(公告)号:US09009171B1

    公开(公告)日:2015-04-14

    申请号:US14268964

    申请日:2014-05-02

    Abstract: Systems and methods are disclosed for active column filtering. In accordance with one implementation, a method is provided for active column filtering. The method includes providing a table having data values arranged in rows and columns, providing a first filter location indicator whose location is visually associated with a first column, and providing a first interface based on a selection of the first filter location indicator, wherein the first interface's location is visually associated with the first column. The method also includes acquiring a first filter input entered into the first interface, filtering the table based on the acquired first filter input, providing the filtered table for displaying, and providing an applied filter indicator, whose location is visually associated with the first column, the applied filter indicator including at least the first filter input.

    Abstract translation: 公开了用于活性柱过滤的系统和方法。 根据一个实现,提供了一种用于主动列过滤的方法。 该方法包括提供具有以行和列排列的数据值的表,提供其位置与第一列可视地相关联的第一过滤器位置指示符,以及基于第一过滤器位置指示符的选择来提供第一接口,其中第一 界面的位置与第一列视觉相关联。 该方法还包括获取输入到第一接口中的第一过滤器输入,基于所获取的第一过滤器输入来过滤表,提供用于显示的过滤表,并提供其位置与第一列视觉相关联的应用过滤器指示符, 应用的滤波器指示器至少包括第一滤波器输入。

    Data analysis system and method
    25.
    发明授权

    公开(公告)号:US12032583B2

    公开(公告)日:2024-07-09

    申请号:US17812961

    申请日:2022-07-15

    Abstract: This disclosure relates to a system and method for data analysis. According to a first aspect, there is described a method, the method being performed using one or more processors, comprising: receiving one or more user inputs indicative of one or more relationships between data in a plurality of datasets; determining, based on the one or more user inputs, at least one object view for visualizing the data in the plurality of datasets; generating, based on the one or more user inputs, metadata comprising: an object graph indicative of the one or more relationships between two or more of the plurality of datasets; and information identifying the at least one object view; and in response to a query relating to the plurality of datasets, using the metadata to determine how response data responding to the query should be provided.

    DATA ANALYSIS SYSTEM AND METHOD
    28.
    发明申请

    公开(公告)号:US20210042303A1

    公开(公告)日:2021-02-11

    申请号:US17078525

    申请日:2020-10-23

    Abstract: This disclosure relates to a system and method for data analysis. According to a first aspect, there is described a method, the method being performed using one or more processors, comprising: receiving one or more user inputs indicative of one or more relationships between data in a plurality of datasets; determining, based on the one or more user inputs, at least one object view for visualizing the data in the plurality of datasets; generating, based on the one or more user inputs, metadata comprising: an object graph indicative of the one or more relationships between two or more of the plurality of datasets; and information identifying the at least one object view; and in response to a query relating to the plurality of datasets, using the metadata to determine how response data responding to the query should be provided.

Patent Agency Ranking