SYSTEMS AND METHODS FOR CONTENT EXTRACTION
    51.
    发明申请
    SYSTEMS AND METHODS FOR CONTENT EXTRACTION 有权
    用于内容提取的系统和方法

    公开(公告)号:US20130326332A1

    公开(公告)日:2013-12-05

    申请号:US13900912

    申请日:2013-05-23

    IPC分类号: G06F17/22

    摘要: Systems and methods are presented for content extraction from markup language text. The content extraction process may parse markup language text into a hierarchical data model and then apply one or more filters. Output filters may be used to make the process more versatile. The operation of the content extraction process and the one or more filters may be controlled by one or more settings set by a user, or automatically by a classifier. The classifier may automatically enter settings by classifying markup language text and entering settings based on this classification. Automatic classification may be performed by clustering unclassified markup language texts with previously classified markup language texts.

    摘要翻译: 介绍了从标记语言文本中提取内容的系统和方法。 内容提取过程可以将标记语言文本解析成分层数据模型,然后应用一个或多个过滤器。 输出滤波器可用于使该过程更加通用。 内容提取处理和一个或多个过滤器的操作可以由用户设置的一个或多个设置或由分类器自动地控制。 分类器可以通过分类标记语言文本并基于此分类输入设置来自动输入设置。 可以通过将未分类的标记语言文本与先前分类的标记语言文本进行聚类来执行自动分类。

    System and methods for detecting intrusions in a computer system by monitoring operating system registry accesses
    52.
    发明申请
    System and methods for detecting intrusions in a computer system by monitoring operating system registry accesses 有权
    通过监视操作系统注册表访问来检测计算机系统中的入侵的系统和方法

    公开(公告)号:US20090083855A1

    公开(公告)日:2009-03-26

    申请号:US12154405

    申请日:2008-05-21

    IPC分类号: G06F21/22 G06F11/30

    CPC分类号: G06F21/552 H04L63/1416

    摘要: A method for detecting intrusions in the operation of a computer system is disclosed which comprises gathering features from records of normal processes that access the files system of the computer, such as the Windows registry, and generating a probabilistic model of normal computer system usage based on occurrences of said features. The features of a record of a process that accesses the Windows registry are analyzed to determine whether said access to the Windows registry is an anomaly. A system is disclosed, comprising a registry auditing module configured to gather records regarding processes that access the Windows registry; a model generator configured to generate a probabilistic model of normal computer system usage based on records of a plurality of processes that access the Windows registry and that are indicative of normal computer system usage; and a model comparator configured to determine whether the access of the Windows registry is an anomaly.

    摘要翻译: 公开了一种用于检测计算机系统操作中的入侵的方法,其包括从访问诸如Windows注册表的计算机的文件系统的正常进程的记录中收集特征,并且基于以下方式生成基于计算机系统的正常计算机系统使用的概率模型: 出现所述特征。 分析访问Windows注册表的进程记录的功能,以确定对Windows注册表的访问是否为异常。 公开了一种系统,其包括注册表审核模块,其被配置为收集关于访问所述Windows注册表的进程的记录; 模型生成器,其被配置为基于访问Windows注册表并且指示正常的计算机系统使用的多个进程的记录来生成正常计算机系统使用的概率模型; 以及配置为确定Windows注册表的访问是否是异常的模型比较器。

    System and methods for detecting intrusions in a computer system by monitoring operating system registry accesses
    53.
    发明授权
    System and methods for detecting intrusions in a computer system by monitoring operating system registry accesses 有权
    通过监视操作系统注册表访问来检测计算机系统中的入侵的系统和方法

    公开(公告)号:US07448084B1

    公开(公告)日:2008-11-04

    申请号:US10352343

    申请日:2003-01-27

    IPC分类号: G06F21/22 G06F11/30

    CPC分类号: G06F21/552 H04L63/1416

    摘要: A method for detecting intrusions in the operation of a computer system is disclosed which comprises gathering features from records of normal processes that access the files system of the computer, such as the Windows registry, and generating a probabilistic model of normal computer system usage based on occurrences of said features. The features of a record of a process that accesses the Windows registry are analyzed to determine whether said access to the Windows registry is an anomaly. A system is disclosed, comprising a registry auditing module configured to gather records regarding processes that access the Windows registry; a model generator configured to generate a probabilistic model of normal computer system usage based on records of a plurality of processes that access the Windows registry and that are indicative of normal computer system usage; and a model comparator configured to determine whether the access of the Windows registry is an anomaly.

    摘要翻译: 公开了一种用于检测计算机系统操作中的入侵的方法,其包括从访问诸如Windows注册表的计算机的文件系统的正常进程的记录中收集特征,并且基于以下方式生成基于计算机系统的正常计算机系统使用的概率模型: 出现所述特征。 分析访问Windows注册表的进程记录的功能,以确定对Windows注册表的访问是否为异常。 公开了一种系统,其包括注册表审核模块,其被配置为收集关于访问所述Windows注册表的进程的记录; 模型生成器,其被配置为基于访问Windows注册表并且指示正常的计算机系统使用的多个进程的记录来生成正常计算机系统使用的概率模型; 以及配置为确定Windows注册表的访问是否是异常的模型比较器。

    System and methods for anomaly detection and adaptive learning
    54.
    发明授权
    System and methods for anomaly detection and adaptive learning 有权
    异常检测和自适应学习的系统和方法

    公开(公告)号:US07424619B1

    公开(公告)日:2008-09-09

    申请号:US10269694

    申请日:2002-10-11

    摘要: In a method of generating an anomaly detection model for classifying activities of a computer system, using a training set of data corresponding to activity on the computer system, the training set comprising a plurality of instances of data having features, and wherein each feature in said plurality of features has a plurality of values. For a selected feature and a selected value of the selected feature, a quantity is determined which corresponds to the relative sparsity of such value. The quantity may correspond to the difference between the number occurrences of the selected value and the number of occurrences of the most frequently occurring value. These instances are classified as anomaly and added to the training set of normal data to generate a rule set or other detection model.

    摘要翻译: 在产生用于对计算机系统的活动进行分类的异常检测模型的方法中,使用与计算机系统上的活动相对应的数据的训练集合,所述训练集合包括具有特征的多个数据实例,并且其中所述 多个特征具有多个值。 对于所选特征和所选特征的选定值,确定与该值相对稀疏度对应的数量。 数量可以对应于所选值的出现次数与最常发生值的出现次数之间的差异。 这些实例被分类为异常,并添加到正常数据的训练集中以生成规则集或其他检测模型。

    Method and system for using intelligent agents for financial
transactions, services, accounting, and advice
    55.
    发明授权
    Method and system for using intelligent agents for financial transactions, services, accounting, and advice 失效
    使用智能代理进行金融交易,服务,会计和咨询的方法和系统

    公开(公告)号:US5920848A

    公开(公告)日:1999-07-06

    申请号:US10677

    申请日:1998-01-22

    摘要: The present invention relates to the use of computerized intelligent agents to facilitate the integration of networked performance of financial transactions with computerized methods of financial accounting. Incorporated into this combined financial transaction/financial accounting system are intelligent agents that automatically analyze the system information to provide users with financial advice. This invention permits the automated performance on-line of a wide variety of financial transactions and integrates these transactions with computerized financial accounting. All of this information is collated and analyzed automatically by intelligent agents, which generate user-specific financial reports, profiles, and advice, and under appropriate conditions take action.

    摘要翻译: 本发明涉及使用计算机智能代理来促进金融交易的联网绩效与计算机化的财务会计方法的整合。 并入该组合的金融交易/财务会计系统是智能代理,可自动分析系统信息,为用户提供财务咨询。 本发明允许在线进行各种金融交易的自动化表现,并将这些交易与计算机化的财务会计相结合。 所有这些信息都由智能代理自动整理和分析,这些代理生成用户特定的财务报告,配置文件和建议,并在适当的条件下采取行动。

    Method of merging large databases in parallel

    公开(公告)号:US5717915A

    公开(公告)日:1998-02-10

    申请号:US610639

    申请日:1996-03-04

    摘要: The semantic integration problem for merging multiple databases of very large size, the merge/purge problem, can be solved by multiple runs of the sorted neighborhood method or the clustering method with small windows followed by the computation of the transitive closure over the results of each run. The sorted neighborhood method works well under this scheme but is computationally expensive due to the sorting phase. An alternative method based on data clustering that reduces the complexity to linear time making multiple runs followed by transitive closure feasible and efficient. A method is provided for identifying duplicate records in a database, each record having at least one field and a plurality of keys, including the steps of sorting the records according to a criteria applied to a first key; comparing a number of consecutive sorted records to each other, wherein the number is less than a number of records in said database and identifying a first group of duplicate records; storing the identity of the first group; sorting the records according to a criteria applied to a second key; comparing a number of consecutive sorted records to each other, wherein the number is less than a number of records in said database and identifying a second group of duplicate records; storing the identity of the second group; and subjecting the union of the first and second groups to transitive closure.

    Method and apparatus for imaging, image processing and data compression
merge/purge techniques for document image databases
    57.
    发明授权
    Method and apparatus for imaging, image processing and data compression merge/purge techniques for document image databases 失效
    用于文件图像数据库的成像,图像处理和数据压缩合并/清除技术的方法和装置

    公开(公告)号:US5668897A

    公开(公告)日:1997-09-16

    申请号:US488333

    申请日:1995-06-07

    IPC分类号: G06F17/30 G06K9/00 G06K9/36

    摘要: A method for processing an image, consisting of a foreground and a background, to produce a highly compressed and accurate representation of the image, including the steps of scanning the image to create a digital image of the image, comparing the digital image against a codebook of stored digital images; matching the digital image with one of the stored digital images of the codebook; producing an index code identifying the background of the stored digital image as having matched the digital image; subtracting the stored digital image from the digital image to produce a second digital image representing the foreground of the stored digital image; and storing the second digital image with the index code. Techniques are also provided to enable merge/purge of the database(s) thereby created.

    摘要翻译: 一种用于处理由前景和背景组成的图像以产生图像的高度压缩和精确表示的方法,包括扫描图像以创建图像的数字图像的步骤,将数字图像与码本进行比较 存储的数字图像; 将数字图像与码本的所存储的数字图像之一进行匹配; 产生将所存储的数字图像的背景识别为与数字图像相匹配的索引码; 从数字图像中减去所存储的数字图像,以产生表示所存储的数字图像的前景的第二数字图像; 并存储具有索引码的第二数字图像。 还提供了技术来使得能够合并/清除由此创建的数据库。