Systems and methods for improving concept landscape visualizations as a data analysis tool
    2.
    发明授权
    Systems and methods for improving concept landscape visualizations as a data analysis tool 有权
    将概念景观可视化提升为数据分析工具的系统和方法

    公开(公告)号:US06940509B1

    公开(公告)日:2005-09-06

    申请号:US09675515

    申请日:2000-09-29

    IPC分类号: G06T11/20 G06T11/00

    CPC分类号: G06T11/206

    摘要: Systems and methods provide several enhancements for the viewing, analysis, and generation of landscape views in a data analysis system, including: allowing a user to select from multiple methods to generate a landscape view, providing labels for peaks of a landscape, enabling the user to replace labels displayed on the landscape view, enabling a landscape view to be recalculated based on the replacement labels, and allowing a user to switch or morph between two landscape views generated by different methods. Such methods or systems generate graphical landscape map visualizations from a set of data records.

    摘要翻译: 系统和方法为数据分析系统中的景观视图的查看,分析和生成提供了几个增强功能,包括:允许用户从多种方法中选择生成横向视图,为景观的峰值提供标签,使用户 以替换横向视图中显示的标签,可以根据替换标签重新计算横向视图,并允许用户在由不同方法生成的两个横向视图之间进行切换或变形。 这样的方法或系统从一组数据记录生成图形横向地图可视化。

    Data import system for data analysis system
    4.
    发明授权
    Data import system for data analysis system 有权
    数据导入系统用于数据分析系统

    公开(公告)号:US06718336B1

    公开(公告)日:2004-04-06

    申请号:US09672622

    申请日:2000-09-29

    IPC分类号: G06F1730

    摘要: A data import system enables access to data of multiple types from multiple data sources of different formats and provides an interface for importing data into a data analysis system. The interface enables a user to customize the formatting of the data as the data is being imported into a data analysis system. A user may select first user defined options for operating on a first data set received during a data importation process. An intermediate representation of the data set is generated based on the user first defined options. A user may specify second user defined options based on the intermediate representation during the data importation process. The second user defined options are processed to produce a final data representation of the data set to be used for analysis of the data. The intermediate representation may be a data table. The processing of a data set may include merging a first and second data set to produce the final data representation. The second user defined options may enable a user to select a basic operation for merging the data sets or to select a non-basic operation for merging the data sets. The basic operation may combine data sets in response to a user's selection of a first graphical interface control, and the non-basic operation may combine the data sets based on user selection of at least two graphical interface controls from a group of graphical interface controls.

    摘要翻译: 数据导入系统可以访问来自不同格式的多个数据源的多种类型的数据,并提供用于将数据导入数据分析系统的接口。 该界面使用户能够在将数据导入数据分析系统时自定义数据的格式。 用户可以选择用于在数据导入过程期间接收的第一数据集上操作的第一用户定义的选项。 基于用户首先定义的选项生成数据集的中间表示。 用户可以在数据导入过程期间基于中间表示来指定第二用户定义的选项。 处理第二个用户定义的选项以产生要用于数据分析的数据集的最终数据表示。 中间表示可以是数据表。 数据集的处理可以包括合并第一和第二数据集以产生最终数据表示。 第二用户定义的选项可以使得用户能够选择用于合并数据集的基本操作或者选择用于合并数据集的非基本操作。 基本操作可以响应于用户对第一图形界面控件的选择来组合数据集,并且非基本操作可以基于来自一组图形界面控件的至少两个图形界面控件的用户选择来组合数据集。

    System and method for use in text analysis of documents and records
    5.
    发明授权
    System and method for use in text analysis of documents and records 有权
    用于文件和记录文本分析的系统和方法

    公开(公告)号:US06665661B1

    公开(公告)日:2003-12-16

    申请号:US09672599

    申请日:2000-09-29

    IPC分类号: G06F1730

    摘要: Methods and systems are provided that enable text in various sections of data records to be separately catalogued, indexed, or vectorized for analysis in a text visualization and mining system. A text processing system receives a plurality of data records, where each data record has one or a plurality of attribute fields associated with the records. The attributes fields containing textual information are identified. The specific textual content of each attribute field is identified. An index is generated that associates the textual content contained in each attribute field with the attribute field containing the textual content. The index is operable for use in text processing. The plurality of data records may be located in a data table and the textual information may be contained within cells of the data table. In another aspect, a plurality of data records is received, where at least some of the data records contain text terms. A first method is applied to weight text terms of the data records in a first manner to aid in distinguishing records from each other in response to selection of the first method. A second method is applied to weight text terms of the data records in a second manner to aid in distinguishing records from each other in response to selection of the second method. A vector is generated to distinguish each of the data records based on the text terms weighted by either the first or second method.

    摘要翻译: 提供了方法和系统,使数据记录的各个部分的文本可以单独编目,索引或向量化,以便在文本可视化和挖掘系统中进行分析。 文本处理系统接收多个数据记录,其中每个数据记录具有与记录相关联的一个或多个属性字段。 标识包含文本信息的属性字段。 识别每个属性字段的特定文本内容。 生成将每个属性字段中包含的文本内容与包含文本内容的属性字段相关联的索引。 该索引可操作用于文本处理。 多个数据记录可以位于数据表中,并且文本信息可以包含在数据表的单元内。 在另一方面,接收多个数据记录,其中至少一些数据记录包含文本术语。 应用第一种方法以第一种方式对数据记录的文本术语进行加权,以帮助响应于第一种方法的选择来区分记录。 应用第二种方法以第二种方式对数据记录的文本术语进行加权,以帮助响应于第二种方法的选择来区分记录。 生成矢量以基于由第一或第二方法加权的文本项来区分每个数据记录。