Multi-faceted visualization of rich text corpora
    1.
    Type: Granted invention patent
    Legal status: In force

    Publication No.: US09390194B2

    Publication date: 2016-07-12

    Application No.: US12872794

    Filing date: 2010-08-31

    IPC classification: G06F17/30

    CPC classification: G06F17/30941 G06F17/30716

    Abstract: Methods and apparatus are provided for multi-faceted visualization of rich text corpora. A data set comprising a plurality of entities, facets and relations is visualized by generating a visualization of a plurality of the facets in the data set, wherein the visualization indicates connections along the plurality of the facets in a single view using multi-faceted edges. The entities are instances of a particular concept, the facets are classes of entities and the relations are connections between pairs of the entities. A compound node comprises a representation of a primary entity, surrounded by representations of one or more secondary entities connected by one or more external relations. The internal relations can be represented as edges connecting two facet nodes from different compound nodes and a number of crossings of the edges can be reduced by adjusting a position order of facet nodes. The compound nodes can optionally be rotated based on, for example, a global spring force model to reduce an average length of one or more of the edges and/or to allow edge bundling.
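
The abstract above mentions reducing edge crossings by adjusting the position order of facet nodes. The following is a minimal sketch of that idea, not the patented method: it assumes the facet nodes of two compound nodes can be treated as two facing columns, counts crossings for a given order, and picks the best order by brute force. The function names and node labels are hypothetical.

```python
from itertools import permutations


def count_crossings(left_order, right_order, edges):
    """Count pairwise crossings of edges drawn between two facing columns."""
    lpos = {node: i for i, node in enumerate(left_order)}
    rpos = {node: i for i, node in enumerate(right_order)}
    crossings = 0
    for i in range(len(edges)):
        for j in range(i + 1, len(edges)):
            (a, b), (c, d) = edges[i], edges[j]
            # Two edges cross when their endpoints appear in opposite
            # relative order on the two sides.
            if (lpos[a] - lpos[c]) * (rpos[b] - rpos[d]) < 0:
                crossings += 1
    return crossings


def best_right_order(left_order, right_nodes, edges):
    """Try every order of the right-hand facet nodes and keep the one with
    the fewest crossings (only practical for a handful of facets)."""
    return min(
        permutations(right_nodes),
        key=lambda order: count_crossings(left_order, order, edges),
    )


# Hypothetical facet nodes of two compound nodes A and B.
left = ["a_person", "a_place", "a_time"]
right = ["b_time", "b_place", "b_person"]
edges = [("a_person", "b_person"), ("a_place", "b_place"), ("a_time", "b_time")]

print(count_crossings(left, right, edges))   # → 3
print(best_right_order(left, right, edges))  # → ('b_person', 'b_place', 'b_time')
```

Brute force is used here only because a compound node is assumed to carry few facet nodes; a heuristic would be needed for larger orders.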

    Multi-Faceted Visualization of Rich Text Corpora
    2.
    Type: Invention patent application
    Legal status: In force

    Publication No.: US20120054226A1

    Publication date: 2012-03-01

    Application No.: US12872794

    Filing date: 2010-08-31

    IPC classification: G06F3/048 G06F17/30

    CPC classification: G06F17/30941 G06F17/30716

    Abstract: Methods and apparatus are provided for multi-faceted visualization of rich text corpora. A data set comprising a plurality of entities, facets and relations is visualized by generating a visualization of a plurality of the facets in the data set, wherein the visualization indicates connections along the plurality of the facets in a single view using multi-faceted edges. The entities are instances of a particular concept, the facets are classes of entities and the relations are connections between pairs of the entities. A compound node comprises a representation of a primary entity, surrounded by representations of one or more secondary entities connected by one or more external relations. The internal relations can be represented as edges connecting two facet nodes from different compound nodes and a number of crossings of the edges can be reduced by adjusting a position order of facet nodes. The compound nodes can optionally be rotated based on, for example, a global spring force model to reduce an average length of one or more of the edges and/or to allow edge bundling.

    Visual analysis of multidimensional clusters
    3.
    Type: Granted invention patent
    Legal status: In force

    Publication No.: US09342579B2

    Publication date: 2016-05-17

    Application No.: US13149132

    Filing date: 2011-05-31

    IPC classification: G06F17/30

    CPC classification: G06F17/30601

    Abstract: Visualization techniques are provided for a clustered multidimensional dataset. A data set is visualized by obtaining a clustering of a multidimensional dataset comprising a plurality of entities, wherein the entities are instances of a particular concept and wherein each entity comprises a plurality of features; and generating an icon for at least one of the entities, the icon having a plurality of regions, wherein each region corresponds to one of the features of the at least one entity, and wherein a size of each region is based on a value of the corresponding feature. Each icon can convey statistical measures. A stabilized Voronoi-based icon layout algorithm is optionally employed. Icons can be embedded in a visualization of the multidimensional dataset. A hierarchical encoding scheme can be employed to encode a data cluster into the icon, such as a hierarchy of cluster, feature type and entity.
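
The abstract states that the size of each icon region is based on the value of the corresponding feature. As a rough illustration, the sketch below assumes a simple proportional rule, where each region receives a share of the icon area equal to its feature's share of the total. The abstract does not specify the mapping, and the function name and feature names are hypothetical.

```python
def region_sizes(features, icon_area=1.0):
    """Split an icon's area among features in proportion to their values.

    Proportional sizing is an assumption made for this sketch; the source
    only says that region size is "based on" the feature value.
    """
    total = sum(features.values())
    if total <= 0:
        raise ValueError("feature values must sum to a positive number")
    return {name: icon_area * value / total for name, value in features.items()}


# Hypothetical entity with three non-negative features.
entity = {"age": 2.0, "income": 1.0, "visits": 1.0}
print(region_sizes(entity, icon_area=100.0))
# → {'age': 50.0, 'income': 25.0, 'visits': 25.0}
```

The resulting areas could then be handed to a layout step, such as the Voronoi-based one the abstract mentions, which is outside the scope of this sketch.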

    Multifaceted Visualization for Topic Exploration
    4.
    Type: Invention patent application
    Legal status: Under examination (published)

    Publication No.: US20120290988A1

    Publication date: 2012-11-15

    Application No.: US13106207

    Filing date: 2011-05-12

    IPC classification: G06F3/048

    CPC classification: G06F16/26

    Abstract: A multifaceted visualization technique is provided for visually exploring topics in multi-relational data. A data set is visualized by obtaining the data set comprising a plurality of entities, facets and relations, wherein the entities are instances of a particular concept, the facets are classes of entities and the relations are connections between pairs of the entities; obtaining a selection of one of the facets as a topic facet, wherein entities in the topic facet are topic entities, wherein facets in the plurality of facets other than the topic facet are keyword facets; generating a visualization comprising the topic entities rendered as nodes arranged within a central region; and generating one or more surrounding shapes around the central region, wherein each of the surrounding shapes corresponds to one of the keyword facets, wherein entities within the corresponding keyword facet of a given one of the surrounding shapes are rendered as keyword entities.
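
The layout described above places topic entities in a central region and each keyword facet in its own surrounding shape. The sketch below illustrates one possible reading, assuming the surrounding shapes are concentric rings and that entities are spread evenly around a circle. The ring geometry, function names, and sample data are all assumptions, not details from the abstract.

```python
import math


def place_on_circle(names, radius):
    """Spread names evenly around a circle of the given radius."""
    n = len(names)
    return {
        name: (
            radius * math.cos(2 * math.pi * i / n),
            radius * math.sin(2 * math.pi * i / n),
        )
        for i, name in enumerate(names)
    }


def layout(data, topic_facet, central_radius=1.0, ring_gap=1.0):
    """Return (topic positions, {keyword facet: keyword positions}).

    `data` maps each facet name to its list of entities. Topic entities sit
    inside the central region; every other facet gets its own ring.
    """
    topics = place_on_circle(data[topic_facet], 0.5 * central_radius)
    rings = {}
    keyword_facets = [facet for facet in data if facet != topic_facet]
    for k, facet in enumerate(keyword_facets, start=1):
        rings[facet] = place_on_circle(data[facet], central_radius + k * ring_gap)
    return topics, rings


# Hypothetical multi-relational data set with one topic facet.
data = {"topic": ["t1", "t2", "t3"], "author": ["a1", "a2"], "venue": ["v1"]}
topics, rings = layout(data, topic_facet="topic")
print(sorted(rings))  # → ['author', 'venue']
```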

    Visual Analysis of Multidimensional Clusters
    5.
    Type: Invention patent application
    Legal status: In force

    Publication No.: US20120311496A1

    Publication date: 2012-12-06

    Application No.: US13149132

    Filing date: 2011-05-31

    IPC classification: G06F3/048

    CPC classification: G06F17/30601

    Abstract: Visualization techniques are provided for a clustered multidimensional dataset. A data set is visualized by obtaining a clustering of a multidimensional dataset comprising a plurality of entities, wherein the entities are instances of a particular concept and wherein each entity comprises a plurality of features; and generating an icon for at least one of the entities, the icon having a plurality of regions, wherein each region corresponds to one of the features of the at least one entity, and wherein a size of each region is based on a value of the corresponding feature. Each icon can convey statistical measures. A stabilized Voronoi-based icon layout algorithm is optionally employed. Icons can be embedded in a visualization of the multidimensional dataset. A hierarchical encoding scheme can be employed to encode a data cluster into the icon, such as a hierarchy of cluster, feature type and entity.

    METHOD AND SYSTEM FOR VISUALIZATION OF DATA SET
    6.
    Type: Invention patent application
    Legal status: In force

    Publication No.: US20140082024A1

    Publication date: 2014-03-20

    Application No.: US12917469

    Filing date: 2010-11-01

    IPC classification: G06F17/30

    CPC classification: G06F17/30601 G06F17/30572

    Abstract: The invention provides a method and system for visualization of a data set. The method comprises: dividing the data set into a plurality of information layers based on different information dimensions; and visually processing the plurality of information layers based on different information dimensions, respectively, in order to present respective views of the plurality of information layers. In the present invention, by visualizing the data set through presenting different overviews of the data set from different information dimensions, respectively, the presentation of comprehensive information of the data set to a data set analyst is ensured while distortion of presented contents as well as visual clutter are prevented.
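
The first step in the abstract, dividing a data set into information layers by dimension, can be sketched as follows. This is only one plausible interpretation: it assumes each record is a dictionary, that an "information dimension" is a record field, and that a layer groups record ids by their value along that field. None of these names or structures come from the source.

```python
from collections import defaultdict


def split_into_layers(records, dimensions, key="id"):
    """Build one layer per dimension, grouping record ids by field value."""
    layers = {}
    for dim in dimensions:
        groups = defaultdict(list)
        for record in records:
            groups[record[dim]].append(record[key])
        layers[dim] = dict(groups)
    return layers


# Hypothetical records with three information dimensions.
records = [
    {"id": 1, "time": "2010", "place": "NY", "topic": "viz"},
    {"id": 2, "time": "2010", "place": "SF", "topic": "viz"},
    {"id": 3, "time": "2011", "place": "NY", "topic": "graphs"},
]
layers = split_into_layers(records, ["time", "place", "topic"])
print(layers["time"])  # → {'2010': [1, 2], '2011': [3]}
```

Each layer could then be rendered as its own view, so that no single view has to encode every dimension at once.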

    Method and system for visualization of data set
    7.
    Type: Granted invention patent
    Legal status: In force

    Publication No.: US09087117B2

    Publication date: 2015-07-21

    Application No.: US12917469

    Filing date: 2010-11-01

    IPC classification: G06F17/30

    CPC classification: G06F17/30601 G06F17/30572

    Abstract: The invention provides a method and system for visualization of a data set. The method comprises: dividing the data set into a plurality of information layers based on different information dimensions; and visually processing the plurality of information layers based on different information dimensions, respectively, in order to present respective views of the plurality of information layers. In the present invention, by visualizing the data set through presenting different overviews of the data set from different information dimensions, respectively, the presentation of comprehensive information of the data set to a data set analyst is ensured while distortion of presented contents as well as visual clutter are prevented.

    Method and system for managing and querying large graphs
    8.
    Type: Granted invention patent
    Legal status: Lapsed

    Publication No.: US08645339B2

    Publication date: 2014-02-04

    Application No.: US13294598

    Filing date: 2011-11-11

    IPC classification: G06F17/30 G06F15/16

    CPC classification: G06F17/30533 G06F17/30958

    Abstract: A method, system and computer program product for managing and querying a graph. The method includes the steps of: receiving a graph; partitioning the graph into homogeneous blocks; compressing the homogeneous blocks; and storing the compressed homogeneous blocks in files, where at least one of the steps is carried out using a computer device.
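
The four steps in the abstract (receive, partition, compress, store) can be illustrated with a small sketch. It assumes the graph arrives as a 0/1 adjacency matrix whose nodes are already ordered so that fixed-size square blocks come out nearly all-ones or all-zeros; how the patent actually forms homogeneous blocks is not stated here. The file naming scheme and function names are hypothetical, and `zlib` stands in for whatever compressor the method uses.

```python
import os
import tempfile
import zlib


def partition(adj, block):
    """Cut an adjacency matrix into block x block tiles keyed by (row, col)."""
    n = len(adj)
    tiles = {}
    for r in range(0, n, block):
        for c in range(0, n, block):
            tiles[(r // block, c // block)] = [
                row[c:c + block] for row in adj[r:r + block]
            ]
    return tiles


def store(tiles, directory):
    """Compress each tile with zlib and write it to its own file."""
    for (i, j), rows in tiles.items():
        raw = bytes(bit for row in rows for bit in row)
        with open(os.path.join(directory, f"block_{i}_{j}.z"), "wb") as fh:
            fh.write(zlib.compress(raw))


def has_edge(directory, block, n, u, v):
    """Answer an edge query by reading only the tile that covers (u, v)."""
    i, j = u // block, v // block
    with open(os.path.join(directory, f"block_{i}_{j}.z"), "rb") as fh:
        raw = zlib.decompress(fh.read())
    width = min(block, n - j * block)  # the last column of tiles may be narrower
    return raw[(u % block) * width + (v % block)] == 1


# Hypothetical graph with two tightly connected groups: {0, 1} and {2, 3}.
adj = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]

with tempfile.TemporaryDirectory() as directory:
    store(partition(adj, block=2), directory)
    print(has_edge(directory, 2, len(adj), 0, 1))  # → True
    print(has_edge(directory, 2, len(adj), 0, 2))  # → False
```

Homogeneous tiles compress well because they consist of long runs of the same byte, and a query touches one file rather than the whole graph.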

    ANALYZING PARALLEL TOPICS FROM CORRELATED DOCUMENTS
    10.
    Type: Invention patent application
    Legal status: Under examination (published)

    Publication No.: US20110202484A1

    Publication date: 2011-08-18

    Application No.: US12708053

    Filing date: 2010-02-18

    IPC classification: G06F15/18 G06N5/02

    CPC classification: G06N7/005

    Abstract: Access is obtained to a parallel corpus including a problem corpus and a solution corpus. A first plurality of topics are mined from the problem corpus and a second plurality of topics are mined from the solution corpus. A transition probability from the first plurality of topics to the second plurality of topics is determined, to identify a most appropriate one of the topics from the solution corpus for a given one of the topics from the problem corpus.
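
To make the transition-probability step concrete, the sketch below assumes topic mining has already been done and that each correlated document pair is labeled with one problem topic and one solution topic. It estimates P(solution topic | problem topic) from co-occurrence counts and picks the most probable solution topic. The counting estimator, function names, and sample topics are assumptions; the abstract does not say how the probability is computed.

```python
from collections import Counter, defaultdict


def transition_probabilities(pairs):
    """Estimate P(solution topic | problem topic) from labeled document pairs."""
    counts = defaultdict(Counter)
    for problem_topic, solution_topic in pairs:
        counts[problem_topic][solution_topic] += 1
    return {
        problem: {sol: c / sum(row.values()) for sol, c in row.items()}
        for problem, row in counts.items()
    }


def most_appropriate(probs, problem_topic):
    """Return the solution topic with the highest transition probability."""
    row = probs[problem_topic]
    return max(row, key=row.get)


# Hypothetical (problem topic, solution topic) labels for four document pairs.
pairs = [
    ("disk full", "clean logs"),
    ("disk full", "clean logs"),
    ("disk full", "add storage"),
    ("login fails", "reset password"),
]
probs = transition_probabilities(pairs)
print(most_appropriate(probs, "disk full"))  # → clean logs
```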
