Method and system for fast, generic, online and offline, multi-source text analysis and visualization
    1.
    发明授权
    Method and system for fast, generic, online and offline, multi-source text analysis and visualization 有权
    快速,通用,在线和离线,多源文本分析和可视化的方法和系统

    公开(公告)号:US08103682B2

    公开(公告)日:2012-01-24

    申请号:US12862242

    申请日:2010-08-24

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30713

    摘要: Methods and systems for text data analysis and visualization enable a user to specify a set of text data sources and visualize the content of the text data sources in an overview of salient features in the form of a network of words. A user may focus on one or more words to provide a visualization of connections specific to the focused word(s). The visualization may include clustering of relevant concepts within the network of words. Upon selection of a word, the context thereof, e.g., links to articles where the word appears, may be provided to the user. Analyzing may include textual statistical correlation models for assigning weights to words and links between words. Displaying the network of words may include a force-based network layout algorithm. Extracting clusters for display may include identifying “communities of words” as if the network of words was a social network.

    摘要翻译: 用于文本数据分析和可视化的方法和系统使得用户能够指定一组文本数据源,并以单词形式的显着特征的概述来可视化文本数据源的内容。 用户可以专注于一个或多个单词以提供特定于聚焦单词的连接的可视化。 可视化可以包括在网络中的相关概念的聚类。 在选择一个单词之后,可以将其上下文(例如,出现该单词的文章的链接)提供给用户。 分析可以包括用于为单词和单词之间的链接分配权重的文本统计相关模型。 显示单词网络可能包括基于力的网络布局算法。 提取用于显示的集群可以包括识别“单词的社区”,就好像网络的网络是社交网络一样。

    Method and system for fast, generic, online and offline, multi-source text analysis and visualization
    2.
    发明授权
    Method and system for fast, generic, online and offline, multi-source text analysis and visualization 有权
    快速,通用,在线和离线,多源文本分析和可视化的方法和系统

    公开(公告)号:US07792816B2

    公开(公告)日:2010-09-07

    申请号:US12023693

    申请日:2008-01-31

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30713

    摘要: Methods and systems for text data analysis and visualization enable a user to specify a set of text data sources and visualize the content of the text data sources in an overview of salient features in the form of a network of words. A user may focus on one or more words to provide a visualization of connections specific to the focused word(s). The visualization may include clustering of relevant concepts within the network of words. Upon selection of a word, the context thereof, e.g., links to articles where the word appears, may be provided to the user. Analyzing may include textual statistical correlation models for assigning weights to words and links between words. Displaying the network of words may include a force-based network layout algorithm. Extracting clusters for display may include identifying “communities of words” as if the network of words was a social network.

    摘要翻译: 用于文本数据分析和可视化的方法和系统使得用户能够指定一组文本数据源,并以单词形式的显着特征的概述来可视化文本数据源的内容。 用户可以专注于一个或多个单词以提供特定于聚焦单词的连接的可视化。 可视化可以包括在网络中的相关概念的聚类。 在选择一个单词之后,可以将其上下文(例如,出现该单词的文章的链接)提供给用户。 分析可以包括用于为单词和单词之间的链接分配权重的文本统计相关模型。 显示单词网络可能包括基于力的网络布局算法。 提取用于显示的集群可以包括识别“单词的社区”,就好像网络的网络是社交网络一样。

    Method and System for Fast, Generic, Online and Offline, Multi-Source Text Analysis and Visualization
    3.
    发明申请
    Method and System for Fast, Generic, Online and Offline, Multi-Source Text Analysis and Visualization 有权
    快速,通用,在线和离线的方法和系统,多源文本分析和可视化

    公开(公告)号:US20110047455A1

    公开(公告)日:2011-02-24

    申请号:US12862242

    申请日:2010-08-24

    IPC分类号: G06F17/21

    CPC分类号: G06F17/30713

    摘要: Methods and systems for text data analysis and visualization enable a user to specify a set of text data sources and visualize the content of the text data sources in an overview of salient features in the form of a network of words. A user may focus on one or more words to provide a visualization of connections specific to the focused word(s). The visualization may include clustering of relevant concepts within the network of words. Upon selection of a word, the context thereof, e.g., links to articles where the word appears, may be provided to the user. Analyzing may include textual statistical correlation models for assigning weights to words and links between words. Displaying the network of words may include a force-based network layout algorithm. Extracting clusters for display may include identifying “communities of words” as if the network of words was a social network.

    摘要翻译: 用于文本数据分析和可视化的方法和系统使得用户能够指定一组文本数据源,并以单词形式的显着特征的概述来可视化文本数据源的内容。 用户可以专注于一个或多个单词以提供特定于聚焦单词的连接的可视化。 可视化可以包括在网络中的相关概念的聚类。 在选择一个单词之后,可以将其上下文(例如,出现该单词的文章的链接)提供给用户。 分析可以包括用于为单词和单词之间的链接分配权重的文本统计相关模型。 显示单词网络可能包括基于力的网络布局算法。 提取用于显示的集群可以包括识别“单词的社区”,就好像网络的网络是社交网络一样。

    METHOD AND SYSTEM FOR FAST, GENERIC, ONLINE AND OFFLINE, MULTI-SOURCE TEXT ANALYSIS AND VISUALIZATION
    4.
    发明申请
    METHOD AND SYSTEM FOR FAST, GENERIC, ONLINE AND OFFLINE, MULTI-SOURCE TEXT ANALYSIS AND VISUALIZATION 有权
    快速,一般,在线和离线的方法和系统,多源文本分析和可视化

    公开(公告)号:US20090144617A1

    公开(公告)日:2009-06-04

    申请号:US12023693

    申请日:2008-01-31

    IPC分类号: G06F17/21

    CPC分类号: G06F17/30713

    摘要: Methods and systems for text data analysis and visualization enable a user to specify a set of text data sources and visualize the content of the text data sources in an overview of salient features in the form of a network of words. A user may focus on one or more words to provide a visualization of connections specific to the focused word(s). The visualization may include clustering of relevant concepts within the network of words. Upon selection of a word, the context thereof, e.g., links to articles where the word appears, may be provided to the user. Analyzing may include textual statistical correlation models for assigning weights to words and links between words. Displaying the network of words may include a force-based network layout algorithm. Extracting clusters for display may include identifying “communities of words” as if the network of words was a social network.

    摘要翻译: 用于文本数据分析和可视化的方法和系统使得用户能够指定一组文本数据源,并以单词形式的显着特征的概述来可视化文本数据源的内容。 用户可以专注于一个或多个单词以提供特定于聚焦单词的连接的可视化。 可视化可以包括在网络中的相关概念的聚类。 在选择一个单词之后,可以将其上下文(例如,出现该单词的文章的链接)提供给用户。 分析可以包括用于为单词和单词之间的链接分配权重的文本统计相关模型。 显示单词网络可能包括基于力的网络布局算法。 提取用于显示的集群可以包括识别“单词的社区”,就好像网络的网络是社交网络一样。