摘要:
Methods and systems for text data analysis and visualization enable a user to specify a set of text data sources and visualize the content of the text data sources in an overview of salient features in the form of a network of words. A user may focus on one or more words to provide a visualization of connections specific to the focused word(s). The visualization may include clustering of relevant concepts within the network of words. Upon selection of a word, the context thereof, e.g., links to articles where the word appears, may be provided to the user. Analyzing may include textual statistical correlation models for assigning weights to words and links between words. Displaying the network of words may include a force-based network layout algorithm. Extracting clusters for display may include identifying “communities of words” as if the network of words was a social network.
摘要:
Methods and systems for text data analysis and visualization enable a user to specify a set of text data sources and visualize the content of the text data sources in an overview of salient features in the form of a network of words. A user may focus on one or more words to provide a visualization of connections specific to the focused word(s). The visualization may include clustering of relevant concepts within the network of words. Upon selection of a word, the context thereof, e.g., links to articles where the word appears, may be provided to the user. Analyzing may include textual statistical correlation models for assigning weights to words and links between words. Displaying the network of words may include a force-based network layout algorithm. Extracting clusters for display may include identifying “communities of words” as if the network of words was a social network.
摘要:
Methods and systems for text data analysis and visualization enable a user to specify a set of text data sources and visualize the content of the text data sources in an overview of salient features in the form of a network of words. A user may focus on one or more words to provide a visualization of connections specific to the focused word(s). The visualization may include clustering of relevant concepts within the network of words. Upon selection of a word, the context thereof, e.g., links to articles where the word appears, may be provided to the user. Analyzing may include textual statistical correlation models for assigning weights to words and links between words. Displaying the network of words may include a force-based network layout algorithm. Extracting clusters for display may include identifying “communities of words” as if the network of words was a social network.
摘要:
Methods and systems for text data analysis and visualization enable a user to specify a set of text data sources and visualize the content of the text data sources in an overview of salient features in the form of a network of words. A user may focus on one or more words to provide a visualization of connections specific to the focused word(s). The visualization may include clustering of relevant concepts within the network of words. Upon selection of a word, the context thereof, e.g., links to articles where the word appears, may be provided to the user. Analyzing may include textual statistical correlation models for assigning weights to words and links between words. Displaying the network of words may include a force-based network layout algorithm. Extracting clusters for display may include identifying “communities of words” as if the network of words was a social network.
摘要:
Methods and apparatus include presenting an initial set of names to a user. The user selects a set of names from those presented. An Interactive Evolutionary Algorithm (IEA) extracts features of each selected name from a database of names and features to form a feature set. The IEA forms a set of match features that are chosen from the feature set according to a priority function and/or weighting of the features, either of which may vary in succeeding iterations. The IEA searches the database to obtain a candidate set of names, where each name has features matching the match features. One or more names is chosen from the candidate set and added into a presentation set of names. The IEA may repeat the formation of the match features, candidate set, and selection of one or more names from the candidate set until the new presentation set is complete.
摘要:
Methods and apparatus include presenting an initial set of names to a user. The user selects a set of names from those presented. An Interactive Evolutionary Algorithm (IEA) extracts features of each selected name from a database of names and features to form a feature set. The IEA forms a set of match features that are chosen from the feature set according to a priority function and/or weighting of the features, either of which may vary in succeeding iterations. The IEA searches the database to obtain a candidate set of names, where each name has features matching the match features. One or more names is chosen from the candidate set and added into a presentation set of names. The IEA may repeat the formation of the match features, candidate set, and selection of one or more names from the candidate set until the new presentation set is complete.