摘要:
Methods and systems for rapid automatic keyword extraction for information retrieval and analysis. Embodiments can include parsing words in an individual document by delimiters, stop words, or both in order to identify candidate keywords. Word scores for each word within the candidate keywords are then calculated based on a function of co-occurrence degree, co-occurrence frequency, or both. Based on a function of the word scores for words within the candidate keyword, a keyword score is calculated for each of the candidate keywords. A portion of the candidate keywords are then extracted as keywords based, at least in part, on the candidate keywords having the highest keyword scores.
摘要:
The visual analytic system enables information retrieval within large text collections. Typically, users have to directly and explicitly query information to retrieve it. With this system and process, the reasoning of the user is inferred from the user interaction they perform in a visual analytic tool, and the appropriate information to query, process, and visualize is systematically determined.
摘要:
A system and method for extracting and converting data from one or more information sources into a common format. The method comprises receiving the information sources, receiving at least one pattern descriptor selected from a graphical user interface, and receiving one or more templates with each templates having at least one pattern descriptor. The method then proceeds to apply the one or more templates to the information sources. The method generates the plurality of data in a common format by parsing the information sources with the templates. The method stores the data in the common format.
摘要:
Methods and systems for rapid automatic keyword extraction for information retrieval and analysis. Embodiments can include parsing words in an individual document by delimiters, stop words, or both in order to identify candidate keywords. Word scores for each word within the candidate keywords are then calculated based on a function of co-occurrence degree, co-occurrence frequency, or both. Based on a function of the word scores for words within the candidate keyword, a keyword score is calculated for each of the candidate keywords. A portion of the candidate keywords are then extracted as keywords based, at least in part, on the candidate keywords having the highest keyword scores.
摘要:
The visual analytic system enables information retrieval within large text collections. Typically, users have to directly and explicitly query information to retrieve it. With this system and process, the reasoning of the user is inferred from the user interaction they perform in a visual analytic tool, and the appropriate information to query, process, and visualize is systematically determined.