Abstract:
A document analysis device (1) comprises a common assessment information selection unit (90) and an event impact analysis unit (100). From among first assessment information appearing in documents for analysis, which include descriptions relating to items for analysis, the common assessment information selection unit (90) identifies the information that matches second assessment information appearing in event-related documents, which include descriptions concerning a designated specific event, and classifies the identified information as common assessment information. The event impact analysis unit (100) counts the number of times the common assessment information appears in the documents for analysis generated before the event occurred and the number of times it appears in the documents for analysis generated after the event occurred, and derives from these counts an index denoting the impact of the specific event on the documents for analysis.
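The selection and counting steps in this abstract can be sketched as follows. This is a minimal illustration, not the patented method: the function name `event_impact_index`, the substring-based counting, and the add-one smoothed after/before ratio used as the index are all assumptions, since the abstract does not state how the index is computed.

```python
from datetime import date


def event_impact_index(first_terms, second_terms, analysis_docs, event_date):
    """Return (common_terms, before_count, after_count, index).

    first_terms   -- assessment expressions found in the documents for analysis
    second_terms  -- assessment expressions found in the event-related documents
    analysis_docs -- iterable of (creation_date, text) pairs
    event_date    -- date on which the specific event occurred
    """
    # Common assessment information: expressions present in both sets.
    common = set(first_terms) & set(second_terms)

    before = after = 0
    for created, text in analysis_docs:
        # Count every occurrence of every common expression in this document.
        hits = sum(text.count(term) for term in common)
        if created < event_date:
            before += hits
        else:
            after += hits

    # Hypothetical index (assumption): add-one smoothed ratio of the counts.
    index = (after + 1) / (before + 1)
    return common, before, after, index


docs = [
    (date(2020, 1, 10), "battery life is poor"),
    (date(2020, 3, 5), "overheating again, overheating every day"),
]
common, before, after, index = event_impact_index(
    {"overheating", "poor"}, {"overheating", "recall"}, docs, date(2020, 2, 1)
)
```

An index above 1 in this sketch means the common expressions appear more often after the event than before it.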
Abstract:
An information analysis apparatus 1 executes information analysis on a document set including documents to which time information is attached. The apparatus includes: a corresponding section selection unit 30 that mutually compares a plurality of generated time-series data and selects, from each time-series data, two or more sections that change in correspondence with two or more sections of another time-series data; a feature extraction unit 40 that extracts features from the documents belonging to the selected two or more sections; a comparison unit 50 that acquires, from the extracted features, an inter-feature distance between one selected section and another, and mutually compares the inter-feature distances of each of the time-series data; and a correlation degree calculation unit 70 that calculates a degree of correlation between the document sets based on the comparison result.
Abstract:
Described are a reputation analysis device, reputation analysis method, and reputation analysis-use program capable of suitably analyzing temporal changes in reputation for an object indicated by a keyword. The disclosed reputation analysis device is provided with a voluntary activity description extraction means for extracting descriptions representing voluntary activity related to an object indicated by a keyword that has been input from within a plurality of documents; and a reputation chronological data estimation means for counting the number of occurrences of voluntary activity at each time point wherein the voluntary activity expressed by a description representing the voluntary activity related to the object has been performed, and estimating reputation chronological data for chronologically representing evaluations for the object by the agents of the voluntary activity.
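The extraction and counting steps above can be sketched as follows. The sketch rests on loud assumptions: the cue-word list `ACTIVITY_WORDS`, the simple substring matching, and the use of raw per-period counts as the reputation time series are all hypothetical stand-ins, because the abstract does not say how descriptions are extracted or how the series is estimated from the counts.

```python
from collections import Counter

# Hypothetical cue words marking a voluntary activity (assumption).
ACTIVITY_WORDS = ("bought", "recommended", "returned")


def reputation_series(keyword, documents):
    """Count voluntary-activity descriptions about `keyword` per time point.

    documents -- iterable of (time_point, text) pairs, where time_point is
                 any sortable label such as "2021-01".
    Returns a list of (time_point, count) pairs in chronological order.
    """
    counts = Counter()
    needle = keyword.lower()
    for time_point, text in documents:
        lowered = text.lower()
        # A document counts when it mentions the object and a cue word.
        if needle in lowered and any(w in lowered for w in ACTIVITY_WORDS):
            counts[time_point] += 1
    return sorted(counts.items())


docs = [
    ("2021-01", "I bought the X100 phone"),
    ("2021-01", "The X100 was announced"),
    ("2021-02", "I recommended the X100 to a friend"),
    ("2021-02", "Bought an X100 yesterday"),
]
series = reputation_series("X100", docs)
```

Here the announcement is ignored because it describes no voluntary activity by a user, so the series rises from one description in the first period to two in the second.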
Abstract:
A time-series document summarization device (201) outputs a summary sentence of a document-of-interest collection, which is the document collection to be summarized. The time-series document summarization device (201) comprises: a background topic word extraction part (20) configured to acquire a set of the document-of-interest collection and a document-of-interest topic word that is a feature word of the document-of-interest collection, together with a reference-use document collection that is a document collection different from the document-of-interest collection, and to extract, from the reference-use document collection, a background topic word representing a topic that forms the background of a topic described in the document-of-interest collection; and a representative character string extraction part (30) configured to extract, from among the character strings included in the document-of-interest collection, a representative character string including the document-of-interest topic word and the background topic word as a summary sentence of the document-of-interest collection.
Abstract:
Provided are a related expression generation section that accepts as input an evaluation object expression, which is a linguistic expression to be evaluated, and generates a linguistic expression related to the evaluation object expression as a related expression; and a credibility calculation section that acquires the evaluation object expression and the related expression from a plurality of electronic documents along with time information, and calculates credibility concerning the meaning of the evaluation object expression at a specific point in time by comparing the number of times the acquired evaluation object expression appears with the number of times the acquired related expression appears at the same time point.
Abstract:
A document analysis apparatus comprises: a feature expression acquisition unit that acquires a feature expression appearing during an attention period in an analysis object document collection; a document collection acquisition unit that acquires, from an analysis population including the analysis object document collection, a feature-expression-containing document (FECD) collection in which the feature expression appears; a context determination unit that, for every feature expression, specifies the analysis/FECDs, which are the documents in the FECD collection corresponding to analysis object documents, and specifies a context in which the feature expression appeared in multiple analysis/FECDs; a context comparison determination unit that specifies the non-analysis/FECDs, which are the documents in the FECD collection not corresponding to analysis object documents, and compares the context in which the feature expression appeared within them with the previously specified context; and a feature degree setting unit that performs processing such as assigning a feature degree to the feature expression based on the comparison.
Abstract:
An information analysis device includes a correlation value calculation unit that specifies, based on link information, the number of appearances of links between one linguistic expression and another linguistic expression and the appearance time of each link. Using the specified number of appearances and the appearance time of each link, the correlation value calculation unit calculates a correlation value between the two linguistic expressions according to the degree to which the link appears continuously.
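One way to turn "the degree to which the link appears continuously" into a number is sketched below. The scoring rule is an assumption made for illustration only: the abstract does not give a formula, and the function `continuity_correlation`, the integer period indexes, and the `num_periods` window length are hypothetical.

```python
def continuity_correlation(appearance_times, num_periods):
    """Score a link between two expressions by count and continuity.

    appearance_times -- integer period index of each link appearance
    num_periods      -- length of the observation window in periods

    Hypothetical rule (assumption): the number of appearances scaled by the
    fraction of the window covered by the longest run of consecutive periods
    that each contain at least one appearance.
    """
    if not appearance_times or num_periods <= 0:
        return 0.0

    periods = sorted(set(appearance_times))
    longest = run = 1
    for prev, cur in zip(periods, periods[1:]):
        # Extend the run when periods are adjacent, otherwise restart it.
        run = run + 1 if cur == prev + 1 else 1
        longest = max(longest, run)

    return len(appearance_times) * longest / num_periods


steady = continuity_correlation([0, 1, 2, 3], num_periods=4)
burst = continuity_correlation([2, 2, 2, 2], num_periods=4)
```

With the same four appearances, the steadily recurring link scores higher than the one-period burst, which is the behavior the abstract describes.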
Abstract:
Time-series data corresponding to an input linguistic expression to be analyzed is acquired; a relevant linguistic expression candidate that is highly relevant to the input linguistic expression is generated; time-series data corresponding to the generated candidate is acquired; the temporal correlation between the time-series data corresponding to the input linguistic expression and the time-series data corresponding to the candidate is analyzed; and a relevance level between the input linguistic expression and the generated candidate is calculated using the analysis result of the time-series data.
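The temporal correlation step can be illustrated with a short sketch. It assumes the Pearson correlation coefficient as the correlation measure and uses it directly as the relevance level; the abstract names neither, and the helper names `pearson` and `rank_candidates` are hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # A constant series carries no temporal signal.
    return cov / sqrt(var_x * var_y)


def rank_candidates(input_series, candidate_series):
    """Rank candidate expressions by correlation with the input series.

    candidate_series -- dict mapping candidate expression to its time series
    Returns (expression, relevance) pairs, most relevant first.
    """
    scores = {name: pearson(input_series, series)
              for name, series in candidate_series.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


ranked = rank_candidates(
    [1, 2, 3, 4],
    {"rises together": [2, 4, 6, 8], "moves opposite": [4, 3, 2, 1]},
)
```

A candidate whose frequency rises and falls with the input expression ranks first, while one that moves in the opposite direction ranks last.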