摘要:
An information analysis apparatus 1 that executes information analysis on a document set including documents to which time information is attached, the apparatus includes: a corresponding section selection unit 30 that mutually compares a plurality of time-series data generated and selects two or more sections that change corresponding to each of two or more sections of another time-series data from each time-series data; a feature extraction unit 40 that extracts features from the documents belonging to the selected two or more sections; a comparison unit 50 that acquires, from extracted features, an inter-feature distance of the selected one section and another section, and mutually compares the inter-feature distances of each of the time-series data; and a correlation degree calculation unit 70 that calculates a degree of correlation between the document sets based on the comparison result.
摘要:
A document analysis device (1) comprises a common assessment information selection unit (90) and an event impact analysis unit (100). The common assessment information selection unit (90) identifies information that matches second assessment information that appears in event-related documents which include descriptions concerning a designated specific event, from among first assessment information that appears in documents for analysis which include descriptions relating to items for analysis, and classifies the information thus identified as common assessment information. The event impact analysis unit (100) counts the number of times the common assessment information appears in the documents that are generated prior to the event occurring, and the number of times the common assessment information appears in the documents for analysis that are generated subsequent to the event occurring, and derives an index that denotes the impact of the specific event upon the documents for analysis, on the basis of the results of the counts thereupon.
摘要:
Described are a reputation analysis device, reputation analysis method, and reputation analysis-use program capable of suitably analyzing temporal changes in reputation for an object indicated by a keyword. The disclosed reputation analysis device is provided with a voluntary activity description extraction means for extracting descriptions representing voluntary activity related to an object indicated by a keyword that has been input from within a plurality of documents; and a reputation chronological data estimation means for counting the number of occurrences of voluntary activity at each time point wherein the voluntary activity expressed by a description representing the voluntary activity related to the object has been performed, and estimating reputation chronological data for chronologically representing evaluations for the object by the agents of the voluntary activity.
摘要:
A document analysis device (1) comprises a common assessment information selection unit (90) and an event impact analysis unit (100). The common assessment information selection unit (90) identifies information that matches second assessment information that appears in event-related documents which include descriptions concerning a designated specific event, from among first assessment information that appears in documents for analysis which include descriptions relating to items for analysis, and classifies the information thus identified as common assessment information. The event impact analysis unit (100) counts the number of times the common assessment information appears in the documents that are generated prior to the event occurring, and the number of times the common assessment information appears in the documents for analysis that are generated subsequent to the event occurring, and derives an index that denotes the impact of the specific event upon the documents for analysis, on the basis of the results of the counts thereupon.
摘要:
Described are a reputation analysis device, reputation analysis method, and reputation analysis-use program capable of suitably analyzing temporal changes in reputation for an object indicated by a keyword. The disclosed reputation analysis device is provided with a voluntary activity description extraction means for extracting descriptions representing voluntary activity related to an object indicated by a keyword that has been input from within a plurality of documents; and a reputation chronological data estimation means for counting the number of occurrences of voluntary activity at each time point wherein the voluntary activity expressed by a description representing the voluntary activity related to the object has been performed, and estimating reputation chronological data for chronologically representing evaluations for the object by the agents of the voluntary activity.
摘要:
An information estimation apparatus 1 for estimating a transmission point in time of a document whose transmission point in time is not specified in a document set to be analyzed includes a structure analysis unit 3 configured to specify, from the document set, a document having a document structure in which a link relationship with another document is indicated in a table-of-contents manner, and extract the link relationship of documents included in the document set from the document structure of the specified document, a grouping unit 4 configured to set a group of documents using the specified document and the extracted link relationship, and an estimation unit 5 configured to estimate, based on the set group and a transmission point in time of a document that is included in the group and whose transmission point in time is specified, a transmission point in time of a document that is included in the group and whose transmission point in time is not specified.
摘要:
An information analysis device includes a correlation value calculation unit which specifies the number of appearances of links between one linguistic expression and other linguistic expression and an appearance time of each link based on link information. The correlation value calculation unit calculates a correlation value between the one linguistic expression and the other linguistic expression according to a degree that the link continuously appears by using the specified number of appearances of the link and the appearance time of each link.
摘要:
An information analysis device (1) uses a plurality of linguistic expressions as an analysis target, includes a link information generating unit (3) and a correlation value calculation unit (4). The link information generating unit (3) extracts time information included in each of a plurality of electronic documents including at least any one of the plurality of linguistic expressions and a relationship between the electronic documents in the plurality of electronic documents from the plurality of electronic documents, detects a link between one linguistic expression and another linguistic expression in the plurality of linguistic expressions and an appearance time of the link based on the extracted time information and the relationship between the electronic documents, and generates link information specifying the extracted link and the appearance time of the link. The correlation value calculation unit (4) specifies the number of appearances of links between the one linguistic expression and the other linguistic expression and an appearance time of each link based on the link information, and calculates a correlation value between the one linguistic expression and the other linguistic expression according to a degree that the link continuously appears by using the specified number of appearances of the link and the appearance time of each link.
摘要:
Disclosed is an information estimation device for estimating an appropriate issue time from a time representation described in a document without intervention of any operator; wherein an information estimation device (1) which is a device for estimating an issue time of a document to be estimated, includes a candidate generation unit (11) which extracts a time representation described in the document, and on the basis of the extracted time representation, generates a plurality of possible issue time candidates having possibilities corresponding to the issue time of the document; and an issue time estimation unit (12) for obtaining a temporal proximity, for each of the plurality of issue time candidates, between the issue time candidate and other issue time candidates, and on the basis of the obtained temporal proximity, estimating the issue time of the document.
摘要:
Disclosed is an information estimation device for estimating an appropriate issue time from a time representation described in a document without intervention of any operator; wherein an information estimation device (1) which is a device for estimating an issue time of a document to be estimated, includes a candidate generation unit (11) which extracts a time representation described in the document, and on the basis of the extracted time representation, generates a plurality of possible issue time candidates having possibilities corresponding to the issue time of the document; and an issue time estimation unit (12) for obtaining a temporal proximity, for each of the plurality of issue time candidates, between the issue time candidate and other issue time candidates, and on the basis of the obtained temporal proximity, estimating the issue time of the document.