摘要:
A recommended information presentation apparatus, including a display unit which displays text data, an extraction unit which extracts keyword candidates from the text data, a storage unit which stores semantic attributes of the keyword candidates, semantic attribute rules which contain scoring criteria for semantic attributes, descriptive phrases describing the keyword candidates and descriptive phrase rules which contain scoring criteria for descriptive phrases. The scores of the keyword candidates are calculated by a selection unit based on the semantic attribute rules and descriptive phrase rules and the highest scoring keyword candidates are selected as keywords. The selected keyword is used to search an information database by a search unit which also receives the search results which are displayed on the display unit by a control unit as recommended information with regards to the text information.
摘要:
A web browsing purpose classification apparatus, including a display unit which displays a webpage and a document retrieval unit which retrieves document data from the displayed webpage. A keyword extraction knowledge unit stores knowledge necessary for keyword extraction. This knowledge is used by a keyword extraction unit to extract keywords from the document data. A webpage format determination knowledge unit stores knowledge necessary for the determination of webpage formats which is used by a webpage format determination unit to determine webpage formats. A web browsing history storage unit stores the keywords and webpage formats as web browsing history. A browsing purpose classification knowledge unit stores knowledge necessary for the classification of browsing purposes which is used by a browsing purpose classification unit to classify browsing purposes.
摘要:
A recommended information presentation apparatus, including a display unit which displays text data, an extraction unit which extracts keyword candidates from the text data, a storage unit which stores semantic attributes of the keyword candidates, semantic attribute rules which contain scoring criteria for semantic attributes, descriptive phrases describing the keyword candidates and descriptive phrase rules which contain scoring criteria for descriptive phrases. The scores of the keyword candidates are calculated by a selection unit based on the semantic attribute rules and descriptive phrase rules and the highest scoring keyword candidates are selected as keywords. The selected keyword is used to search an information database by a search unit which also receives the search results which are displayed on the display unit by a control unit as recommended information with regards to the text information.
摘要:
A web browsing purpose classification apparatus, including a display unit which displays a webpage and a document retrieval unit which retrieves document data from the displayed webpage. A keyword extraction knowledge unit stores knowledge necessary for keyword extraction. This knowledge is used by a keyword extraction unit to extract keywords from the document data. A webpage format determination knowledge unit stores knowledge necessary for the determination of webpage formats which is used by a webpage format determination unit to determine webpage formats. A web browsing history storage unit stores the keywords and webpage formats as web browsing history. A browsing purpose classification knowledge unit stores knowledge necessary for the classification of browsing purposes which is used by a browsing purpose classification unit to classify browsing purposes.
摘要:
A keyword input supporting apparatus includes a document acquisition unit that acquires a document having a plurality of components containing text data, a main component selection unit that selects a component having many characters in the text data as a main component, a part-of-speech analysis unit that analyzes the part-of-speech of the text data contained in the main component, and adds a semantic attribute to each of words of the text data, a specific name extraction unit that extracts as a specific name a word, having a predetermined semantic attribute or part of speech, from the words, a specific name storage that stores the specific name together with the corresponding semantic attribute, a keyword candidate classification unit that performs classification of the specific name from the storage as a keyword candidate based on the semantic attribute, and a keyword candidate presentation unit that presents the keyword candidate to a user.
摘要:
A keyword input supporting apparatus includes a document acquisition unit that acquires a document having a plurality of components containing text data, a main component selection unit that selects a component having many characters in the text data as a main component, a part-of-speech analysis unit that analyzes the part-of-speech of the text data contained in the main component, and adds a semantic attribute to each of words of the text data, a specific name extraction unit that extracts as a specific name a word, having a predetermined semantic attribute or part of speech, from the words, a specific name storage that stores the specific name together with the corresponding semantic attribute, a keyword candidate classification unit that performs classification of the specific name from the storage as a keyword candidate based on the semantic attribute, and a keyword candidate presentation unit that presents the keyword candidate to a user.
摘要:
According to one embodiment, a phrase similarity is reduced when a common genre characteristic word is included in a program of interest interested by a user and a similar phrase program including the same phrase. A genre similarity is increased when a common genre characteristic word is included in the program of interest and a similar genre program including the same genre as the program of interest. The similar phrase program is presented based on the phrase similarity, and the similar genre program is presented based on the genre similarity.
摘要:
According to one embodiment, a keyword presentation apparatus includes an extraction unit, a selection unit and a clustering unit. The extraction unit is configured to extract, as technical terms, morpheme strings, which are not defined in a general concept dictionary, from a document set. The selection unit is configured to evaluate relevancies between each of basic term candidates and the technical terms, and to preferentially select basic term candidates having high relevancies as basic terms. The clustering unit is configured to calculate weighted sums of statistical degrees of correlation between the basic terms based on the document set, to calculate conceptual degrees of correlation between the basic terms based on the general concept dictionary, and to cluster the basic terms based on the weighted sums.
摘要:
According to one embodiment, a keyword presentation apparatus includes an extraction unit, a selection unit and a clustering unit. The extraction unit is configured to extract, as technical terms, morpheme strings, which are not defined in a general concept dictionary, from a document set. The selection unit is configured to evaluate relevancies between each of basic term candidates and the technical terms, and to preferentially select basic term candidates having high relevancies as basic terms. The clustering unit is configured to calculate weighted sums of statistical degrees of correlation between the basic terms based on the document set, to calculate conceptual degrees of correlation between the basic terms based on the general concept dictionary, and to cluster the basic terms based on the weighted sums.