Abstract:
A dictionary creation device and dictionary creation method are provided which optimally create and update a dictionary for classifying, searching, or extracting text information in accordance with changes in the content of text information groups. The dictionary creation device includes a keyword extraction unit that extracts a keyword from inputted text information and a keyword statistics unit that finds statistics regarding the appearance of the keyword. The dictionary creation device further includes a keyword assessment value calculation unit that calculates an assessment value of the extracted keyword based on the statistics regarding the appearance of the keyword, a determination unit that determines whether to register or delete the keyword based on the calculated assessment value, a dictionary registration and deletion unit that registers the keyword in, or deletes it from, a dictionary database based on the result of the determination, and the dictionary database.
Abstract:
There are provided a dictionary creation device and a dictionary creation method which optimally create and update a dictionary for classifying, searching, or extracting text information in accordance with changes in the content of text information groups. The dictionary creation device includes: a keyword extraction unit that extracts a keyword from inputted text information; a keyword statistics unit that finds statistics regarding the appearance of the keyword; a keyword assessment value calculation unit that calculates an assessment value of the extracted keyword based on the statistics regarding the appearance of the keyword; a determination unit that determines whether to register or delete the keyword based on the calculated assessment value; a dictionary registration and deletion unit that registers the keyword in, or deletes it from, a dictionary database based on the result of the determination; and the dictionary database.
Abstract:
There are provided a dictionary creation device and a dictionary creation method which optimally create and update a dictionary for classifying, searching, or extracting text information in accordance with changes in the content of text information groups. The dictionary creation device includes: a keyword extraction unit that extracts a keyword from inputted text information; a keyword statistics unit that finds statistics regarding the appearance of the keyword; a keyword assessment value calculation unit that calculates an assessment value of the extracted keyword based on the statistics regarding the appearance of the keyword; a determination unit that determines whether to register or delete the keyword based on the calculated assessment value; a dictionary registration and deletion unit that registers the keyword in, or deletes it from, a dictionary database based on the result of the determination; and the dictionary database.
Abstract:
A dictionary creation device and dictionary creation method are provided which optimally create and update a dictionary for classifying, searching, or extracting text information in accordance with changes in the content of text information groups. The dictionary creation device includes: a keyword extraction unit that extracts a keyword from inputted text information; a keyword statistics unit that finds statistics regarding the appearance of the keyword; a keyword assessment value calculation unit that calculates an assessment value of the extracted keyword based on the statistics regarding the appearance of the keyword; a determination unit that determines whether to register or delete the keyword based on the calculated assessment value; a dictionary registration and deletion unit that registers the keyword in, or deletes it from, a dictionary database based on the result of the determination; and the dictionary database.
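The extract, count, assess, and register-or-delete flow described in the dictionary creation abstracts can be illustrated with a minimal sketch. The abstracts do not specify the tokenizer, the assessment formula, or the thresholds, so everything here is an assumption made for illustration: the function names, the use of document frequency as the assessment value, and the `register_at` and `delete_below` parameters are all hypothetical.

```python
import re
from collections import Counter


def extract_keywords(text):
    """Keyword extraction unit (assumed): lowercase alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def update_dictionary(dictionary, documents, register_at=0.5, delete_below=0.2):
    """Return an updated keyword set for the given documents.

    Assessment value (assumed): the fraction of documents that contain
    the keyword. The abstracts leave the actual formula open.
    """
    # Keyword statistics unit: count the documents in which each keyword appears.
    doc_freq = Counter()
    for doc in documents:
        doc_freq.update(set(extract_keywords(doc)))

    total = len(documents)
    updated = set(dictionary)
    # Assess both newly seen keywords and keywords already registered.
    for keyword in set(doc_freq) | set(dictionary):
        score = doc_freq[keyword] / total if total else 0.0
        if score >= register_at:
            updated.add(keyword)       # determination: register
        elif score < delete_below:
            updated.discard(keyword)   # determination: delete
    return updated


docs = ["apple banana", "apple cherry", "apple banana date"]
result = update_dictionary({"zebra", "cherry"}, docs)
```

With these inputs, `apple` and `banana` are registered, `zebra` is deleted because it no longer appears, and `cherry` stays because its score falls between the two assumed thresholds.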
Abstract:
Questions and answers associated with each other are stored in a document storage section. A clustering section classifies the answers in the document storage section into clusters based on feature vectors of the answers. When a natural language question is input by the user, a database retrieval/updating section retrieves a question similar to the input question, and presents the answers associated with the retrieved question, grouped by cluster, to the user or an expert. In addition, the database retrieval/updating section automatically updates the document storage section based on the answer that the user or the expert selects as most appropriate or, if no appropriate answer is available, based on an answer newly input by the expert. The natural language answer input by the expert is presented to the user as is.
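The retrieval step in this abstract, finding a stored question similar to the input and presenting its answers grouped by cluster, might look like the sketch below. The abstract does not say how similarity is measured or how clusters are computed, so this sketch assumes Jaccard overlap on whitespace tokens and cluster ids that were assigned beforehand. The function names and the tuple layout of `qa_store` are hypothetical.

```python
from collections import defaultdict


def tokens(text):
    """Whitespace tokens, lowercased (a simplifying assumption)."""
    return set(text.lower().split())


def jaccard(a, b):
    """Jaccard similarity between two token sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def retrieve_answers_by_cluster(query, qa_store):
    """qa_store: list of (question, answer, cluster_id) tuples.

    Cluster ids are assumed to be precomputed by the clustering section.
    Returns the most similar stored question and its answers per cluster.
    """
    query_tokens = tokens(query)
    best_question = max(
        (question for question, _, _ in qa_store),
        key=lambda question: jaccard(query_tokens, tokens(question)),
    )
    grouped = defaultdict(list)
    for question, answer, cluster_id in qa_store:
        if question == best_question:
            grouped[cluster_id].append(answer)
    return best_question, dict(grouped)


store = [
    ("how to reset password", "Use the reset link", 0),
    ("how to reset password", "Contact support", 1),
    ("how to change email", "Open settings", 0),
]
best, answers = retrieve_answers_by_cluster("reset my password", store)
```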
Abstract:
An apparatus includes a preference word obtainment unit that, when a search condition is input, obtains related words whose degree of association with keywords included in the search condition is a predetermined threshold or greater. When an expansion command is received, the preference word obtainment unit decreases the predetermined threshold and obtains related words from a preference association dictionary. The apparatus further includes (i) a judgment unit which, when the expansion command is received, judges whether the related word obtained by the preference word obtainment unit is stored, (ii) a general word obtainment unit which obtains a related word from a general dictionary when it is judged that the related word obtained by the preference word obtainment unit is stored, and (iii) a retrieval unit which generates a search condition from the related word, obtains information meeting the generated search condition, and outputs the obtained information as a search result.
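The threshold-lowering expansion described in this abstract can be sketched as follows. This is one possible reading, not the claimed implementation: "stored" is interpreted here as "already obtained earlier", and the fallback to the general dictionary happens only when the lowered threshold yields nothing new. The dictionary layout (keyword mapped to related words with association degrees), the `step` size, and all names are assumptions.

```python
def related_words(keyword, association, threshold):
    """Words whose association degree with `keyword` is >= threshold."""
    degrees = association.get(keyword, {})
    return sorted(word for word, degree in degrees.items() if degree >= threshold)


def expand(keyword, preference_dict, general_dict, threshold, stored, step=0.25):
    """Handle an expansion command (assumed behavior).

    Lowers the threshold by `step` and queries the preference dictionary.
    If every word obtained is already in `stored`, falls back to the
    general dictionary at the same lowered threshold.
    """
    lowered = max(0.0, threshold - step)
    candidates = related_words(keyword, preference_dict, lowered)
    if all(word in stored for word in candidates):
        candidates = related_words(keyword, general_dict, lowered)
    return candidates


preference = {"jazz": {"saxophone": 0.9, "blues": 0.6, "opera": 0.1}}
general = {"jazz": {"music": 0.8, "concert": 0.55}}

initial = related_words("jazz", preference, 0.75)
widened = expand("jazz", preference, general, 0.75, stored={"saxophone"})
fallback = expand("jazz", preference, general, 0.75, stored={"saxophone", "blues"})
```

Here `initial` holds only the strongly associated word, `widened` adds a weaker preference word after the threshold drops, and `fallback` comes from the general dictionary because the preference dictionary has nothing new to offer.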
Abstract:
An information recommendation apparatus has: registration means for registering contents in a content database, the contents being formed of plural pieces of data having plural items and attribute values corresponding thereto; condition input means for inputting conditions, represented by predetermined items and attribute values corresponding thereto, that are designated through the terminal of a user via the Internet; recommendation means for selecting and recommending, from among the contents stored in the content database, contents coincident with or similar to the input conditions; and output means for outputting the recommended contents to the terminal via the Internet.
Abstract:
An information retrieval apparatus uses a preference-based association dictionary, in which the stored words are dynamically changed according to a user's preferences, and a general association dictionary storing a relationship between keywords included in a database. The apparatus includes: a matching unit which calculates a degree of matching between a search condition inputted from an input unit and a profile; a generation unit which, in order to generate a search condition, obtains (i) one or more related words from the general association dictionary in a case where the matching degree is smaller than a predetermined threshold, and (ii) one or more related words from the preference-based association dictionary in a case where the matching degree is greater than the predetermined threshold; and a retrieval unit which retrieves information that meets the generated search condition.
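The dictionary switch described in this abstract can be illustrated with a short sketch. The abstract does not define how the matching degree is computed or what happens when it equals the threshold, so this sketch assumes the degree is the fraction of search words found in the profile and treats equality as the general-dictionary case. The function names and data shapes are hypothetical.

```python
def matching_degree(search_condition, profile):
    """Fraction of search words that appear in the user profile (assumed metric)."""
    words = search_condition.lower().split()
    if not words:
        return 0.0
    return sum(word in profile for word in words) / len(words)


def generate_search_condition(search_condition, profile, general_dict,
                              preference_dict, threshold=0.5):
    """Expand the search words with related words from one dictionary.

    Uses the preference-based dictionary when the matching degree exceeds
    the threshold, and the general dictionary otherwise.
    """
    degree = matching_degree(search_condition, profile)
    source = preference_dict if degree > threshold else general_dict

    words = search_condition.lower().split()
    expanded = list(words)
    for word in words:
        for related in source.get(word, []):
            if related not in expanded:
                expanded.append(related)
    return expanded


profile = {"jazz", "piano"}
general = {"jazz": ["music"], "tickets": ["concert"]}
preference = {"jazz": ["saxophone"]}

close_match = generate_search_condition("jazz piano", profile, general, preference)
weak_match = generate_search_condition("jazz tickets cheap", profile, general, preference)
```

A query that matches the profile closely is expanded from the preference-based dictionary, while a query that matches it weakly is expanded from the general one.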
Abstract:
An information recommendation apparatus selects and recommends, from among contents registered in a content database, contents coincident with or similar to input conditions. The conditions are represented by predetermined items and attribute values corresponding thereto, and the contents are formed of plural pieces of data having plural items and attribute values corresponding thereto. The recommended contents are output to the terminal.
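Selecting contents "coincident with or similar to" item and attribute-value conditions can be sketched as below. The abstract does not define similarity, so this sketch assumes it is the fraction of condition items whose attribute value matches exactly, with a hypothetical `min_similarity` cutoff. The names and the dictionary representation of contents are also assumptions.

```python
def similarity(conditions, content):
    """Fraction of condition items whose attribute value matches the content."""
    if not conditions:
        return 0.0
    matches = sum(content.get(item) == value for item, value in conditions.items())
    return matches / len(conditions)


def recommend(conditions, contents, min_similarity=0.5):
    """Return contents at or above the cutoff, best matches first."""
    scored = [(similarity(conditions, content), content) for content in contents]
    kept = [pair for pair in scored if pair[0] >= min_similarity]
    kept.sort(key=lambda pair: pair[0], reverse=True)
    return [content for _, content in kept]


database = [
    {"title": "A", "genre": "drama", "year": 2001},
    {"title": "B", "genre": "drama", "year": 1999},
    {"title": "C", "genre": "comedy", "year": 1980},
]
picks = recommend({"genre": "drama", "year": 2001}, database)
titles = [content["title"] for content in picks]
```

Content `A` coincides with both conditions, `B` is similar because it matches one of the two, and `C` is excluded.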
Abstract:
An information retrieval apparatus appropriately switches between two association dictionaries: a preference-based association dictionary, in which the stored words are dynamically changed according to a user's preferences, and a general association dictionary storing a relationship between the keywords included in a database. The apparatus includes: a matching degree calculation unit which calculates a degree of matching between a search condition inputted from an input unit and a profile; a search condition generation unit which, in order to generate a search condition, obtains one or more related words from the general association dictionary stored in a general association dictionary storage unit in the case where the calculated matching degree is smaller than a predetermined threshold, and obtains one or more related words from the preference-based association dictionary stored in a preference-based association dictionary storage unit in the case where the matching degree is greater than the predetermined threshold; and a retrieval unit which retrieves information that meets the generated search condition, and outputs the information as a search result.