摘要:
An information processing system according to one embodiment includes an extraction unit and a generation unit. The extraction unit extracts a word specified by a user and contained in a request transmitted from a user terminal and an image displayed as a choice on the user terminal in response to the request. The generation unit generates combination data by associating image data corresponding to the image and obtained from an image storage unit with a name based on the word.
摘要:
A base word to be a base, a compound word in which the base word becomes a modifiee, classification items to classify the compound word, and feature information about a feature that provides a clue to classify the compound word are acquired (S10, S11, S12, S13), the compound word containing the base word is distributed into the acquired classification item using a classification model generated in advance and the acquired feature information (S14, S15), base word related information containing a plurality of elements related to the base word is acquired based on the base word (S16), each of at least a portion of the elements contained in the acquired base word related information is classified into one of the classification items in accordance with a result of the classification (S17), and the classified base word related information (Web pages 40, 50, 51) is output (S18).
摘要:
Based on information being associated with one image among a plurality of images and concerning an object of the one image and information being associated with another image among the plurality of images and concerning an object of the other image, a characteristic information specification unit specifies characteristic information of the object of the one image as compared with the object of the other image. A characteristic information obtaining unit obtains the characteristic information specified by the characteristic information specification unit. A display control unit displays a screen image including a plurality of images on a display unit. Further, the display control unit displays the characteristic information so as to be associated with the one image.
摘要:
A counting device (100) provided with a subtree generating part (123) for generating first subtree comprising a first sentence and a second subtree comprising a second sentence. The counting device (100) is provided with: a categorizing part (125) for categorizing the first subtree in the same group as the second subtree when it is determined that a first expression represented by the first subtree and a second expression represented by a second subtree represent a matching content; and an output part (127) for outputting the number of subtrees categorized in the group, or an expression represented by a plurality of syntax trees or one of the subtrees categorized in the aforementioned group.
摘要:
A corpus generation device according to an embodiment includes a web page acquisition unit, a reference word acquisition unit, an attachment unit and an output unit. The web page acquisition unit acquires a web page including description sentence data regarding a presentation target. The reference word acquisition unit acquires a reference word that is an attribute value regarding the presentation target from the web page. The attachment unit extracts a broader word belonging to a layer above the reference word acquired by the reference word acquisition unit from a storage unit that stores hierarchical relationship information indicating a hierarchical relationship between attribute values, and attaches an attribute tag corresponding to the reference word to the broader word included in the description sentence data. The output unit outputs, as corpus data, the description sentence data to which the attribute tag is attached by the attachment unit.
摘要:
The information processing system according to one embodiment includes a specifying unit and an extraction unit. The specifying unit specifies a content word co-occurring with onomatopoeia in one review among a plurality of posted reviews stored in a storage unit. The extraction unit extracts a posted sentence containing the content word from the plurality of posted reviews. In general, posted sentences or posted reviews containing onomatopoeia are likely to include users' actual experiences. By extracting the posted sentences that contain the content word which is likely to co-occur with onomatopoeia, it is possible to effectively extract the posted sentences on which users' actual experiences are written.
摘要:
A counting device (100) provided with a subtree generating part (123) for generating first subtree comprising a first sentence and a second subtree comprising a second sentence. The counting device (100) is provided with: a categorizing part (125) for categorizing the first subtree in the same group as the second subtree when it is determined that a first expression represented by the first subtree and a second expression represented by a second subtree represent a matching content; and an output part (127) for outputting the number of subtrees categorized in the group, or an expression represented by a plurality of syntax trees or one of the subtrees categorized in the aforementioned group.
摘要:
A corpus creation device includes an acquisition unit that acquires item page data containing description data related to an item and an attribute list where an attribute name and an attribute value related to the item are associated, an adding unit that, when an attribute value in an attribute list contained in item page data is contained in description data in the item page data, adds an attribute tag identifying an attribute name with which the attribute value is associated in the attribute list to the attribute value contained in the description data, and an output unit that outputs description data in which an attribute tag is added as corpus data.