Abstract:
A method, system, and computer program product are provided for automatic generation of a word cloud for a content item. The method includes: extracting terms from a content item using statistical selection criteria; weighting a term by a probability that the term is used as a tag; and generating a visual representation of the terms in which each term's prominence reflects its weighting. Weighting a term by a probability that the term is used as a tag may include determining the relative frequency of the term in a folksonomy of tag terms for a domain.
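The weighting step can be illustrated with a minimal sketch. It rests on several assumptions that are not in the abstract: raw term frequency stands in for the unspecified statistical selection criteria, the hypothetical `folksonomy_counts` mapping holds how often each tag is used in the domain, and the weight is the product of term frequency and tag probability, scaled linearly to a font size.

```python
from collections import Counter
import re


def extract_terms(text, top_n=10):
    """Pick candidate terms by raw frequency (a stand-in for the
    unspecified statistical selection criteria)."""
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(words).most_common(top_n)


def tag_probability(term, folksonomy_counts):
    """Relative frequency of the term among all tag uses in the folksonomy."""
    total = sum(folksonomy_counts.values())
    return folksonomy_counts.get(term, 0) / total if total else 0.0


def word_cloud_weights(text, folksonomy_counts, top_n=10):
    """Weight each extracted term by its frequency times its tag probability
    (the product is an assumption; the abstract only names the probability)."""
    return {
        term: count * tag_probability(term, folksonomy_counts)
        for term, count in extract_terms(text, top_n)
    }


def font_sizes(weights, min_size=10, max_size=40):
    """Map weights linearly onto font sizes for the visual representation."""
    top = max(weights.values(), default=0.0)
    if top == 0:
        return {term: min_size for term in weights}
    return {
        term: round(min_size + (max_size - min_size) * w / top)
        for term, w in weights.items()
    }
```

With this scheme a frequent word that is never used as a tag (a stop word such as "the") gets a weight of zero and is drawn at the minimum size, while terms that are common tags in the domain are emphasized.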
Abstract:
A method, system, and computer program product are provided for visualization of user sentiment for one or more product features. The method may include: providing one or more product image templates, a product image template having a location representing a product feature; obtaining an aggregated sentiment score for a product feature from user-generated content; mapping the aggregated sentiment score to a score visualization on a visualization scale; and representing the location in the product image template relating to the product feature with the score visualization for the aggregated sentiment score to provide a visualization of the product. The method may also include: collecting one or more text expressions from user-generated content relating to a product feature; and representing the one or more text expressions in relation to the product feature in the product image template.
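The mapping from an aggregated sentiment score to a score visualization might look like the following sketch. The abstract does not fix the details, so these are assumptions: scores lie in [-1, 1], aggregation is a plain mean, the visualization scale runs from red to green, and a template is a hypothetical dictionary from feature name to an (x, y) location.

```python
from statistics import mean


def aggregate_sentiment(scores):
    """Aggregate per-mention sentiment scores (assumed in [-1, 1]) by averaging."""
    return mean(scores) if scores else 0.0


def score_to_color(score):
    """Map a score in [-1, 1] onto a red-to-green scale as an RGB hex string."""
    clamped = max(-1.0, min(1.0, score))
    fraction = (clamped + 1.0) / 2.0  # 0.0 = most negative, 1.0 = most positive
    red = round(255 * (1.0 - fraction))
    green = round(255 * fraction)
    return f"#{red:02x}{green:02x}00"


def visualize(template, feature_scores):
    """Attach a color to each template location that has sentiment data."""
    return {
        feature: {
            "location": location,
            "color": score_to_color(aggregate_sentiment(feature_scores[feature])),
        }
        for feature, location in template.items()
        if feature in feature_scores
    }
```

A renderer would then paint each returned location on the product image with its color; features with no user-generated content are left unmarked in this sketch.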
Abstract:
Disclosed are a system architecture, components, and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators, and various adapters. The search uses a two-level searching technique. Also disclosed are a system, method, and computer program product for processing document data. The method includes inputting a document and operating at least one text analysis engine that comprises a plurality of coupled annotators for tokenizing document data and for identifying and annotating a particular type of semantic content. Operating the at least one text analysis engine generates a plurality of views of the document, where each of the plurality of views is derived from a different tokenization of the document. The method further includes storing the plurality of views in a common data structure associated with the document.
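The idea of several views derived from different tokenizations and held in one common data structure can be sketched as follows. This is an illustration only, not the UIMS design: the `Document` class, the regex-based annotators, and the view names `whitespace` and `word` are all hypothetical.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Document:
    """Common data structure: the text plus every view derived from it."""
    text: str
    views: dict = field(default_factory=dict)  # view name -> [(start, end, token)]


def tokenize(pattern):
    """Build an annotator that records the spans matched by a regex."""
    compiled = re.compile(pattern)

    def annotator(text):
        return [(m.start(), m.end(), m.group()) for m in compiled.finditer(text)]

    return annotator


# Two hypothetical tokenizations of the same text.
ANNOTATORS = {
    "whitespace": tokenize(r"\S+"),  # split on whitespace only
    "word": tokenize(r"\w+"),        # split on any non-word character
}


def analyze(text, annotators=ANNOTATORS):
    """Run every annotator and store its output as a named view."""
    doc = Document(text)
    for name, annotate in annotators.items():
        doc.views[name] = annotate(text)
    return doc
```

For the text "state-of-the-art search", the `whitespace` view holds two tokens and the `word` view holds five, yet both index into the same underlying text through their character offsets.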
Abstract:
A method and system are provided for spoken document retrieval using multiple search transcription indices. The method includes receiving a query input formed of one or more query terms and determining a type of a query term, wherein the type is either a term in a speech recognition vocabulary or a term not in a speech recognition vocabulary. One or more indices of search transcriptions are selected for searching the query term based on the type of the query term. The one or more indices are generated using different speech transcription methods. The results for the query term are scored by the one or more indices, and the results of the one or more indices for the query term are merged. The results of the one or more query terms are then merged to provide the results for the query.
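The routing and two-stage merging described above can be sketched as follows. The abstract does not say how indices are named or how scores are combined, so these are assumptions: each index is a hypothetical mapping from term to per-document scores, the `routing` table decides which indices serve each term type, results for one term are merged across indices by taking the best score per document, and results across terms are merged by summing.

```python
def term_type(term, vocabulary):
    """Classify a query term against the speech recognition vocabulary."""
    return "in_vocabulary" if term in vocabulary else "out_of_vocabulary"


def search(query, vocabulary, indices, routing):
    """indices: name -> {term -> {doc_id: score}}
    routing: term type -> list of index names to search."""
    per_term = []
    for term in query.lower().split():
        merged = {}
        for name in routing[term_type(term, vocabulary)]:
            for doc_id, score in indices[name].get(term, {}).items():
                # Merge across indices: keep the best score per document.
                merged[doc_id] = max(merged.get(doc_id, 0.0), score)
        per_term.append(merged)

    # Merge across query terms: sum the per-term scores per document.
    totals = {}
    for merged in per_term:
        for doc_id, score in merged.items():
            totals[doc_id] = totals.get(doc_id, 0.0) + score
    return sorted(totals.items(), key=lambda item: (-item[1], item[0]))
```

A typical setup under these assumptions would route in-vocabulary terms to both a word-level index and a sub-word index, and out-of-vocabulary terms only to the sub-word index, since a word-level transcription cannot contain a term the recognizer does not know.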
Abstract:
Personalized tag ranking of images, including: identifying within a reference image collection any images that are similar to an input image; identifying within a source image collection any images that have associated tags that are similar to a set of input tags associated with the input image; identifying among the images identified in the reference image collection any images that are similar to the images identified in the source image collection; calculating a weight for each of a plurality of tag pairs, where each of the tags in each of the tag pairs is associated with a different subset of the images in the reference image collection identified as being similar to the images identified in the source image collection; and ranking the input tags of the input image in accordance with a predefined ranking function as applied to the tag pair weights.