摘要:
Relevance optimized representative content associated with a data storage system is disclosed. One example is a system including a data summarization module, a clustering module, and a representative content selection module. The data summarization module associates, via a processor, each data object in a storage system with a derived data object. The clustering module determines clusters of similar data objects based on a similarity between associated derived data objects, and selects a representative data object for each determined cluster. The representative content selection module selects representative content associated with the storage system, where the representative content is based on the data objects, the derived data objects, and the representative data objects, and relevance optimizes of the selected representative content to an analytics application.
摘要:
Topic identification based on functional summarization is disclosed. One example is a system including a plurality of summarization engines, each summarization engine to receive, via a processing system, a document to provide a summary of the document. At least one meta-algorithmic pattern is applied to at least two summaries to provide a meta-summary of the document using the at least two summaries. A content processor identifies, from the meta-summaries, topics associated with the document, maps the identified topics to a collection of topic dimensions, and identifies a representative point based on the identified topics. An evaluator determines distance measures of the representative point from topic dimensions in the collection of topic dimensions, the distance measures indicative of proximity of respective topic dimensions to the representative point. A selector selects a topic dimension to be associated with the document, the selection based on optimizing the distance measures.
摘要:
A method involves a step (402) of accessing first data representing text; a step (404) of identifying a term within the represented text; a step (406) of using the identified term as a basis to select a text-editing rule from among a set of text-editing rules; a step of generating second data that represents the represented text edited in accordance with the selected text-editing rule; and a step (408) of providing first output based on the generated second data.
摘要:
An information processing device includes a detail level estimation unit that estimates a detail level of each of at least two documents, the detail level indicating degree to which a content of the document is detailed, a similarity degree estimation unit that estimates a similarity degree between two of the at least two documents, and a document relationship output unit that outputs a document relationship for the two of the at least two documents the similarity degree of which satisfies a predetermined condition, wherein in the document relationship, one of the two of the at least two documents is determined as a summarized document that shows a summary of other document of the two of the at least two documents, and the detail level of the one of the two of the at least two documents is lower than the detail level of the other of the two.
摘要:
An improved technique of organizing content of online meetings involves generating an electronic summary based on a textual metadata derived from content presented in an online meeting. An online meeting server collects content such as audio, video, and slide files presented in a particular online meeting. From metadata associated with such content, the online meeting server generates an electronic summary of the particular online meeting which includes a textual description of the content. The online meeting server then stores the electronic summary and the content presented in the particular online meeting in a repository that is configured to store content from other online meetings.
摘要:
One or more techniques and/or systems are provided for providing a discussion summary corresponding to a search query and/or for providing discussion session search results. For example, discussion data (e.g., corresponding to real-time messaging, such as a microblog discussion) may be evaluated to identify a discussion topic for a discussion sessions (e.g., a kitchen renovation topic may be assigned to a 1 hour exchange of kitchen renovation messages by a discussion group). A discussion summary of a discussion session may be provided based upon the discussion session having a discussion topic corresponding to a search query topic of a search query. The discussion summary may be provided along with other results for the query and may describe the discussion group, identifiers such as hashtags used by the discussion group, meeting dates/times, average number(s) of participants, other discussion sessions hosted by the discussion group, future discussion sessions, and/or other information.
摘要:
A display apparatus including a communicator configured to perform data communication with a content server and to receive at least one of a main document and a sub document related to the main document; a document analyzer configured to extract a keyword having a high frequency of occurrence from the main document and to determine a head keyword for generating a summarized document from the extracted keyword with reference to the received sub document; and a processor configured to determine a reliability of each sentence of the main document based on the head keyword, extract a sentence that matches a predetermined condition with reference to the determined reliability, and analyze a structural format of the extracted sentence so as to re-configure a word that forms the sentence and generate a summarized sentence, thereby generating a summarized document where information and logical cohesion have been obtained.