RELEVANCE OPTIMIZED REPRESENTATIVE CONTENT ASSOCIATED WITH A DATA STORAGE SYSTEM

    公开(公告)号:EP3283984A4

    公开(公告)日:2018-04-04

    申请号:EP16862610

    申请日:2016-03-10

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30719 G06F17/30598

    摘要: Relevance optimized representative content associated with a data storage system is disclosed. One example is a system including a data summarization module, a clustering module, and a representative content selection module. The data summarization module associates, via a processor, each data object in a storage system with a derived data object. The clustering module determines clusters of similar data objects based on a similarity between associated derived data objects, and selects a representative data object for each determined cluster. The representative content selection module selects representative content associated with the storage system, where the representative content is based on the data objects, the derived data objects, and the representative data objects, and relevance optimizes of the selected representative content to an analytics application.

    TOPIC IDENTIFICATION BASED ON FUNCTIONAL SUMMARIZATION
    2.
    发明公开
    TOPIC IDENTIFICATION BASED ON FUNCTIONAL SUMMARIZATION 审中-公开
    基于功能概括的主题识别

    公开(公告)号:EP3230892A1

    公开(公告)日:2017-10-18

    申请号:EP15890920.0

    申请日:2015-04-29

    发明人: SIMSKE, Steven J

    CPC分类号: G06F17/30719 G06F17/30707

    摘要: Topic identification based on functional summarization is disclosed. One example is a system including a plurality of summarization engines, each summarization engine to receive, via a processing system, a document to provide a summary of the document. At least one meta-algorithmic pattern is applied to at least two summaries to provide a meta-summary of the document using the at least two summaries. A content processor identifies, from the meta-summaries, topics associated with the document, maps the identified topics to a collection of topic dimensions, and identifies a representative point based on the identified topics. An evaluator determines distance measures of the representative point from topic dimensions in the collection of topic dimensions, the distance measures indicative of proximity of respective topic dimensions to the representative point. A selector selects a topic dimension to be associated with the document, the selection based on optimizing the distance measures.

    摘要翻译: 公开了基于功能汇总的主题识别。 一个示例是包括多个汇总引擎的系统,每个汇总引擎经由处理系统接收文档以提供文档的摘要。 将至少一个元算法模式应用于至少两个摘要以使用至少两个摘要提供文档的元摘要。 内容处理器从元摘要中识别与文档相关联的主题,将所识别的主题映射到主题维度的集合,并且基于所识别的主题识别代表性的点。 评估者根据主题维度集合中的主题维度来确定代表点的距离度量,距离度量指示各个主题维度与代表点的接近度。 选择器选择要与文档相关联的主题维度,该选择基于优化距离度量。

    INFORMATION PROCESSING DEVICE AND INFORMATION PROCESSING PROGRAM
    5.
    发明公开
    INFORMATION PROCESSING DEVICE AND INFORMATION PROCESSING PROGRAM 审中-公开
    信息处理设备和信息处理程序

    公开(公告)号:EP3098724A2

    公开(公告)日:2016-11-30

    申请号:EP16156202.0

    申请日:2016-02-17

    IPC分类号: G06F17/22

    摘要: An information processing device includes a detail level estimation unit that estimates a detail level of each of at least two documents, the detail level indicating degree to which a content of the document is detailed, a similarity degree estimation unit that estimates a similarity degree between two of the at least two documents, and a document relationship output unit that outputs a document relationship for the two of the at least two documents the similarity degree of which satisfies a predetermined condition, wherein in the document relationship, one of the two of the at least two documents is determined as a summarized document that shows a summary of other document of the two of the at least two documents, and the detail level of the one of the two of the at least two documents is lower than the detail level of the other of the two.

    摘要翻译: 一种信息处理设备,包括:细节水平估计单元,估计至少两个文档中的每一个的细节水平,细节水平指示文档的内容被详细描述的程度;相似度估计单元,估计两个文档之间的相似度 以及文档关系输出单元,其输出关于其相似度满足预定条件的至少两个文档中的两个文档的文档关系,其中在文档关系中,两个文档中的一个 至少两个文档被确定为总结文档,该文档示出至少两个文档中的两个文档的其他文档的概要,并且至少两个文档中的两个文档中的一个的详细程度低于 另外两个。

    GENERATING ELECTRONIC SUMMARIES OF ONLINE MEETINGS
    6.
    发明公开
    GENERATING ELECTRONIC SUMMARIES OF ONLINE MEETINGS 审中-公开
    在线制作会议的电子摘要

    公开(公告)号:EP3069265A2

    公开(公告)日:2016-09-21

    申请号:EP14805752.4

    申请日:2014-11-12

    IPC分类号: G06F17/00

    摘要: An improved technique of organizing content of online meetings involves generating an electronic summary based on a textual metadata derived from content presented in an online meeting. An online meeting server collects content such as audio, video, and slide files presented in a particular online meeting. From metadata associated with such content, the online meeting server generates an electronic summary of the particular online meeting which includes a textual description of the content. The online meeting server then stores the electronic summary and the content presented in the particular online meeting in a repository that is configured to store content from other online meetings.

    DISCUSSION SUMMARY
    7.
    发明公开
    DISCUSSION SUMMARY 审中-公开
    DISKUSSIONSZUSAMMENFASSUNG

    公开(公告)号:EP3061005A1

    公开(公告)日:2016-08-31

    申请号:EP14796613.9

    申请日:2014-10-22

    IPC分类号: G06F17/30

    摘要: One or more techniques and/or systems are provided for providing a discussion summary corresponding to a search query and/or for providing discussion session search results. For example, discussion data (e.g., corresponding to real-time messaging, such as a microblog discussion) may be evaluated to identify a discussion topic for a discussion sessions (e.g., a kitchen renovation topic may be assigned to a 1 hour exchange of kitchen renovation messages by a discussion group). A discussion summary of a discussion session may be provided based upon the discussion session having a discussion topic corresponding to a search query topic of a search query. The discussion summary may be provided along with other results for the query and may describe the discussion group, identifiers such as hashtags used by the discussion group, meeting dates/times, average number(s) of participants, other discussion sessions hosted by the discussion group, future discussion sessions, and/or other information.

    DISPLAY APPARATUS AND METHOD FOR SUMMARIZING OF DOCUMENT
    10.
    发明公开
    DISPLAY APPARATUS AND METHOD FOR SUMMARIZING OF DOCUMENT 审中-公开
    显示装置和文件汇总方法

    公开(公告)号:EP3021239A3

    公开(公告)日:2016-05-25

    申请号:EP15194788.4

    申请日:2015-11-16

    IPC分类号: G06F17/30

    摘要: A display apparatus including a communicator configured to perform data communication with a content server and to receive at least one of a main document and a sub document related to the main document; a document analyzer configured to extract a keyword having a high frequency of occurrence from the main document and to determine a head keyword for generating a summarized document from the extracted keyword with reference to the received sub document; and a processor configured to determine a reliability of each sentence of the main document based on the head keyword, extract a sentence that matches a predetermined condition with reference to the determined reliability, and analyze a structural format of the extracted sentence so as to re-configure a word that forms the sentence and generate a summarized sentence, thereby generating a summarized document where information and logical cohesion have been obtained.

    摘要翻译: 一种显示设备,包括:通信器,被配置为执行与内容服务器的数据通信并接收与主文档相关的主文档和子文档中的至少一个; 文档分析器,被配置为从主文档中提取具有高出现频率的关键字,并且参考接收到的子文档从提取的关键字确定用于生成汇总文档的开头关键字; 以及处理器,其被配置为基于所述头部关键词来确定所述主文档的每个句子的可靠性,参考所确定的可靠性来提取与预定条件匹配的句子,并且分析所提取的句子的结构格式, 配置形成句子的单词并生成概括性句子,由此生成已经获得信息和逻辑内聚性的概括性文档。