System and method for document collection, grouping and summarization
    3.
    Granted patent
    System and method for document collection, grouping and summarization (status: in force)

    Publication No.: US08176418B2

    Publication date: 2012-05-08

    Application No.: US11071968

    Filing date: 2005-03-04

    IPC classification: G06F17/00

    CPC classification: G06Q10/10

    Abstract: A system is provided for generating a summary of a plurality of documents and presenting the summary information to a user. The system includes a computer-readable document collection containing a plurality of related documents stored in electronic form. Documents can be pre-processed to group them into document clusters, and the clusters can be assigned to predetermined document categories for presentation to a user. A number of multiple-document summarization engines are provided, which generate summaries for specific classes of multiple-document clusters. A summarizer router determines the relationship of the documents in a cluster and selects one of the document summarization engines to generate a summary of the cluster. A single event engine generates summaries of documents that are closely related temporally and relate to a specific event. A dissimilarity engine for multiple-document summary generation generates summaries of document clusters whose documents have varying degrees of relatedness. A user interface displays categories, cluster titles, summaries, and related images.
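
The routing step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the router measures the relationship between documents, so everything here is an assumption made for illustration: the `Document` type, the word-set Jaccard similarity, the `max_span_days` and `min_similarity` thresholds, and the engine names returned as strings.

```python
from dataclasses import dataclass
from datetime import date
from itertools import combinations


@dataclass
class Document:
    text: str
    published: date


def jaccard(a: str, b: str) -> float:
    """Word-set overlap between two texts, in [0, 1]."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    union = wa | wb
    return len(wa & wb) / len(union) if union else 0.0


def route_cluster(docs, max_span_days=3, min_similarity=0.3):
    """Pick an engine name for a cluster (hypothetical rule, not the patent's)."""
    dates = [d.published for d in docs]
    span = (max(dates) - min(dates)).days
    pairs = list(combinations(docs, 2))
    # A single-document cluster is treated as fully coherent.
    mean_sim = (
        sum(jaccard(a.text, b.text) for a, b in pairs) / len(pairs)
        if pairs else 1.0
    )
    # Temporally close and textually similar: treat as one event.
    if span <= max_span_days and mean_sim >= min_similarity:
        return "single_event"
    # Otherwise fall back to the engine for loosely related documents.
    return "dissimilarity"
```

A cluster of same-day reports with heavily overlapping wording would be routed to the single event engine under these assumed thresholds, while a cluster spread over weeks would go to the dissimilarity engine.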


    System and method for document collection, grouping and summarization
    4.
    Patent application
    System and method for document collection, grouping and summarization (status: in force)

    Publication No.: US20050203970A1

    Publication date: 2005-09-15

    Application No.: US11071968

    Filing date: 2005-03-04

    IPC classification: G06F7/00 G06F15/00 G06Q10/00

    CPC classification: G06Q10/10

    Abstract: A system is provided for generating a summary of a plurality of documents and presenting the summary information to a user. The system includes a computer-readable document collection containing a plurality of related documents stored in electronic form. Documents can be pre-processed to group them into document clusters, and the clusters can be assigned to predetermined document categories for presentation to a user. A number of multiple-document summarization engines are provided, which generate summaries for specific classes of multiple-document clusters. A summarizer router determines the relationship of the documents in a cluster and selects one of the document summarization engines to generate a summary of the cluster. A single event engine generates summaries of documents that are closely related temporally and relate to a specific event. A dissimilarity engine for multiple-document summary generation generates summaries of document clusters whose documents have varying degrees of relatedness. A user interface displays categories, cluster titles, summaries, and related images.


    Method and apparatus for classification of relative position of one or more text messages in an email thread
    5.
    Patent application
    Method and apparatus for classification of relative position of one or more text messages in an email thread (status: under examination, published)

    Publication No.: US20060031304A1

    Publication date: 2006-02-09

    Application No.: US10833262

    Filing date: 2004-04-27

    IPC classification: G06F15/16

    CPC classification: G06Q10/107

    Abstract: Methods and apparatus are disclosed for classifying the relative position of one or more text messages (including transcribed audio messages) in a related thread of text messages. One or more classifiers are applied to the text messages, and a classification is obtained that indicates the relative position of each text message in the thread. For example, a thread can include a root message, a leaf message and one or more inner messages, and the classification can indicate whether each text message is a root message, a leaf message or an inner message. The classifiers are trained on a set of training messages that have previously been classified to indicate the relative position of each training message in its thread. The classifiers employ one or more features that help to distinguish between root and non-root messages.


    Method and apparatus for summarizing one or more text messages using indicative summaries
    6.
    Patent application
    Method and apparatus for summarizing one or more text messages using indicative summaries (status: in force)

    Publication No.: US20050262214A1

    Publication date: 2005-11-24

    Application No.: US10833261

    Filing date: 2004-04-27

    Abstract: A method and apparatus are provided for summarizing a text message, such as an email message or a transcribed audio message. A portion of each text message, such as a sentence, is extracted as an indicative summary of the text message based on the degree of overlap between the words in the sentence and a set of words, such as the words in the message subject or in a related root message. The portion to extract is chosen using a score computed for each portion of the text message, such as each sentence. An interface is also provided for presenting the indicative summaries of a set of related text messages to a user.
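
The overlap-based extraction described in the abstract can be sketched roughly as follows. The abstract does not give the scoring formula, so this example assumes the simplest one: the score of a sentence is the number of distinct words it shares with the reference text (the subject line or a root message). The function names `tokenize` and `indicative_summary`, the regex-based sentence splitting, and the tie-breaking rule (first sentence wins) are all hypothetical choices.

```python
import re


def tokenize(text):
    """Lowercased set of word tokens in text."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


def indicative_summary(body, reference):
    """Return the sentence of body sharing the most words with reference.

    Assumed scoring: count of distinct shared words. Ties go to the
    earliest sentence; an empty body yields an empty string.
    """
    ref_words = tokenize(reference)
    # Naive sentence split on ., ! or ? followed by whitespace.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", body) if s.strip()]

    def score(sentence):
        return len(tokenize(sentence) & ref_words)

    return max(sentences, key=score, default="")
```

For a message whose subject is "Budget review meeting", a body sentence that mentions the budget review meeting would outscore greetings and sign-offs, which share no words with the subject.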
