Method and system for topical segmentation, segment significance and segment function
    Granted invention patent (status: expired)

    Publication number: US06473730B1

    Publication date: 2002-10-29

    Application number: US09290643

    Filing date: 1999-04-12

    IPC classification: G06F17/27

    Abstract: A “domain-general” method for topical segmentation of a document input includes the steps of: extracting one or more selected terms from a document; linking occurrences of the extracted terms based upon the proximity of similar terms; and assigning weighted scores to paragraphs of the document input corresponding to the linked occurrences. In accordance with the present invention, the values of the assigned scores depend upon the type of the selected terms, e.g., common noun, proper noun, pronominal, and the position of the linked occurrences with respect to the paragraphs, e.g., front, during, rear, etc. Upon zero-sum normalization, the assigned scores represent the boundaries of the topical segments of the document input.
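
    The three steps named in the abstract (term extraction, proximity-based linking, weighted paragraph scoring followed by zero-sum normalization) can be illustrated with a minimal Python sketch. Everything specific in it is an assumption made for illustration and is not taken from the patent: the input is assumed to be already reduced to (term, type) pairs per paragraph, the `MAX_GAP` and `WEIGHTS` values are invented, "front", "during" and "rear" are interpreted as the paragraph where a link starts, the later paragraphs it covers, and the paragraph right after it ends, and zero-sum normalization is approximated by subtracting the mean score.

```python
from collections import defaultdict

# Hypothetical linking distances (in paragraphs) per term type; not from the patent.
MAX_GAP = {"proper": 8, "common": 4, "pronoun": 1}

# Hypothetical weights per term type and link position; not from the patent.
WEIGHTS = {
    "proper": {"front": 10, "during": -3, "rear": 8},
    "common": {"front": 10, "during": -3, "rear": 8},
    "pronoun": {"front": 1, "during": -1, "rear": 13},
}


def link_occurrences(paragraphs):
    """Chain occurrences of the same term that lie within MAX_GAP paragraphs.

    Returns a list of (term_type, first_paragraph, last_paragraph) links.
    """
    positions = defaultdict(list)
    for idx, terms in enumerate(paragraphs):
        for term, kind in terms:
            positions[(term, kind)].append(idx)

    links = []
    for (_, kind), idxs in positions.items():
        start = prev = idxs[0]
        for idx in idxs[1:]:
            if idx - prev > MAX_GAP[kind]:
                # Gap too large: close the current link and start a new one.
                links.append((kind, start, prev))
                start = idx
            prev = idx
        links.append((kind, start, prev))
    return links


def segment_scores(paragraphs):
    """Score each paragraph from its links, then shift scores to sum to zero."""
    n = len(paragraphs)
    scores = [0.0] * n
    for kind, start, end in link_occurrences(paragraphs):
        w = WEIGHTS[kind]
        scores[start] += w["front"]          # a link begins here
        for i in range(start + 1, end + 1):
            scores[i] += w["during"]         # the link continues through here
        if end + 1 < n:
            scores[end + 1] += w["rear"]     # a link ended just before here
    mean = sum(scores) / n
    return [s - mean for s in scores]


# Toy input: one list of (term, type) pairs per paragraph.
example = [
    [("Smith", "proper"), ("bank", "common")],
    [("Smith", "proper"), ("he", "pronoun")],
    [("bank", "common")],
    [("rain", "common")],
    [("rain", "common"), ("it", "pronoun")],
]
normalized = segment_scores(example)
# Paragraphs with a positive normalized score are boundary candidates.
boundaries = [i for i, s in enumerate(normalized) if s > 0]
```

    In this sketch a paragraph scores high when many links start at it or have just ended before it, and low when links run through it, so positive values after normalization mark likely segment starts. A real implementation would need a part-of-speech tagger or noun-phrase extractor to produce the typed terms.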
