Document Key Phrase Extraction Method
    1.
    发明申请
    Document Key Phrase Extraction Method 有权
    文献关键短语提取方法

    公开(公告)号:US20120047149A1

    公开(公告)日:2012-02-23

    申请号:US13264806

    申请日:2009-05-12

    IPC分类号: G06F17/30

    摘要: A computer-implemented method of extracting key phrases from a document is disclosed comprising the steps of accessing a repository comprising linked subjects, the repository comprising first and second data structures representing the relationship between said subjects using different representation criteria; pruning the first data structure by removing links between subjects based on a further relationship between said subjects in the second data structure; matching phrases in said document to subjects in the pruned first data structure; further pruning the pruned first data structure by removing unmatched subjects that are not linked to matched subjects; determining a ranking for each matched subject; and selecting key phrases using the determined subject rankings. A computer program for implementing the steps of this method when executed on a computer is also disclosed.

    摘要翻译: 公开了一种从文档中提取关键短语的计算机实现的方法,包括以下步骤:访问包含链接对象的存储库,所述存储库包括表示使用不同表示标准的所述对象之间的关系的第一和第二数据结构; 基于第二数据结构中的所述对象之间的进一步的关系,通过去除主体之间的链接来修剪第一数据结构; 将所述文档中的短语与修剪的第一数据结构中的对象匹配; 通过删除与匹配对象无关的不匹配的主题,进一步修剪已修剪的第一个数据结构; 确定每个匹配对象的排名; 并使用确定的受试者排名选择关键短语。 还公开了一种用于在计算机上执行时实现该方法的步骤的计算机程序。

    Apparatus and method for text extraction
    2.
    发明授权
    Apparatus and method for text extraction 有权
    文本提取的装置和方法

    公开(公告)号:US08924846B2

    公开(公告)日:2014-12-30

    申请号:US13258464

    申请日:2009-07-03

    IPC分类号: G06F17/22

    CPC分类号: G06F17/2241

    摘要: A method of determining main text in a mark-up document is provided, which comprises determining a length of each paragraph in the mark-up document; and determining one or more main paragraphs of the mark-up document based upon the length of the paragraphs in the mark-up document.

    摘要翻译: 提供了一种确定标记文档中的主要文本的方法,其包括确定标记文档中每个段落的长度; 并且基于标记文档中的段落的长度来确定标记文档的一个或多个主要段落。

    Document key phrase extraction method
    3.
    发明授权
    Document key phrase extraction method 有权
    文献关键词提取方法

    公开(公告)号:US08935260B2

    公开(公告)日:2015-01-13

    申请号:US13264806

    申请日:2009-05-12

    IPC分类号: G06F17/30 G06F17/27

    摘要: A computer-implemented method of extracting key phrases from a document is disclosed comprising the steps of accessing a repository comprising linked subjects, the repository comprising first and second data structures representing the relationship between said subjects using different representation criteria; pruning the first data structure by removing links between subjects based on a further relationship between said subjects in the second data structure; matching phrases in said document to subjects in the pruned first data structure; further pruning the pruned first data structure by removing unmatched subjects that are not linked to matched subjects; determining a ranking for each matched subject; and selecting key phrases using the determined subject rankings. A computer program for implementing the steps of this method when executed on a computer is also disclosed.

    摘要翻译: 公开了一种从文档中提取关键短语的计算机实现的方法,包括以下步骤:访问包含链接对象的存储库,所述存储库包括表示使用不同表示标准的所述对象之间的关系的第一和第二数据结构; 基于所述第二数据结构中的所述对象之间的进一步关系,通过去除主体之间的链接来修剪第一数据结构; 将所述文档中的短语与修剪的第一数据结构中的对象匹配; 通过删除与匹配对象无关的不匹配的主题,进一步修剪已修剪的第一个数据结构; 确定每个匹配对象的排名; 并使用确定的受试者排名选择关键短语。 还公开了一种用于在计算机上执行时实现该方法的步骤的计算机程序。

    Apparatus and Method for Text Extraction
    4.
    发明申请
    Apparatus and Method for Text Extraction 有权
    文本提取的装置和方法

    公开(公告)号:US20120066587A1

    公开(公告)日:2012-03-15

    申请号:US13258464

    申请日:2009-07-03

    IPC分类号: G06F17/21

    CPC分类号: G06F17/2241

    摘要: A method of determining main text in a mark-up document is provided, which comprises determining a length of each paragraph in the mark-up document; and determining one or more main paragraphs of the mark-up document based upon the length of the paragraphs in the mark-up document.

    摘要翻译: 提供了一种确定标记文档中的主要文本的方法,其包括确定标记文档中每个段落的长度; 并且基于标记文档中的段落的长度来确定标记文档的一个或多个主要段落。

    HANDWRITTEN CHARACTER FONT LIBRARY
    7.
    发明申请
    HANDWRITTEN CHARACTER FONT LIBRARY 审中-公开
    手写字符字体图书馆

    公开(公告)号:US20130181995A1

    公开(公告)日:2013-07-18

    申请号:US13825323

    申请日:2010-09-21

    IPC分类号: G06T11/60

    CPC分类号: G06T11/60 G06F17/214

    摘要: Embodiments of the present disclosure may include methods, systems, and machine readable and executable instructions and/or logic. An example method for creating a handwritten character font library can include receiving a set of standard characters to a computing device, and deriving a group of character components from the initial set of characters. A subset of characters is selected from the set of standard characters, the subset collectively including substantially all the group of character components. Handwritten characters corresponding to the subset of characters are received to the computing device, and handwritten character components are extracted from the hand written characters corresponding to the group of character components. A set of handwritten characters is then constructed from the received handwritten characters and/or the handwritten character components.

    摘要翻译: 本公开的实施例可以包括方法,系统和机器可读和可执行指令和/或逻辑。 用于创建手写字符字体库的示例性方法可以包括:向计算设备接收一组标准字符,以及从初始字符集中导出一组字符组件。 从一组标准字符中选择字符的子集,该子集共同地包括基本上所有这些字符组分组。 对应于字符子集的手写字符被接收到计算设备,并且从与该组字符组件相对应的手写字符中提取手写字符分量。 然后从接收到的手写字符和/或手写字符组件构建一组手写字符。