System and method for providing orientation into digital information
    1.
    发明公开
    System and method for providing orientation into digital information 审中-公开
    系统和方法提供定向成数字信息

    公开(公告)号:EP2048606A2

    公开(公告)日:2009-04-15

    申请号:EP08166315.5

    申请日:2008-10-10

    IPC分类号: G06N5/02 G06N1/00

    摘要: A system and method for providing orientation into digital information is provided. A plurality of evergreen indexes for subject areas are maintained. The evergreen indexes include digital information and are each organized by topics that include a topic model matched to the digital information. A user interest within the digital information is determined. The topic models for the evergreen indexes are evaluated against the user interest and those topics models that best match the user interest are identified. Access to the digital information is provided via at least one of the topic models in at least one of the evergreen indexes.

    System and method for providing robust topic identification in social indexes
    3.
    发明公开
    System and method for providing robust topic identification in social indexes 有权
    用于在社交索引中提供健壮主题识别的系统和方法

    公开(公告)号:EP2192500A3

    公开(公告)日:2010-09-29

    申请号:EP09175873.0

    申请日:2009-11-13

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30

    摘要: A computer-implemented method for providing robust topic identification in social indexes is described. Electronically-stored articles and one or more indexes are maintained. Each index includes topics that each relate to one or more of the articles. A random sampling and a selective sampling of the articles are both selected. For each topic, characteristic words included in the articles in each of the random sampling and the selective sampling are identified. Frequencies of occurrence of the characteristic words in each of the random sampling and the selective sampling are determined. A ratio of the frequencies of occurrence for the characteristic words included in the random sampling and the selective sampling is identified. Finally, for each topic, a coarse-grained topic model is built, which includes the characteristic words included in the articles relating to the topic and scores assigned to those characteristic words.

    摘要翻译: 描述了用于在社交索引中提供健壮主题识别的计算机实现的方法。 电子储存的物品和一个或多个索引被维护。 每个索引都包含每个与一篇或多篇文章相关的主题。 随机抽样和文章的选择性抽样都被选中。 对于每个主题,识别包括在每个随机抽样和选择性抽样中的文章中的特征词。 确定每个随机采样和选择性采样中的特征词的出现频率。 识别包含在随机采样和选择性采样中的特征词的出现频率的比率。 最后,为每个主题建立一个粗粒度的主题模型,其中包括与主题相关的文章中包含的特征词以及分配给这些特征词的分数。

    System and method for providing default hierarchical training for social indexing
    4.
    发明公开
    System and method for providing default hierarchical training for social indexing 有权
    系统和方法提供社会索引默认层次培训

    公开(公告)号:EP2211280A2

    公开(公告)日:2010-07-28

    申请号:EP10150704.4

    申请日:2010-01-14

    IPC分类号: G06F17/30

    CPC分类号: G06Q10/10

    摘要: A system and method for providing default hierarchical training for social indexing is provided. Articles of digital information for social indexing are maintained. A hierarchically-structured tree of topics is specified. Each topic includes a label that includes one or more words. Constraints inherent in the literal structure of the topic tree are identified. For each topic in the topic tree, a topic model that includes at least one term derived from the words in at least one of the labels is created. The topic models for the topic tree are evaluated against the constraints. Those of the topic models, which best satisfy the constraints are identified.

    摘要翻译: 提供了一种用于提供社交索引默认层次人才培养模式与方法。 对社会索引数字信息的文章得以维持。 主题的分层结构树中指定。 每个主题都包含一个标签确实包含一个或多个单词。 内在约束在主题树的字面结构鉴定。 对于在主题树的每个主题,主题模型确实包括从词语的标签至少有一个衍生的至少一个词被创建。 对于主题树主题模型评估针对限制。 这些话题模型,最好的满足约束条件的标识。

    System and method for performing discovery of digital information in a subject area
    5.
    发明公开
    System and method for performing discovery of digital information in a subject area 有权
    系统和方法的学科领域进行的数字信息发现

    公开(公告)号:EP2048605A2

    公开(公告)日:2009-04-15

    申请号:EP08166314.8

    申请日:2008-10-10

    发明人: Stefik, Mark J.

    IPC分类号: G06N5/02 G06N1/00

    摘要: A system and method for performing discovery of digital information in a subject area is provided. Each of topics in a subject area, training material for the topics, and a corpus comprising digital information are designated. Topic models for each of the topics are built. The topic models are evaluated against the training material. The digital information from the corpus is organized by the topics using the topic models into an evergreen index.

    摘要翻译: 提供了一种用于在受试者中区域中执行的数字信息发现系统和方法。 在每一个学科领域的话题,锻炼材质为主题,并语料库包括数字信息被指定。 对于每个题目的主题模型构建。 主题模型评估对训练材料。 从语料库的数字信息是通过使用主题模型成常青指数的主题组织。

    System and method for prospecting digital information
    6.
    发明授权
    System and method for prospecting digital information 有权
    探测数字信息的系统和方法

    公开(公告)号:EP2048607B1

    公开(公告)日:2018-02-21

    申请号:EP08166316.3

    申请日:2008-10-10

    发明人: Stefik, Mark J.

    IPC分类号: G06N5/02 G06F17/30

    摘要: A system and method for prospecting digital information is provided. A home evergreen index for a home subject area within a corpus of digital information is maintained and includes topic models matched to the corpus. A frontier evergreen index for a frontier subject area within the corpus topically distinct from the home subject area is identified. Quality assessments for frontier articles from the corpus identified by the topic models of the frontier evergreen index are obtained. The frontier articles with positive quality assessments are reclassified against the topic models in the home evergreen index. The frontier articles are provided in a display with home articles previously classified against the topic models in the home evergreen index.

    System and method for managing user attention by detecting hot and cold topics in social indexes
    8.
    发明公开
    System and method for managing user attention by detecting hot and cold topics in social indexes 审中-公开
    系统和方法在社会指标检测电流和非当前主题管理用户关注

    公开(公告)号:EP2211282A3

    公开(公告)日:2011-05-18

    申请号:EP10150706.9

    申请日:2010-01-14

    IPC分类号: G06F17/30

    摘要: A system and method for managing user attention by detecting hot topics in social indexes is provided. Articles of digital information and at least one social index are maintained. The social index includes topics that each relate to one or more of the articles. Topic models matched to the digital information are retrieved for each topic. The articles are classified under the topics using the topic models. Each of the topics in the social index is evaluated for hotness. A plurality of time periods projected from the present is defined. Counts of the articles appearing under each time period are evaluated. The topics exhibiting a rising curve in the count of the articles that increases with recency during the time periods are chosen. Quality of the articles within the topics chosen is analyzed, The topics including the articles having acceptable quality are presented.

    System and method for providing default hierarchical training for social indexing
    9.
    发明公开
    System and method for providing default hierarchical training for social indexing 有权
    系统和方法提供社会索引默认层次培训

    公开(公告)号:EP2211280A3

    公开(公告)日:2011-02-02

    申请号:EP10150704.4

    申请日:2010-01-14

    IPC分类号: G06F17/30

    CPC分类号: G06Q10/10

    摘要: A system and method for providing default hierarchical training for social indexing is provided. Articles of digital information for social indexing are maintained. A hierarchically-structured tree of topics is specified. Each topic includes a label that includes one or more words. Constraints inherent in the literal structure of the topic tree are identified. For each topic in the topic tree, a topic model that includes at least one term derived from the words in at least one of the labels is created. The topic models for the topic tree are evaluated against the constraints. Those of the topic models, which best satisfy the constraints are identified.

    System and method for managing user attention by detecting hot and cold topics in social indexes
    10.
    发明公开
    System and method for managing user attention by detecting hot and cold topics in social indexes 审中-公开
    系统和方法在社会指标检测电流和非当前主题管理用户关注

    公开(公告)号:EP2211282A2

    公开(公告)日:2010-07-28

    申请号:EP10150706.9

    申请日:2010-01-14

    IPC分类号: G06F17/30

    摘要: A system and method for managing user attention by detecting hot topics in social indexes is provided. Articles of digital information and at least one social index are maintained. The social index includes topics that each relate to one or more of the articles. Topic models matched to the digital information are retrieved for each topic. The articles are classified under the topics using the topic models. Each of the topics in the social index is evaluated for hotness. A plurality of time periods projected from the present is defined. Counts of the articles appearing under each time period are evaluated. The topics exhibiting a rising curve in the count of the articles that increases with recency during the time periods are chosen. Quality of the articles within the topics chosen is analyzed, The topics including the articles having acceptable quality are presented.

    摘要翻译: 提供了一种通过检测指标的社会热点问题管理用户关注的系统和方法。 数字信息和至少一个社交索引文章得以维持。 社会指标包括主题做了各自与一个或一个以上的文章。 匹配的数字信息主题模型,每个主题检索。 这些文章在使用主题模型的主题分类。 每一个在社会指数的主题是评估的辣味。 从本投影时间段的多个定义。 在每个时间段露面的文章计数进行评估。 参展的物品的数量上升曲线的主题并在时间段新近度的增加被选中。 选择的被分解的主题内的物品的质量,包括具有可接受的质量的条款中的主题被呈现。