Automated analysis and summarization of comments in survey response data
    1.
    Granted patent
    Legal status: in force

    Publication No.: US08577884B2

    Publication date: 2013-11-05

    Application No.: US12119697

    Filing date: 2008-05-13

    IPC classification: G06F17/30

    CPC classification: G06Q30/02

    Abstract: Technologies are described herein for providing automated analysis and summarization of free-form comments in survey response data. A number of topic words are identified from the survey response comments, and a numeric weight is calculated for each topic word that reflects the relevance of the topic word to each comment. Each topic word is associated with one or more topics, and the comments relevant to each topic are then determined based on the weights of the associated topic words in each comment. A report is generated which summarizes the topics and their relative importance in the survey response comments based upon the number of comments relevant to each.
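
    Illustrative sketch (not taken from the patent): the abstract does not say how the weights are computed, so this example assumes plain TF-IDF as the per-comment weight and takes the topic-to-topic-word mapping as a given input. The names tfidf_weights, summarize, and the threshold value are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the comment and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def tfidf_weights(comments):
    """Return one {word: weight} dict per comment (TF-IDF is an assumption)."""
    docs = [Counter(tokenize(c)) for c in comments]
    n = len(docs)
    # Document frequency: in how many comments each word appears.
    df = Counter(word for doc in docs for word in doc)
    weights = []
    for doc in docs:
        total = sum(doc.values()) or 1
        weights.append(
            {w: (tf / total) * math.log(n / df[w]) for w, tf in doc.items()}
        )
    return weights


def summarize(comments, topics, threshold=0.05):
    """Count the comments relevant to each topic, most frequent topic first.

    topics maps a topic name to its associated topic words (hypothetical input).
    A comment counts as relevant when the summed weight of the topic's words
    in that comment exceeds the threshold.
    """
    weights = tfidf_weights(comments)
    counts = {}
    for topic, words in topics.items():
        counts[topic] = sum(
            1
            for comment_weights in weights
            if sum(comment_weights.get(word, 0.0) for word in words) > threshold
        )
    return sorted(counts.items(), key=lambda item: item[1], reverse=True)
```

    Calling summarize(comments, topics) returns (topic, comment count) pairs, which corresponds to the report of relative topic importance described in the abstract.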

    Automated Analysis and Summarization of Comments in Survey Response Data
    2.
    Patent application
    Legal status: in force

    Publication No.: US20090287642A1

    Publication date: 2009-11-19

    Application No.: US12119697

    Filing date: 2008-05-13

    IPC classification: G06F7/06 G06F17/30

    CPC classification: G06Q30/02

    Abstract: Technologies are described herein for providing automated analysis and summarization of free-form comments in survey response data. A number of topic words are identified from the survey response comments, and a numeric weight is calculated for each topic word that reflects the relevance of the topic word to each comment. Each topic word is associated with one or more topics, and the comments relevant to each topic are then determined based on the weights of the associated topic words in each comment. A report is generated which summarizes the topics and their relative importance in the survey response comments based upon the number of comments relevant to each.

    Method and apparatus for constructing a query based upon concepts associated with one or more search terms
    5.
    Granted patent
    Legal status: in force

    Publication No.: US09589053B1

    Publication date: 2017-03-07

    Application No.: US12971799

    Filing date: 2010-12-17

    IPC classification: G06F17/30 G06Q30/02

    CPC classification: G06F17/30864 G06Q30/02

    Abstract: A method and apparatus are provided to efficiently generate a fulsome query in order to increase the recall and/or precision provided by the search. A method may construct a query by receiving one or more initial search terms and then defining a concept for each search term. In order to define a concept, the method may determine whether a concept associated with a respective search term has been previously defined. In an instance in which a concept associated with a respective search term has been previously defined, the method at least initially utilizes the previously defined concept. However, in an instance in which a concept associated with a respective search term has not been previously defined, the method constructs the concept based on terms related to the respective search term. The method may then combine the concepts defined for the one or more search terms to generate the query.
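
    Illustrative sketch (not taken from the patent): the reuse-or-construct flow from the abstract, assuming a concept is simply a list of terms. The function build_query, the concept_store dictionary, the related_terms callable, and the choice to join terms with OR inside a concept and AND across concepts are all assumptions made for this example.

```python
def build_query(search_terms, concept_store, related_terms):
    """Combine one concept per search term into a boolean query string.

    concept_store: dict of previously defined concepts (term -> list of terms).
    related_terms: callable returning terms related to a search term
                   (a hypothetical stand-in for a thesaurus or similar source).
    """
    concepts = []
    for term in search_terms:
        if term not in concept_store:
            # No previously defined concept: construct one from related terms.
            concept_store[term] = [term] + [
                t for t in related_terms(term) if t != term
            ]
        # Otherwise the previously defined concept is reused as-is.
        concepts.append(concept_store[term])

    # Assumed combination rule: OR inside a concept, AND between concepts.
    groups = [
        "(" + " OR ".join(f'"{t}"' for t in concept) + ")" for concept in concepts
    ]
    return " AND ".join(groups)
```

    A newly constructed concept is stored in concept_store, so a later query containing the same search term takes the "previously defined" branch.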

    Streaming text data mining method & apparatus using multidimensional subspaces
    7.
    Patent application
    Legal status: in force

    Publication No.: US20070083509A1

    Publication date: 2007-04-12

    Application No.: US11246195

    Filing date: 2005-10-11

    IPC classification: G06F17/30

    CPC classification: G06F17/30705 G06F17/30616

    Abstract: A streaming text data comparator performs real-time text data mining on streaming text data. The comparator receives a streaming text data document and generates a vector representation of the term frequencies relating to an existing document collection. The comparator then transforms the term frequency vector into a projection in a precomputed multidimensional subspace that represents the original document collection. The comparator further calculates a relationship value representing the similarities or differences between the vector representation and the subspace, and compares the relationship value to a predetermined threshold to determine whether the streaming text data document is related to the original document collection. If the streaming text data document is related, the streaming text data comparator intercalates the new document into the document collection. If the new document is not related, the comparator may store or delete the unrelated document.
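
    Illustrative sketch (not taken from the patent): the project-and-compare step, assuming the subspace is the span of the collection's term-frequency vectors (built here with Gram-Schmidt as a stand-in for whatever decomposition the patent uses) and assuming the relationship value is the fraction of the new vector's length that survives projection. All function names and the 0.8 threshold are hypothetical.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def orthonormal_basis(doc_vectors, eps=1e-12):
    """Gram-Schmidt: orthonormal basis spanning the collection's vectors."""
    basis = []
    for vec in doc_vectors:
        residual = list(vec)
        for b in basis:
            coeff = dot(residual, b)
            residual = [r - coeff * x for r, x in zip(residual, b)]
        norm = math.sqrt(dot(residual, residual))
        if norm > eps:  # skip vectors already inside the current span
            basis.append([r / norm for r in residual])
    return basis


def term_frequencies(tokens, vocabulary):
    """Term-frequency vector of a document over the collection's vocabulary."""
    return [float(tokens.count(term)) for term in vocabulary]


def relationship_value(vec, basis):
    """Length of the projection onto the subspace divided by the vector length."""
    norm = math.sqrt(dot(vec, vec))
    if norm == 0.0:
        return 0.0
    coords = [dot(vec, b) for b in basis]
    return math.sqrt(dot(coords, coords)) / norm


def process(tokens, vocabulary, basis, collection, threshold=0.8):
    """Add the document to the collection if it is related; report the decision."""
    vec = term_frequencies(tokens, vocabulary)
    related = relationship_value(vec, basis) >= threshold
    if related:
        collection.append(vec)
    return related
```

    The basis is computed once and reused for every incoming document, which mirrors the "precomputed" subspace in the abstract; this sketch does not recompute it after a related document is added.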

    Text differentiation methods, systems, and computer program products for content analysis
    8.
    Patent application
    Legal status: in force

    Publication No.: US20070022072A1

    Publication date: 2007-01-25

    Application No.: US11173600

    Filing date: 2005-07-01

    IPC classification: G06N5/00 G06F17/00

    CPC classification: G06F17/2211 G06F17/30719

    Abstract: Provided are improved methods, apparatus, and computer program products for text differentiation, which involves identifying differences between documents with similar content, not merely similar terms, and generating results. Text differentiation provides the ability to find non-similar, or different, content hidden within documents with similar overall content, but not exactly the same content. Text differentiation may be used to quickly identify key differences between similar documents.

    Large-scale visualization of temporal data
    9.
    Patent application
    Legal status: under examination (published)

    Publication No.: US20050177540A1

    Publication date: 2005-08-11

    Application No.: US10769066

    Filing date: 2004-01-30

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F16/2477 G06F16/248

    Abstract: Methods, computer-readable media, and systems for representing data associable with intervals are provided. A frame is associated with each of a number of intervals in a period. The frame is configured to display a maximum number of points. A first number of points representative of a first data quantity associable with each interval is determined, wherein a proportion of the first number of points to the maximum number of points represents a relative magnitude of the first data quantity. The first number of points is contiguously displayed in the frame for each of the intervals. Additional numbers of points may also be displayed to represent a relative magnitude of additional data quantities associable with each interval.
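
    Illustrative sketch (not taken from the patent): the proportional point count from the abstract, rendered as small text frames. It assumes each interval's quantity is scaled against the largest quantity in the period, and that "contiguously displayed" means filling cells row by row from the top-left. The names points_for and render_frames and the default frame size are hypothetical.

```python
def points_for(value, max_value, max_points):
    """Point count whose share of max_points mirrors value / max_value."""
    if max_value <= 0:
        return 0
    scaled = round(max_points * value / max_value)
    return min(max_points, max(0, scaled))


def render_frames(values, max_points=20, columns=5):
    """Return one text frame per interval; '#' is a point, '.' an empty cell."""
    # Assumption: quantities are scaled against the peak value in the period.
    peak = max(values, default=0)
    frames = []
    for value in values:
        filled = points_for(value, peak, max_points)
        # Points are contiguous: all filled cells come before the empty ones.
        cells = "#" * filled + "." * (max_points - filled)
        rows = [cells[i:i + columns] for i in range(0, max_points, columns)]
        frames.append("\n".join(rows))
    return frames
```

    Printing the frames side by side, or one per line of a calendar-style grid, gives a compact view of how the quantity varies across intervals.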

    Methods and framework for constraint-based activity mining (CMAP)
    10.
    Granted patent
    Legal status: in force

    Publication No.: US08046322B2

    Publication date: 2011-10-25

    Application No.: US11835225

    Filing date: 2007-08-07

    IPC classification: G06F17/30 G06F17/00

    CPC classification: G06F17/30539

    Abstract: A method of mining data to discover activity patterns within the data is described. The method includes receiving data to be mined from at least one data source, determining which of a number of specified interests and constraints are associated with the mining process, selecting corresponding mining agents that combine search algorithms with propagators from the specified constraints, and finding any activity patterns that meet the specified interests and constraints.
