Document analysis device, document analysis method, and computer readable recording medium
    1.
    发明授权
    Document analysis device, document analysis method, and computer readable recording medium 有权
    文件分析装置,文件分析方法和计算机可读记录介质

    公开(公告)号:US09104761B2

    公开(公告)日:2015-08-11

    申请号:US13511918

    申请日:2010-11-08

    IPC分类号: G06F17/30

    摘要: A document analysis device (1) comprises a common assessment information selection unit (90) and an event impact analysis unit (100). The common assessment information selection unit (90) identifies information that matches second assessment information that appears in event-related documents which include descriptions concerning a designated specific event, from among first assessment information that appears in documents for analysis which include descriptions relating to items for analysis, and classifies the information thus identified as common assessment information. The event impact analysis unit (100) counts the number of times the common assessment information appears in the documents that are generated prior to the event occurring, and the number of times the common assessment information appears in the documents for analysis that are generated subsequent to the event occurring, and derives an index that denotes the impact of the specific event upon the documents for analysis, on the basis of the results of the counts thereupon.

    摘要翻译: 文件分析装置(1)包括公共评估信息选择单元(90)和事件影响分析单元(100)。 公共评估信息选择单元(90)从出现在用于分析的文档中的第一评估信息之中,识别与包括关于指定的特定事件的描述的事件相关文档中出现的第二评估信息相匹配的信息,其包括与 分析,并将所确定的信息分类为常见评估信息。 事件影响分析单元(100)计算在事件发生之前生成的文档中出现共同评估信息的次数以及在分析之后生成的用于分析的文档中出现的共同评估信息的次数 事件发生,并根据其上的计数结果,得出一个表示特定事件对分析文件的影响的指数。

    DOCUMENT ANALYSIS DEVICE, DOCUMENT ANALYSIS METHOD, AND COMPUTER READABLE RECORDING MEDIUM
    2.
    发明申请
    DOCUMENT ANALYSIS DEVICE, DOCUMENT ANALYSIS METHOD, AND COMPUTER READABLE RECORDING MEDIUM 有权
    文件分析装置,文件分析方法和计算机可读记录介质

    公开(公告)号:US20120278327A1

    公开(公告)日:2012-11-01

    申请号:US13511918

    申请日:2010-11-08

    IPC分类号: G06F17/30

    摘要: A document analysis device (1) comprises a common assessment information selection unit (90) and an event impact analysis unit (100). The common assessment information selection unit (90) identifies information that matches second assessment information that appears in event-related documents which include descriptions concerning a designated specific event, from among first assessment information that appears in documents for analysis which include descriptions relating to items for analysis, and classifies the information thus identified as common assessment information. The event impact analysis unit (100) counts the number of times the common assessment information appears in the documents that are generated prior to the event occurring, and the number of times the common assessment information appears in the documents for analysis that are generated subsequent to the event occurring, and derives an index that denotes the impact of the specific event upon the documents for analysis, on the basis of the results of the counts thereupon.

    摘要翻译: 文件分析装置(1)包括公共评估信息选择单元(90)和事件影响分析单元(100)。 公共评估信息选择单元(90)从出现在用于分析的文档中的第一评估信息之中,识别与包括关于指定的特定事件的描述的事件相关文档中出现的第二评估信息相匹配的信息,其包括与 分析,并将所确定的信息分类为常见评估信息。 事件影响分析单元(100)计算在事件发生之前生成的文档中出现共同评估信息的次数以及在分析之后生成的用于分析的文档中出现的共同评估信息的次数 事件发生,并根据其上的计数结果,得出一个表示特定事件对分析文件的影响的指数。

    INFORMATION ANALYSIS APPARATUS, INFORMATION ANALYSIS METHOD, AND PROGRAM
    3.
    发明申请
    INFORMATION ANALYSIS APPARATUS, INFORMATION ANALYSIS METHOD, AND PROGRAM 审中-公开
    信息分析装置,信息分析方法和程序

    公开(公告)号:US20110153601A1

    公开(公告)日:2011-06-23

    申请号:US13060572

    申请日:2009-09-18

    IPC分类号: G06F17/30

    CPC分类号: G06F16/3347

    摘要: An information analysis apparatus 1 that executes information analysis on a document set including documents to which time information is attached, the apparatus includes: a corresponding section selection unit 30 that mutually compares a plurality of time-series data generated and selects two or more sections that change corresponding to each of two or more sections of another time-series data from each time-series data; a feature extraction unit 40 that extracts features from the documents belonging to the selected two or more sections; a comparison unit 50 that acquires, from extracted features, an inter-feature distance of the selected one section and another section, and mutually compares the inter-feature distances of each of the time-series data; and a correlation degree calculation unit 70 that calculates a degree of correlation between the document sets based on the comparison result.

    摘要翻译: 一种信息分析装置1,对包含附加了时间信息的文件的文件集执行信息分析,该装置包括:相应部分选择单元30,其相互比较生成的多个时间序列数据,并选择两个或多个部分, 对应于来自每个时间序列数据的另一时间序列数据的两个或更多个部分中的每一个的变化; 特征提取单元40,从属于所选择的两个或多个部分的文档中提取特征; 比较单元50,从提取的特征获取所选择的一个部分和另一个部分的特征间距离,并且相互比较每个时间序列数据的特征间距离; 以及相关度计算单元70,其基于比较结果来计算文档集合之间的相关度。

    Reputation analysis system and reputation analysis method
    4.
    发明授权
    Reputation analysis system and reputation analysis method 有权
    声誉分析系统和信誉分析方法

    公开(公告)号:US09245023B2

    公开(公告)日:2016-01-26

    申请号:US13511099

    申请日:2010-11-15

    IPC分类号: G06F17/30 G06Q30/02 G06Q10/10

    摘要: Described are a reputation analysis device, reputation analysis method, and reputation analysis-use program capable of suitably analyzing temporal changes in reputation for an object indicated by a keyword. The disclosed reputation analysis device is provided with a voluntary activity description extraction means for extracting descriptions representing voluntary activity related to an object indicated by a keyword that has been input from within a plurality of documents; and a reputation chronological data estimation means for counting the number of occurrences of voluntary activity at each time point wherein the voluntary activity expressed by a description representing the voluntary activity related to the object has been performed, and estimating reputation chronological data for chronologically representing evaluations for the object by the agents of the voluntary activity.

    摘要翻译: 描述了能够适当地分析由关键字指示的对象的信誉的时间变化的信誉分析装置,信誉分析方法和信誉分析用途程序。 所公开的信誉分析装置设置有自愿活动描述提取装置,用于提取表示与由多个文档中输入的关键字指示的对象相关的自愿活动的描述; 以及声誉时间数据估计装置,用于计算在每个时间点的自愿活动的发生次数,其中由表示与该对象相关的自愿活动的描述表示的自愿活动已经被执行,以及根据时间顺序表示评估的信誉时间数据, 对象由代理人进行志愿活动。

    REPUTATION ANALYSIS SYSTEM AND REPUTATION ANALYSIS METHOD
    5.
    发明申请
    REPUTATION ANALYSIS SYSTEM AND REPUTATION ANALYSIS METHOD 有权
    报告分析系统和报告分析方法

    公开(公告)号:US20120239665A1

    公开(公告)日:2012-09-20

    申请号:US13511099

    申请日:2010-11-15

    IPC分类号: G06F17/30

    摘要: Described are a reputation analysis device, reputation analysis method, and reputation analysis-use program capable of suitably analyzing temporal changes in reputation for an object indicated by a keyword. The disclosed reputation analysis device is provided with a voluntary activity description extraction means for extracting descriptions representing voluntary activity related to an object indicated by a keyword that has been input from within a plurality of documents; and a reputation chronological data estimation means for counting the number of occurrences of voluntary activity at each time point wherein the voluntary activity expressed by a description representing the voluntary activity related to the object has been performed, and estimating reputation chronological data for chronologically representing evaluations for the object by the agents of the voluntary activity.

    摘要翻译: 描述了能够适当地分析由关键字指示的对象的信誉的时间变化的信誉分析装置,信誉分析方法和信誉分析用途程序。 所公开的信誉分析装置设置有自愿活动描述提取装置,用于提取表示与由多个文档中输入的关键字指示的对象相关的自愿活动的描述; 以及声誉时间数据估计装置,用于计算在每个时间点的自愿活动的发生次数,其中由表示与该对象相关的自愿活动的描述表示的自愿活动已经被执行,并且根据时间顺序表示评估的信誉时间数据, 对象由代理人进行志愿活动。

    TIME-SERIES DOCUMENT SUMMARIZATION DEVICE, TIME-SERIES DOCUMENT SUMMARIZATION METHOD AND COMPUTER-READABLE RECORDING MEDIUM
    6.
    发明申请
    TIME-SERIES DOCUMENT SUMMARIZATION DEVICE, TIME-SERIES DOCUMENT SUMMARIZATION METHOD AND COMPUTER-READABLE RECORDING MEDIUM 审中-公开
    时间序列文件概述装置,时间序列文件概要方法和计算机可读记录介质

    公开(公告)号:US20130311471A1

    公开(公告)日:2013-11-21

    申请号:US13982523

    申请日:2011-12-09

    IPC分类号: G06F17/30

    CPC分类号: G06F16/35 G06F16/345

    摘要: A time-series document summarization (201) device outputs a summary sentence of a document-of-interest collection that is a document collection to be an object. A time-series document summarization (201) comprises: a background topic word extraction part (20) configured to acquire a set of the document-of-interest collection and a document-of-interest topic word that is a feature word of the document-of-interest collection, and a reference-use document collection that is a document collection different from the document-of-interest collection, and extract a background topic word representing a topic to be a background of a topic described in the document-of-interest collection from the reference-use document collection; and a representative character string extraction part (30) configured to extract a representative character string including the document-of-interest topic word and the background topic word as a summary sentence of the document-of-interest collection from among character strings included in the document-of-interest collection.

    摘要翻译: 时间序列文件摘要(201)设备输出作为对象的文档集合的兴趣文档集合的摘要句子。 时间序列文件摘要(201)包括:背景主题词提取部分(20),其被配置为获取一组所述文档感兴趣集合以及作为所述文档的特征词的感兴趣文档主题词 以及与利益文献收集不同的文档集合的参考使用文档集合,并且将表示主题的背景主题词提取为文档中描述的主题的背景 从参考使用文件收集的兴趣收集; 以及代表性字符串提取部(30),被配置为从包含在所述文档中的字符串中提取包含所述文档感兴趣主题词和所述背景主题词的代表性字符串作为所述文档感兴趣集合的汇总句。 利益文件收集。

    Information analyzing device, information analyzing method, information analyzing program, and search system
    7.
    发明授权
    Information analyzing device, information analyzing method, information analyzing program, and search system 有权
    信息分析装置,信息分析方法,信息分析程序和搜索系统

    公开(公告)号:US08606810B2

    公开(公告)日:2013-12-10

    申请号:US12864976

    申请日:2009-01-30

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/2785 G06F17/30684

    摘要: Provided are a related expression generation section that accepts an evaluation object expression, an linguistic expression to be evaluated, as input and generates a linguistic expression related to the evaluation object expression as a related expression; and a credibility calculation section that acquires the evaluation object expression and the related expression from a plurality of electronic documents along with time information and calculates credibility concerning the meaning of the evaluation object expression at a specific point in time by comparing the number of times that the acquired evaluation object expression appears and the number of times that the acquired related expression appears at the same time point.

    摘要翻译: 提供一种相关表达生成部,其接受评价对象表达式,待评价的语言表达式作为输入,并生成与评价对象表达相关的语言表达作为相关表达式; 以及可信度计算部,其与多个电子文档一起从时间信息中获取评价对象表达和相关表达,并且通过比较所述评估对象表达的次数,计算与特定时间点相关的评价对象表达的含义的可信度 获得的评估对象表达出现,并且所获取的相关表达在同一时间点出现的次数。

    DOCUMENT ANALYSIS APPARATUS, DOCUMENT ANALYSIS METHOD, AND COMPUTER-READABLE RECORDING MEDIUM
    8.
    发明申请
    DOCUMENT ANALYSIS APPARATUS, DOCUMENT ANALYSIS METHOD, AND COMPUTER-READABLE RECORDING MEDIUM 有权
    文件分析装置,文件分析方法和计算机可读记录介质

    公开(公告)号:US20120304055A1

    公开(公告)日:2012-11-29

    申请号:US13576669

    申请日:2011-01-25

    IPC分类号: G06F17/24

    CPC分类号: G06F17/30699

    摘要: A document analysis apparatus comprises: a feature expression acquisition unit acquiring a feature expression appearing during an attention period in an analysis object document collection; a document collection acquisition unit acquiring a feature expression containing document (FECD) collection in which a feature expression appears, from an analysis population including an analysis object document collection; a context determination unit specifying an analysis/FECD corresponding to an analysis object document among a FECD collection for every feature expression, and specifies a context in which the feature expression appeared in multiple analysis/FECDs; a context comparison determination unit specifying a non analysis/FECD not corresponding to an analysis object document among a FECD collection, and within that, compares a context in which the feature expression has appeared and a context specified previously; and a feature degree setting unit performing giving or the like of a feature degree to a feature expression from the comparison.

    摘要翻译: 文件分析装置包括:特征表达获取单元,获取在分析对象文档收集期间在注意期间出现的特征表达; 文档收集获取单元从包括分析对象文档集合的分析群体获取包含其中出现特征表达的文档(FECD)集合的特征表达式; 指定对于每个特征表达式的FECD集合中的与分析对象文档对应的分析/ FECD的上下文确定单元,并且指定在多个分析/ FECD中出现特征表达的上下文; 指定在FECD集合中与分析对象文档不对应的非分析/ FECD的上下文比较确定单元,并且在其中比较特征表达式出现的上下文和先前指定的上下文; 以及特征度设定单元,对来自所述比较的特征表达进行特征度的赋予等。

    Correlation of linguistic expressions in electronic documents with time information
    9.
    发明授权
    Correlation of linguistic expressions in electronic documents with time information 有权
    电子文件中的语言表达与时间信息的相关性

    公开(公告)号:US08612202B2

    公开(公告)日:2013-12-17

    申请号:US13057842

    申请日:2009-09-04

    IPC分类号: G06F17/20 G06F17/30

    摘要: An information analysis device includes a correlation value calculation unit which specifies the number of appearances of links between one linguistic expression and other linguistic expression and an appearance time of each link based on link information. The correlation value calculation unit calculates a correlation value between the one linguistic expression and the other linguistic expression according to a degree that the link continuously appears by using the specified number of appearances of the link and the appearance time of each link.

    摘要翻译: 信息分析装置包括相关值计算单元,其基于链接信息指定一个语言表达式和其他语言表达之间的链接的出现次数和每个链接的出现时间。 相关值计算单元根据链接连续出现的程度,通过使用指定的链接次数和每个链接的出现时间,来计算一个语言表达式和另一个语言表达式之间的相关值。

    INFORMATION ANALYSIS DEVICE, SEARCH SYSTEM, INFORMATION ANALYSIS METHOD, AND INFORMATION ANALYSIS PROGRAM
    10.
    发明申请
    INFORMATION ANALYSIS DEVICE, SEARCH SYSTEM, INFORMATION ANALYSIS METHOD, AND INFORMATION ANALYSIS PROGRAM 审中-公开
    信息分析设备,搜索系统,信息分析方法和信息分析程序

    公开(公告)号:US20100318526A1

    公开(公告)日:2010-12-16

    申请号:US12864780

    申请日:2009-01-30

    IPC分类号: G06F17/30

    CPC分类号: G06F17/2785

    摘要: Time-series data corresponding to an input linguistic expression to be analyzed is acquired, a relevant linguistic expression candidate which is highly relevant to the input linguistic expression is generated, time-series data corresponding to the relevant linguistic expression candidate generated is acquired, temporal correlation between the time-series data corresponding to the input linguistic expression and the time-series data corresponding to the relevant linguistic expression candidate is analyzed and a relevance level between the input linguistic expression and the relevant linguistic expression candidate generated is calculated using an analysis result of the time-series data.

    摘要翻译: 获取与要分析的输入语言表达式对应的时间序列数据,生成与输入语言表达高度相关的相关语言表达候选,获得与生成的相关语言表达候选对应的时间序列数据,时间相关 分析对应于输入语言表达式的时间序列数据与对应于相关语言表达候选的时间序列数据之间的相关性,并且使用分析结果计算输入语言表达与产生的相关语言表达候选者之间的相关性水平 时间序列数据。