INFORMATION ANALYSIS DEVICE, INFORMATION ANALYSIS METHOD, AND PROGRAM
    41.
    发明申请
    INFORMATION ANALYSIS DEVICE, INFORMATION ANALYSIS METHOD, AND PROGRAM 有权
    信息分析设备,信息分析方法和程序

    公开(公告)号:US20110137641A1

    公开(公告)日:2011-06-09

    申请号:US13057842

    申请日:2009-09-04

    IPC分类号: G06F17/27

    摘要: An information analysis device (1) uses a plurality of linguistic expressions as an analysis target, includes a link information generating unit (3) and a correlation value calculation unit (4). The link information generating unit (3) extracts time information included in each of a plurality of electronic documents including at least any one of the plurality of linguistic expressions and a relationship between the electronic documents in the plurality of electronic documents from the plurality of electronic documents, detects a link between one linguistic expression and another linguistic expression in the plurality of linguistic expressions and an appearance time of the link based on the extracted time information and the relationship between the electronic documents, and generates link information specifying the extracted link and the appearance time of the link. The correlation value calculation unit (4) specifies the number of appearances of links between the one linguistic expression and the other linguistic expression and an appearance time of each link based on the link information, and calculates a correlation value between the one linguistic expression and the other linguistic expression according to a degree that the link continuously appears by using the specified number of appearances of the link and the appearance time of each link.

    摘要翻译: 信息分析装置(1)使用多个语言表达作为分析对象,包括链接信息生成单元(3)和相关值计算单元(4)。 链接信息生成单元(3)从多个电子文档中提取包括多个语言表达中的至少一个的多个电子文档中的每一个以及多个电子文档中的电子文档之间的关系的时间信息 基于所提取的时间信息和电子文档之间的关系,检测多个语言表达中的一个语言表达与另一个语言表达之间的链接以及链接的出现时间,并且生成指定所提取的链接和外观的链接信息 链接的时间。 相关值计算单元(4)基于链接信息指定一个语言表达式和另一个语言表达式之间的链接的出现次数和每个链接的出现时间,并且计算一个语言表达式和 其他语言表达根据链接连续出现的程度,通过使用指定的链接次数和每个链接的出现时间。

    NEW CASE GENERATION DEVICE, NEW CASE GENERATION METHOD, AND NEW CASE GENERATION PROGRAM
    42.
    发明申请
    NEW CASE GENERATION DEVICE, NEW CASE GENERATION METHOD, AND NEW CASE GENERATION PROGRAM 审中-公开
    新案例生成设备,新案例生成方法和新案例生成程序

    公开(公告)号:US20110106849A1

    公开(公告)日:2011-05-05

    申请号:US12922396

    申请日:2009-03-09

    IPC分类号: G06F17/30

    CPC分类号: G06F16/30

    摘要: A new case whose type is the same as that of a case about information desired to be extracted can be generated with high accuracy.A new case generation device according to the present invention includes: new case generating means that receives a case about information desired to be extracted and a case context being text data that includes data on the case and parts present near the case, and generates, on the basis of the received case and the received case context, new cases and new case contexts with the use of document data, the type of the new cases being the same as that of the received case, and the new case contexts being text data that includes data on the new cases and parts present near the new cases and being different from the case context; similarity calculating means that calculates similarities between the case context and the new case contexts; and new case narrowing down means that narrows down, on the basis of the similarities calculated by the similarity calculating means, the new cases generated by the new case generating means and outputs a new case selected by the narrowing-down operation.

    摘要翻译: 可以高精度地生成与需要提取的信息的情况相同的新情况。 根据本发明的新的案例生成设备包括:新的案例生成单元,其接收关于期望提取的信息的案例,以及案例上下文是包含关于该案件附近的案例和部分的数据的文本数据,并且生成 收到案件的基础和收到的案件情况,新案件和新案件上下文使用文件数据,新案件的类型与接收案件的类型相同,新案件上下文为文本数据, 包括新案件和新案件附近出现的与案件背景不同的数据; 相似度计算装置,其计算情况上下文与新情况上下文之间的相似度; 并且新的情况缩小意味着基于由相似度计算装置计算出的相似度来缩小由新情况生成装置生成的新情况并输出通过缩小操作选择的新情况。

    SIMILARITY CALCULATION DEVICE AND INFORMATION SEARCH DEVICE
    43.
    发明申请
    SIMILARITY CALCULATION DEVICE AND INFORMATION SEARCH DEVICE 有权
    相似计算设备和信息搜索设备

    公开(公告)号:US20090319513A1

    公开(公告)日:2009-12-24

    申请号:US12374035

    申请日:2007-08-02

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30038 G06F17/30781

    摘要: [Problems] To accurately calculate similarity between media data and a query even if the media data or its meta data has an error.[Means for Solving the Problems] A similarity calculation device includes: a single score calculation device used when calculating similarity between first media data and a query, which calculates a single score that shows similarity between second media data different from the first media data and the query; an inter-media similarity calculation device which calculates inter-media similarity that shows the similarity between the second media data and the first media data; and a query similarity calculation device which obtains similarity between the first media data and the query by using the inter-media similarity of the second media data and the single score.

    摘要翻译: [问题]即使媒体数据或其元数据有错误,也可准确地计算媒体数据和查询之间的相似度。 解决问题的手段相似度计算装置包括:当计算第一媒体数据和查询之间的相似度时使用的单分计算装置,其计算显示与第一媒体数据不同的第二媒体数据与第一媒体数据之间的相似度的单个分数, 查询; 媒体间相似度计算装置,其计算显示第二媒体数据和第一媒体数据之间的相似度的媒体间相似度; 以及查询相似度计算装置,其通过使用第二媒体数据和单个分数的媒体间相似度来获得第一媒体数据和查询之间的相似度。

    TRANSLATION SUPPORTING APPARATUS AND METHOD AND COMPUTER-READABLE RECORDING MEDIUM, WHEREIN A TRANSLATION EXAMPLE USEFUL FOR THE TRANSLATION TASK IS SEARCHED OUT FROM WITHIN A TRANSLATION EXAMPLE DATABASE
    44.
    发明授权
    TRANSLATION SUPPORTING APPARATUS AND METHOD AND COMPUTER-READABLE RECORDING MEDIUM, WHEREIN A TRANSLATION EXAMPLE USEFUL FOR THE TRANSLATION TASK IS SEARCHED OUT FROM WITHIN A TRANSLATION EXAMPLE DATABASE 失效
    翻译支持的装置和方法以及可计算机可读记录介质,用于翻译任务的翻译示例是从翻译示例数据库中搜索的

    公开(公告)号:US06523000B1

    公开(公告)日:2003-02-18

    申请号:US09472036

    申请日:1999-12-27

    IPC分类号: G06F1728

    CPC分类号: G06F17/2827

    摘要: A translation supporting apparatus which searches out a translation example useful for a translation task from within a translation example database is disclosed. The translation example database stores character strings of a first language and translation results of a second language corresponding to the character strings in a unit of a document. A retrieval request inputting apparatus inputs a translation target sentence. A similarity retrieval apparatus determines, for each translation example, a similarity to the translation target sentence, a similarity to a translation example context which is another translation example having such a predetermined relationship that it is included in the same document and is present within one sentence before or after the translation example, a similarity to a retrieval request context which is another translation target character string having such a predetermined relationship that it is included in the same document as the translation target character string and is present within the range of one sentence before or after the translation target character string, and a similarity between the translation example context and the retrieval request context, and integrates the four similarities. A similar example outputting apparatus refers to the integrated similarities and outputs those translation examples similar to the translation target character string.

    摘要翻译: 公开了一种翻译支持装置,其从翻译示例数据库内搜索用于翻译任务的翻译示例。 翻译示例数据库以文档为单位存储第一语言的字符串和对应于字符串的第二语言的翻译结果。 检索请求输入装置输入翻译目标句子。 对于每个翻译示例,相似性检索装置确定与翻译目标句子的相似度,与具有这样的预定关系的另一个翻译示例的翻译示例上下文的相似性,其被包括在同一文档中并且存在于一个句子中 在翻译示例之前或之后,与检索请求上下文的相似性,该检索请求上下文是具有这样的预定关系的另一个翻译目标字符串,它被包含在与翻译目标字符串相同的文档中,并且存在于一个句子之前的范围内 或者在翻译目标字符串之后,以及翻译示例上下文和检索请求上下文之间的相似性,并且整合了四个相似之处。 类似的示例输出装置参考集成的相似性并输出类似于翻译目标字符串的翻译示例。

    Similarity search apparatus for searching unit string based on similarity
    45.
    发明授权
    Similarity search apparatus for searching unit string based on similarity 失效
    基于相似度搜索单元串的相似性搜索装置

    公开(公告)号:US6009424A

    公开(公告)日:1999-12-28

    申请号:US922452

    申请日:1997-09-03

    IPC分类号: G06F17/30

    摘要: Provided is a similarity search apparatus for searching data at a higher speed than that of the prior art without limiting the types of letter of a search key. A unit position correspondence memory stores therein a table that expresses the ordinal number among units at which each unit in a search key inputted by means of a keyboard has appeared within the search key. A search section refers to the table stored in the unit position correspondence memory and operates every time units are read out one by one from a database memory including a plurality of units to generate a plurality of status parameters each of which includes a similarity, a position of coincidence and a skip number, which express with what number of units from the top of the search key the units read out from the database have coincided at what degree of similarity, and express how many units in the database have been skipped over subsequently. Through the above process, the search section updates each status parameter stored in a status parameter memory and operates upon detecting a unit string coincident at a similarity equal to or lower than an inputted similarity, to output the detected unit string as a unit string of a similarity.

    摘要翻译: 提供了一种用于以比现有技术更高速度搜索数据的相似性搜索装置,而不限制搜索关键字的字母的类型。 单元位置对应存储器存储表示在搜索关键字中出现的通过键盘输入的搜索关键字中的每个单元的单位之间的顺序号的表。 搜索部分参考存储在单元位置对应存储器中的表,并且每当从包括多个单元的数据库存储器中逐个读出单元时,操作单元,以产生多个状态参数,每个状态参数包括相似性,位置 表示从数据库中读出的单位的搜索关键字顶部的单位数量与什么程度的相似度相符,并表示数据库中的单位数量随后被跳过了多少单位。 通过上述处理,搜索部分更新存储在状态参数存储器中的每个状态参数,并且在检测到以等于或低于所输入的相似度的相似度的相似度重合的单位串时进行操作,以将检测到的单位串输出为 相似。

    Information extraction processor
    46.
    发明授权
    Information extraction processor 失效
    信息提取处理器

    公开(公告)号:US5774845A

    公开(公告)日:1998-06-30

    申请号:US304945

    申请日:1994-09-13

    IPC分类号: G06F17/27 G06F17/30 G06F17/20

    摘要: In a processor for extracting information on a specified field from a text described in a natural language, keywords and structural analysis are jointly used to improve the performance. When a set of keywords is divided in more than one sentence, this set of keywords is assembled by context defining words in a sentence. A multi-language summary generator uses this type of a processor.

    摘要翻译: 在从自然语言描述的文本中提取关于指定字段的信息的处理器中,联合使用关键字和结构分析来提高性能。 当一组关键字被分成多个句子时,这组关键词是通过在句子中定义单词的上下文来组合的。 多语言摘要生成器使用这种类型的处理器。

    Natural-language processing system and dictionary registration system
    47.
    发明授权
    Natural-language processing system and dictionary registration system 有权
    自然语言处理系统和字典注册系统

    公开(公告)号:US09575953B2

    公开(公告)日:2017-02-21

    申请号:US12310773

    申请日:2007-09-06

    摘要: A natural-language processing system includes a registration-candidate storage section that stores therein registration-candidate dictionary data, a judgment means that compares input data against the registration-candidate dictionary data to thereby judge whether or not the input data includes a word corresponding to the registration-candidate dictionary data, an inquiry means that inquires to a user whether or not corresponding dictionary data is to be registered in a dictionary storage section to accept a user's instruction if it is judged that a corresponding word exists, a dictionary registration means that registers the corresponding dictionary data in the dictionary storage section based on the input instruction, and a natural-language processing means that executes a natural-language processing onto the input data by using the dictionary data registered in the dictionary storage section.

    摘要翻译: 自然语言处理系统包括:注册候选者存储部分,其存储注册候选词典数据;判断装置,其将输入数据与注册候选词典数据进行比较,从而判断输入数据是否包括与 注册候选词典数据;查询装置,如果判断出存在相应的单词,则向用户询问对应的字典数据是否要被登记在字典存储部分中以接受用户的指令,字典登记表示 基于输入指令,在字典存储部分中登记相应的字典数据;以及自然语言处理装置,其通过使用登记在字典存储部分中的字典数据,对输入数据执行自然语言处理。

    Document analysis apparatus, document analysis method, and computer-readable recording medium
    48.
    发明授权
    Document analysis apparatus, document analysis method, and computer-readable recording medium 有权
    文件分析装置,文件分析方法和计算机可读记录介质

    公开(公告)号:US09311392B2

    公开(公告)日:2016-04-12

    申请号:US13576669

    申请日:2011-01-25

    IPC分类号: G06F17/24 G06F17/30 G06F17/21

    CPC分类号: G06F17/30699

    摘要: A document analysis apparatus comprises: a feature expression acquisition unit acquiring a feature expression appearing during an attention period in an analysis object document collection; a document collection acquisition unit acquiring a feature expression containing document (FECD) collection in which a feature expression appears, from an analysis population including an analysis object document collection; a context determination unit specifying an analysis/FECD corresponding to an analysis object document among a FECD collection for every feature expression, and specifies a context in which the feature expression appeared in multiple analysis/FECDs; a context comparison determination unit specifying a non analysis/FECD not corresponding to an analysis object document among a FECD collection, and within that, compares a context in which the feature expression has appeared and a context specified previously; and a feature degree setting unit performing giving or the like of a feature degree to a feature expression from the comparison.

    摘要翻译: 文件分析装置包括:特征表达获取单元,获取在分析对象文档收集期间在注意期间出现的特征表达; 文档收集获取单元从包括分析对象文档集合的分析群体获取包含其中出现特征表达的文档(FECD)集合的特征表达式; 指定对于每个特征表达式的FECD集合中的与分析对象文档对应的分析/ FECD的上下文确定单元,并且指定在多个分析/ FECD中出现特征表达的上下文; 指定在FECD集合中与分析对象文档不对应的非分析/ FECD的上下文比较确定单元,并且在其中比较特征表达式出现的上下文和先前指定的上下文; 以及特征度设定单元,对来自所述比较的特征表达进行特征度的赋予等。

    Reputation analysis system and reputation analysis method
    49.
    发明授权
    Reputation analysis system and reputation analysis method 有权
    声誉分析系统和信誉分析方法

    公开(公告)号:US09245023B2

    公开(公告)日:2016-01-26

    申请号:US13511099

    申请日:2010-11-15

    IPC分类号: G06F17/30 G06Q30/02 G06Q10/10

    摘要: Described are a reputation analysis device, reputation analysis method, and reputation analysis-use program capable of suitably analyzing temporal changes in reputation for an object indicated by a keyword. The disclosed reputation analysis device is provided with a voluntary activity description extraction means for extracting descriptions representing voluntary activity related to an object indicated by a keyword that has been input from within a plurality of documents; and a reputation chronological data estimation means for counting the number of occurrences of voluntary activity at each time point wherein the voluntary activity expressed by a description representing the voluntary activity related to the object has been performed, and estimating reputation chronological data for chronologically representing evaluations for the object by the agents of the voluntary activity.

    摘要翻译: 描述了能够适当地分析由关键字指示的对象的信誉的时间变化的信誉分析装置,信誉分析方法和信誉分析用途程序。 所公开的信誉分析装置设置有自愿活动描述提取装置,用于提取表示与由多个文档中输入的关键字指示的对象相关的自愿活动的描述; 以及声誉时间数据估计装置,用于计算在每个时间点的自愿活动的发生次数,其中由表示与该对象相关的自愿活动的描述表示的自愿活动已经被执行,以及根据时间顺序表示评估的信誉时间数据, 对象由代理人进行志愿活动。

    Information estimation device, information estimation method, and computer-readable storage medium
    50.
    发明授权
    Information estimation device, information estimation method, and computer-readable storage medium 有权
    信息估计装置,信息估计方法和计算机可读存储介质

    公开(公告)号:US08832087B2

    公开(公告)日:2014-09-09

    申请号:US13516937

    申请日:2010-12-09

    IPC分类号: G06F17/30 G06F17/27

    CPC分类号: G06F17/277 G06F17/30985

    摘要: Disclosed is an information estimation device for estimating an appropriate issue time from a time representation described in a document without intervention of any operator; wherein an information estimation device (1) which is a device for estimating an issue time of a document to be estimated, includes a candidate generation unit (11) which extracts a time representation described in the document, and on the basis of the extracted time representation, generates a plurality of possible issue time candidates having possibilities corresponding to the issue time of the document; and an issue time estimation unit (12) for obtaining a temporal proximity, for each of the plurality of issue time candidates, between the issue time candidate and other issue time candidates, and on the basis of the obtained temporal proximity, estimating the issue time of the document.

    摘要翻译: 公开了一种信息估计装置,用于从文档中描述的时间表示估计适当的发行时间,而不需要任何操作者的干预; 其中,作为用于估计要估计的文档的发行时间的装置的信息估计装置(1)包括候选生成单元(11),其提取所述文档中描述的时间表示,并且基于所提取的时间 生成具有与文档的发行时间对应的可能性的多个可能的发行时间候选者; 以及发布时间估计单元(12),用于针对所述发布时间候选中的每一个在所述发布时间候选和其他发布时间候选之间获得时间接近度,并且基于所获得的时间邻近度,估计所述发布时间 的文件。