ENTAILMENT EVALUATION DEVICE, ENTAILMENT EVALUATION METHOD, AND RECORDING MEDIUM
    2.
    Type: Invention application
    Status: Granted

    Publication number: US20160012034A1

    Publication date: 2016-01-14

    Application number: US14769866

    Filing date: 2014-02-28

    CPC classification number: G06F17/279 G06F17/2705 G06F17/2785

    Abstract: An entailment evaluation device includes: a generation unit which generates first information indicating at least the order of occurrence of events of first and second simple sentences included in a hypothesis text and generates second information indicating at least the order of occurrence of events of third and fourth simple sentences included in a target text, the third simple sentence being related to the first simple sentence, the fourth simple sentence being related to the second simple sentence; a calculation unit which obtains a calculation result by comparing, based on the first and second information, the order of occurrence of events of the first and second simple sentences and the order of occurrence of events of the third and fourth simple sentences; and a determination unit which determines, based on at least the calculation result, whether or not the target text entails the hypothesis text.
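
The order comparison described in this abstract can be illustrated with a minimal sketch. It assumes that events have already been extracted from each simple sentence, that event labels are unique, and that the relation between hypothesis events and target events is given as a mapping. The names `order_info`, `entails`, and `related` are hypothetical and do not come from the patent, and the sketch decides entailment from event order alone, whereas the abstract says the determination is based on "at least" that result.

```python
from typing import Dict, List


def order_info(events: List[str]) -> Dict[str, int]:
    """Map each event label to its position in the order of occurrence."""
    return {event: position for position, event in enumerate(events)}


def entails(hypothesis_events: List[str],
            target_events: List[str],
            related: Dict[str, str]) -> bool:
    """Return True when every pair of hypothesis events occurs in the same
    relative order as the related pair of target events (assumed criterion)."""
    hyp_info = order_info(hypothesis_events)   # "first information"
    tgt_info = order_info(target_events)       # "second information"

    # Every hypothesis event needs a related event in the target text.
    if any(related.get(h) not in tgt_info for h in hypothesis_events):
        return False

    for i, first in enumerate(hypothesis_events):
        for second in hypothesis_events[i + 1:]:
            hyp_before = hyp_info[first] < hyp_info[second]
            tgt_before = tgt_info[related[first]] < tgt_info[related[second]]
            if hyp_before != tgt_before:
                return False
    return True


hypothesis = ["buy ticket", "board train"]
related = {"buy ticket": "purchase ticket", "board train": "get on train"}
print(entails(hypothesis, ["purchase ticket", "eat lunch", "get on train"], related))
```

Unrelated target events, such as "eat lunch" here, do not affect the result because only the relative order of related pairs is compared.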

    SIMILAR DATA SEARCH DEVICE, SIMILAR DATA SEARCH METHOD, AND COMPUTER-READABLE STORAGE MEDIUM
    3.
    Type: Invention application
    Status: Under examination (published)

    Publication number: US20160004736A1

    Publication date: 2016-01-07

    Application number: US14770534

    Filing date: 2014-03-05

    CPC classification number: G06F16/2228 G06F16/2455 G06F16/90

    Abstract: A similar data search device includes: an inverted index generating unit which determines size ranges of sets of search targets for each of inverted indexes so that the number of sets of search targets is not smaller than a specified number and generates inverted indexes by dividing the sets of search targets according to the determined size ranges; an unnecessary inverted index identifying unit which determines, based on a size of a set of search conditions and a threshold value specified for a similarity between sets, a condition necessary for the similarity to be no smaller than the threshold value, and identifies, as an inverted index unnecessary for searches, any inverted index other than those inverted indexes containing a set whose minimum size value satisfies the condition; and a data search unit which conducts a search on a non-identified inverted index.
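
The pruning idea in this abstract can be sketched as follows. The abstract does not name a similarity measure, so the sketch assumes Jaccard similarity, for which a set S can reach threshold t against a query Q only if t·|Q| ≤ |S| ≤ |Q|/t. The function names, the dictionary layout of each index, and the rule of merging a short final group are all illustrative assumptions, not the patented method.

```python
from collections import defaultdict


def build_indexes(sets, min_sets):
    """Split the sets into size-ordered groups of at least `min_sets` sets
    and build one inverted index (element -> set ids) per group."""
    ordered = sorted(sets.items(), key=lambda item: len(item[1]))
    groups = [ordered[i:i + min_sets] for i in range(0, len(ordered), min_sets)]
    if len(groups) > 1 and len(groups[-1]) < min_sets:
        tail = groups.pop()          # merge a short final group into the previous one
        groups[-1].extend(tail)

    indexes = []
    for group in groups:
        postings = defaultdict(set)
        for set_id, members in group:
            for element in members:
                postings[element].add(set_id)
        sizes = [len(members) for _, members in group]
        indexes.append({"min": min(sizes), "max": max(sizes), "postings": postings})
    return indexes


def jaccard(a, b):
    return len(a & b) / len(a | b) if (a or b) else 1.0


def search(query, sets, indexes, threshold):
    """Search only the indexes whose size range can satisfy the threshold
    (threshold must be greater than 0)."""
    smallest = threshold * len(query)   # necessary lower bound on |S|
    largest = len(query) / threshold    # necessary upper bound on |S|
    hits = set()
    for index in indexes:
        if index["max"] < smallest or index["min"] > largest:
            continue                    # this index is unnecessary for the search
        candidates = set()
        for element in query:
            candidates |= index["postings"].get(element, set())
        hits |= {sid for sid in candidates if jaccard(query, sets[sid]) >= threshold}
    return hits
```

Because the size condition is only a necessary condition, every candidate found in a retained index is still verified with the full similarity computation.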

    TEXT PROCESSING SYSTEM, TEXT PROCESSING METHOD AND STORAGE MEDIUM STORING COMPUTER PROGRAM

    Publication number: US20170255611A1

    Publication date: 2017-09-07

    Application number: US15506293

    Filing date: 2015-08-20

    CPC classification number: G06F17/2785 G06F17/2705 G06F17/2755

    Abstract: A text processing system that can appropriately determine textual entailment between sentences with high coverage is provided. The text processing system is configured to execute: processing of extracting, based on a structure representing a first sentence and a structure representing a second sentence, at least one common substructure that is a partial structure of a same type common to the first sentence and the second sentence; processing of extracting at least one of a feature amount representing a dependency relationship between the at least one common substructure in the first and second sentences and a feature amount representing a dependency relationship between the common substructure in the first and second sentences and a substructure different from the common substructure; and processing of determining an entailment relationship between the first sentence and the second sentence by using the extracted feature amount.

    TEXT MINING DEVICE, TEXT MINING METHOD, AND RECORDING MEDIUM
    5.
    Type: Invention application
    Status: Under examination (published)

    Publication number: US20150356152A1

    Publication date: 2015-12-10

    Application number: US14759264

    Filing date: 2014-01-10

    Abstract: A text mining device includes: an analysis unit which acquires, from data including text and one or more attributes including an attribute name and an attribute value and associated with the text, the attributes as analysis viewpoints, analyzes the data using the respective analysis viewpoints to obtain an analysis result from each analysis viewpoint, and generates result vectors of the respective analysis viewpoints; a similarity acquisition unit which acquires a vector similarity between the result vectors of the plural analysis viewpoints; and a recommendation unit which extracts and outputs a combination of the analysis viewpoints as a recommendation candidate on the basis of the vector similarity.
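
A rough sketch of the flow in this abstract follows, under several assumptions that the abstract does not state: the analysis result for a viewpoint is taken to be a word-frequency vector over the texts carrying that attribute value, the vector similarity is cosine similarity, and pairs are ranked from most to least similar. The record layout and the names `result_vector`, `cosine`, and `recommend` are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def result_vector(records, viewpoint):
    """Word-frequency vector of the texts whose attribute matches the viewpoint."""
    name, value = viewpoint
    counts = Counter()
    for record in records:
        if record["attributes"].get(name) == value:
            counts.update(record["text"].lower().split())
    return counts


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[word] * v[word] for word in u.keys() & v.keys())
    norm = math.sqrt(sum(x * x for x in u.values()) * sum(x * x for x in v.values()))
    return dot / norm if norm else 0.0


def recommend(records, top_k=1):
    """Rank pairs of analysis viewpoints by the similarity of their result vectors."""
    viewpoints = sorted({(name, value)
                         for record in records
                         for name, value in record["attributes"].items()})
    vectors = {vp: result_vector(records, vp) for vp in viewpoints}
    scored = [(cosine(vectors[a], vectors[b]), a, b)
              for a, b in combinations(viewpoints, 2)]
    scored.sort(key=lambda item: item[0], reverse=True)
    return scored[:top_k]
```

Whether high or low similarity makes a combination worth recommending depends on the application; reversing the sort order would surface the most dissimilar viewpoint pairs instead.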

    INFORMATION PROCESSING SYSTEM, INFORMATION PROCESSING METHOD, AND RECORDING MEDIUM

    Publication number: US20170330108A1

    Publication date: 2017-11-16

    Application number: US15529330

    Filing date: 2015-11-16

    CPC classification number: G06N20/00 G06F16/00 G06F16/285

    Abstract: A classification model with a high precision ratio at a high recall ratio is learned. A classification model learning system (100) includes a learning data storage unit (110) and a learning unit (130). The learning data storage unit (110) stores pieces of learning data each of which has been classified as a positive example or a negative example. The learning unit (130) learns, by using the pieces of learning data, a classification model in such a way that a precision ratio of classification by the classification model is made larger under a constraint of a minimum value of a recall ratio of classification by the classification model.
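
The constraint described in this abstract (raise precision while keeping recall at or above a minimum) can be illustrated with a much simpler stand-in: choosing a decision threshold over the scores of an already-trained scorer. This is not the learning procedure of the patent, which learns the model itself under the constraint; the function name `pick_threshold` and the example data are assumptions made for illustration.

```python
def pick_threshold(scores, labels, min_recall):
    """Return (precision, recall, threshold) for the threshold that maximizes
    precision among those whose recall is at least `min_recall`, or None."""
    positives = sum(labels)
    if positives == 0:
        raise ValueError("at least one positive example is required")

    best = None
    for threshold in sorted(set(scores)):
        predicted = [score >= threshold for score in scores]
        true_pos = sum(1 for p, y in zip(predicted, labels) if p and y)
        false_pos = sum(1 for p, y in zip(predicted, labels) if p and not y)
        recall = true_pos / positives
        if recall < min_recall:
            continue                      # violates the recall constraint
        precision = true_pos / (true_pos + false_pos)
        if best is None or precision > best[0]:
            best = (precision, recall, threshold)
    return best


scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
labels = [1, 1, 0, 1, 0, 0]   # 1 = positive example, 0 = negative example
print(pick_threshold(scores, labels, min_recall=1.0))
```

Tightening `min_recall` generally forces a lower threshold and therefore lower precision, which is the trade-off the constraint makes explicit.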

    CLASSIFICATION DICTIONARY GENERATION APPARATUS, CLASSIFICATION DICTIONARY GENERATION METHOD, AND RECORDING MEDIUM
    8.
    Type: Invention application
    Status: Under examination (published)

    Publication number: US20160224654A1

    Publication date: 2016-08-04

    Application number: US14915797

    Filing date: 2014-09-17

    CPC classification number: G06F16/285 G06F16/35 G06F16/93 G06N20/00

    Abstract: A classification dictionary generation apparatus includes: a lower threshold storage unit that stores lower threshold information that determines a lower threshold of dimensional values of a classification dictionary for classifying a category of a document; and a control unit that generates the classification dictionary based on learning data whose category is known, wherein the control unit generates, based on the lower threshold information stored in the lower threshold storage unit, the classification dictionary in which all of the dimensional values are equal to or larger than the lower threshold.
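
One way to picture a classification dictionary whose dimensional values never fall below a lower threshold is to clip each value at that threshold when the dictionary is generated. The sketch below does this with a deliberately simple, hypothetical weighting (word count in the category minus the largest count in any other category); the patent abstract does not specify the weighting or that clipping is the mechanism used.

```python
from collections import Counter, defaultdict


def build_dictionary(learning_data, lower_threshold):
    """Build {category: {word: weight}} from (category, text) pairs, with every
    weight clipped so that it is at least `lower_threshold`."""
    counts = defaultdict(Counter)
    for category, text in learning_data:
        counts[category].update(text.lower().split())

    vocabulary = sorted({word for counter in counts.values() for word in counter})
    dictionary = {}
    for category in counts:
        others = [c for c in counts if c != category]
        weights = {}
        for word in vocabulary:
            rival = max((counts[other][word] for other in others), default=0)
            raw = counts[category][word] - rival      # may be negative
            weights[word] = max(raw, lower_threshold)  # enforce the lower threshold
        dictionary[category] = weights
    return dictionary


def classify(dictionary, text):
    """Pick the category whose dictionary gives the text the highest total weight."""
    words = text.lower().split()
    return max(dictionary,
               key=lambda category: sum(dictionary[category].get(w, 0.0) for w in words))
```

With a lower threshold such as -0.5, a single word that is strongly associated with another category can only subtract a bounded amount from a category's score.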

    TEXT VISUALIZATION SYSTEM, TEXT VISUALIZATION METHOD, AND RECORDING MEDIUM

    Publication number: US20180081966A1

    Publication date: 2018-03-22

    Application number: US15558354

    Filing date: 2015-03-18

    CPC classification number: G06F16/34 G06F16/00 G06F16/353

    Abstract: A text visualization system which allows a user to efficiently ascertain a result of clustering of texts is provided. A clustering system (1) includes a representative text display unit (51), a reception unit (55), and an element text display unit (52). The clustering system (1) is accessibly connected to a storage that stores a plurality of texts and information indicating a representative text and an element text that entails the representative text among the plurality of texts. The representative text display unit (51) displays a plurality of representative texts. The reception unit (55) receives a designation of a specific representative text among the plurality of representative texts. The element text display unit (52) extracts, in response to receiving the designation of the specific representative text, an element text that entails the designated specific representative text from the plurality of texts, and displays the extracted element text.

    SENTENCE SET EXTRACTION SYSTEM, METHOD, AND PROGRAM

    Publication number: US20170220585A1

    Publication date: 2017-08-03

    Application number: US15328199

    Filing date: 2015-07-21

    CPC classification number: G06F16/355

    Abstract: A similar sentence set generation unit 81 groups sentences representing a same concept or event from a set of analysis target sentences, to generate a similar sentence set. A similar sentence set extraction unit 82 extracts, using one or more specific sentence extractors each capable of extracting a specific sentence belonging to a specific classification from the set of analysis target sentences, one or more sentences not extracted by any of the specific sentence extractors from among the sentences belonging to the similar sentence set, as an exclusion similar sentence set.
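
The two units in this abstract can be sketched in a few lines. Grouping by an order-insensitive bag of words is only a placeholder for "sentences representing a same concept or event", and the two extractors are toy examples; the names `bag_of_words_key`, `similar_sentence_sets`, and `exclusion_set` are hypothetical.

```python
from collections import defaultdict


def bag_of_words_key(sentence):
    """Placeholder similarity key: lowercased words without trailing punctuation."""
    return frozenset(word.strip("!.?,").lower() for word in sentence.split())


def similar_sentence_sets(sentences, key=bag_of_words_key):
    """Group sentences that share the same key into similar sentence sets."""
    groups = defaultdict(list)
    for sentence in sentences:
        groups[key(sentence)].append(sentence)
    return list(groups.values())


def exclusion_set(similar_set, all_sentences, extractors):
    """Sentences of a similar set that none of the specific extractors extracted."""
    extracted = set()
    for extractor in extractors:
        extracted.update(extractor(all_sentences))
    return [sentence for sentence in similar_set if sentence not in extracted]


sentences = [
    "The printer is broken",
    "the printer is broken!",
    "Please refund my order",
    "Printer broken is the",
]
# Toy specific sentence extractors (assumed classifications: complaint, request).
complaints = lambda items: [s for s in items if s.endswith("!")]
requests = lambda items: [s for s in items if s.startswith("Please")]

first_set = similar_sentence_sets(sentences)[0]
print(exclusion_set(first_set, sentences, [complaints, requests]))
```

The resulting list contains the sentences that belong to a similar sentence set but fall outside every specific classification, which is what the abstract calls the exclusion similar sentence set.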
