INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND COMPUTER-READABLE RECORDING MEDIUM

    公开(公告)号:US20210174021A1

    公开(公告)日:2021-06-10

    申请号:US16761583

    申请日:2017-11-10

    Inventor: Kai ISHIKAWA

    Abstract: An information processing apparatus includes a lexical analysis unit that generates a training word string, a group generation unit that generates a plurality of training word groups, a matrix generation unit that generates, for each training word group, a training matrix in which a plurality of words and respective semantic vectors of the words are associated, a classification unit that calculates, for a word of each position of the training word string, a probability of the word corresponding to a specific word, using the training matrices generated by the matrix generation unit and a determination model that uses a convolutional neural network, and an optimization processing unit that updates parameters of the determination model, such that the probability of the word labeled as corresponding to the specific word is high, among the probabilities of the words of the respective positions of the training word string calculated by the classification unit.

    INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND COMPUTER-READABLE RECORDING MEDIUM

    公开(公告)号:US20210192137A1

    公开(公告)日:2021-06-24

    申请号:US16761318

    申请日:2017-11-10

    Inventor: Kai ISHIKAWA

    Abstract: An information processing apparatus includes a lexical analysis unit that generates a training word string, a pair generation unit that generates a plurality of training word pairs, a matrix generation unit that generates, for each training word pair, a training matrix in which a plurality of words and respective semantic vectors of the words are associated, a classification unit that calculates, for a word of each position of the training word string, a probability of the word corresponding to a specific word, using the training matrices generated by the matrix generation unit and a determination model that uses a convolutional neural network, and an optimization processing unit that updates parameters of the determination model, such that the probability of the word labeled as corresponding to the specific word is high, among the probabilities of the words of the respective positions of the training word string calculated by the classification unit.

    TEXT PROCESSING SYSTEM, TEXT PROCESSING METHOD AND STORAGE MEDIUM STORING COMPUTER PROGRAM

    公开(公告)号:US20170255611A1

    公开(公告)日:2017-09-07

    申请号:US15506293

    申请日:2015-08-20

    CPC classification number: G06F17/2785 G06F17/2705 G06F17/2755

    Abstract: A text processing system that is able to appropriately determine textual entailment between sentences with high coverage is provided. The text processing system is configured to execute: processing of extracting a common substructure that is a partial structure of a same type, the partial structure being common to a first sentence and a second sentence and, based on the a structure representing the first sentence and a structure representing the second sentence; processing of extracting at least one of a feature amount representing a dependency relationship between the at least one common substructure in the first and second sentences and a feature amount representing a dependency relationship between the common substructure in the first and second sentences and a substructure different from the common substructure; and processing of determining an entailment relationship between the first sentence and the second sentence by using the extracted feature amount.

    TEXT MINING DEVICE, TEXT MINING METHOD, AND RECORDING MEDIUM
    5.
    发明申请
    TEXT MINING DEVICE, TEXT MINING METHOD, AND RECORDING MEDIUM 审中-公开
    文本采矿设备,文字挖掘方法和记录介质

    公开(公告)号:US20150356152A1

    公开(公告)日:2015-12-10

    申请号:US14759264

    申请日:2014-01-10

    Abstract: A text mining device includes: an analysis unit which acquires, from data including text and one or more attributes including an attribute name and an attribute value and associated with the text, the attributes as analysis viewpoints, analyzes the data using the respective analysis viewpoints to obtain an analysis result from each analysis viewpoint, and generates result vectors of the respective analysis viewpoints; a similarity acquisition unit which acquires a vector similarity between the result vectors of the plural analysis viewpoints; and a recommendation unit which extracts and output a combination of the analysis viewpoints as a recommendation candidate on basis of the vector similarity.

    Abstract translation: 一种文本挖掘装置包括:分析单元,从包括文本的数据和包括属性名称和属性值的一个或多个属性并与文本相关联的数据获取作为分析视点的属性,使用各自的分析视点分析数据 从每个分析视点获得分析结果,并生成各个分析视点的结果向量; 相似度获取单元,其获取所述多个分析视点的结果矢量之间的向量相似度; 以及推荐单元,其基于矢量相似度提取并输出分析视点的组合作为推荐候选。

    METHOD FOR CLASSIFYING A NEW INSTANCE
    6.
    发明申请

    公开(公告)号:US20170116332A1

    公开(公告)日:2017-04-27

    申请号:US15318853

    申请日:2014-06-20

    CPC classification number: G06F17/30707 G06F17/30011 G06N7/005 G06N99/005

    Abstract: A method for classifying a new instance including a text document by using training instances with class including labeled data and zero or more training instances with class including unlabeled data, comprising: estimating a word distribution for each class by using the labeled data and the unlabeled data; estimating a background distribution and a degree of interpolation between the background distribution and the word distribution by using the labeled data and the unlabeled data; calculating two probabilities for that the word generated from the word distribution and the word generated from the background distribution; combining the two probabilities by using the interpolation; combining the resulting probabilities of all words to estimate a document probability for the class that indicates the document is generated from the class; and classifying the new instance as a class for which the document probability is the highest.

    ENTAILMENT EVALUATION DEVICE, ENTAILMENT EVALUATION METHOD, AND RECORDING MEDIUM
    7.
    发明申请
    ENTAILMENT EVALUATION DEVICE, ENTAILMENT EVALUATION METHOD, AND RECORDING MEDIUM 有权
    安全评估设备,安全评估方法和记录介质

    公开(公告)号:US20160012034A1

    公开(公告)日:2016-01-14

    申请号:US14769866

    申请日:2014-02-28

    CPC classification number: G06F17/279 G06F17/2705 G06F17/2785

    Abstract: An entailment evaluation device includes: a generation unit which generates first information indicating at least the order of occurrence of events of first and second simple sentences included in the hypothesis text and generates second information indicating at least the order of occurrence of events of third and fourth simple sentences included in a target text, the third simple sentence being related to the first simple sentence, the fourth simple sentence being related to the second simple sentence; a calculation unit which obtains a calculation result by comparing, based on the first and second information, the order of occurrence of events of first and second simple sentences and order of occurrence of events of third and fourth simple sentences; and a determination unit which determines, based on at least the calculation result, whether or not the target text entails the hypothesis text.

    Abstract translation: 包含评估装置包括:生成单元,生成指示至少包含在假设文本中的第一和第二简单句子的事件的发生顺序的第一信息,并生成表示至少第三和第四事件的发生顺序的第二信息 包含在目标文本中的简单句子,第三个简单句子与第一个简单句子相关,第四个简单句子与第二个简单句子相关; 计算单元,其通过基于第一和第二信息比较第一和第二简单句子的事件的发生顺序以及第三和第四简单句子的事件的发生顺序来比较获得计算结果; 以及确定单元,其至少基于所述计算结果确定所述目标文本是否涉及所述假设文本。

    SIMILAR DATA SEARCH DEVICE,SIMILAR DATA SEARCH METHOD,AND COMPUTER-READABLE STORAGE MEDIUM
    8.
    发明申请
    SIMILAR DATA SEARCH DEVICE,SIMILAR DATA SEARCH METHOD,AND COMPUTER-READABLE STORAGE MEDIUM 审中-公开
    类似数据搜索设备,类似数据搜索方法和计算机可读存储介质

    公开(公告)号:US20160004736A1

    公开(公告)日:2016-01-07

    申请号:US14770534

    申请日:2014-03-05

    CPC classification number: G06F16/2228 G06F16/2455 G06F16/90

    Abstract: A similar data search device includes: an inverted index generating unit which determines size ranges of sets of search targets for each of inverted indexes so that the number of sets of search targets is not smaller than a specified number and generates inverted indexes by dividing the sets of search targets according to the determined size ranges; an unnecessary inverted index identifying unit which determines, based on a size of a set of search conditions and a threshold value specified for a similarity between sets, a condition necessary for the similarity to be no smaller than the threshold value, and identifies, as an inverted index unnecessary for searches, any inverted index other than those inverted indexes containing a set whose minimum size value satisfies the condition; and a data search unit which conducts a search on a non-identified inverted index.

    Abstract translation: 类似的数据搜索装置包括:反向索引生成单元,其确定每个反向索引的搜索目标集合的大小范围,使得搜索目标的集合的数量不小于指定的数量,并且通过将集合 的搜索目标根据确定的大小范围; 不必要的反转索引识别单元,其基于搜索条件的集合的大小和为集合之间的相似性指定的阈值,确定相似度不小于阈值的条件,并将其识别为 搜索所需的倒排索引,除了包含最小尺寸值满足条件的集合的反转索引以外的任何反向索引; 以及数据搜索单元,其对未识别的反向索引进行搜索。

    REASONING SYSTEM, REASONING METHOD, AND RECORDING MEDIUM

    公开(公告)号:US20180314951A1

    公开(公告)日:2018-11-01

    申请号:US15772678

    申请日:2015-11-10

    CPC classification number: G06N5/04

    Abstract: A reasoning system that enables reasoning when there is a shortage of knowledge. An input unit receives a start state and an end state. A rule candidate generation unit identifies a first state, obtained by tracking one or more known rules from the start state, and a second state, obtained by backtracking one or more known rules from the end state, respectively. The generation unit generates a rule candidate relating to the first state and the second state or generates a rule candidate relating to the first state and a rule candidate relating to the second state. A rule selection unit selects, based on feasibility of the generated rule candidate, which is calculated based on one or more known rules, the generated rule candidate as a new rule. A derivation unit derives the end state from the start state, based on one or more known rules and the new rule.

Patent Agency Ranking