Text matching device and method, and text classification device and method

    公开(公告)号:US10803103B2

    公开(公告)日:2020-10-13

    申请号:US14898565

    申请日:2014-05-15

    摘要: [Object] To provide a system for automatically and reliably collecting information belonging to a given category, and matching the information appropriately in a timely manner.[Solution] A text classifying device 30 analyzes dependency of collected texts by a morpheme analyzing unit 52 and a dependency analyzing unit 54. A problem report collecting unit 64 specifies a core consisting of noun+predicate in a text based on dependency relation of the text, and using a combination of noun classification (trouble/non-trouble) and predicate classification (excitatory/inhibitory), classifies the text to a problem report or the rest, by a method referred to as core-based matrix. Support information collecting device 66 and request message collecting device 68 collect support information and request messages in the similar manner. A matching device 76 matches problem reports and support information collected by problem report collecting unit 64 and support information collecting device 66 by a method referred to as co-occurrence core matrix, and thus associates problem information (support information) with appropriate support information (problem information).

    Non-factoid question-answering device

    公开(公告)号:US11176328B2

    公开(公告)日:2021-11-16

    申请号:US16629293

    申请日:2018-06-14

    摘要: A question answering device includes: a general word vector converter converting a question and an answer to semantic vectors in accordance with general context; a general sentence level CNN 214, in response to similarities of semantic vectors between words in question and answer and to strength of causality between the words, for weighting each semantic vector to calculate sentence level representations of the question and the answer; a general passage level CNN 218, in response to similarity between sentence level representations of question and answer, and to strength of relation of vectors in the sentence level representations viewed from causality, for weighting the sentence level representation to calculate a passage level representation for the question and answer passage; and a classifier determining whether or not an answer is a correct answer, based on the similarities between outputs from CNNs 214 and 218.

    Predicate template collecting device, specific phrase pair collecting device and computer program therefor
    4.
    发明授权
    Predicate template collecting device, specific phrase pair collecting device and computer program therefor 有权
    谓词模板收集装置,特定短语对收集装置及其计算机程序

    公开(公告)号:US09582487B2

    公开(公告)日:2017-02-28

    申请号:US14377988

    申请日:2013-01-23

    IPC分类号: G06F17/27

    摘要: A predicate template collector allowing efficient and automatic recognition of predicate templates is adapted to include: a noun pair collector 94 and a noun pair polarity determiner 98 for collecting noun pairs co-occurring with predicate template pairs and determining polarity of relation between nouns, using conjunctions and seed templates; a template pair collector 100, collecting template pairs co-occurring with noun pairs and determining, based on the relation of noun pairs co-occurring with the predicate template pairs and the conjunctions between predicate templates pairs, whether the polarity of excitatory class of predicate template pair is the same or not; a template network builder 106 building a template network connecting predicate templates based on the predicate template pairs and match/mismatch of excitatory class thereof; and a template excitation value calculator 112 calculating excitation value to be assigned to each node, using the excitation value of seed templates and the relation between each of the nodes in the network.

    摘要翻译: 允许对谓词模板进行有效和自动识别的谓词模板收集器适于包括:名词对收集器94和名词对极性确定器98,用于收集与谓词模板对共存的名词对,并使用连词确定名词之间的关系的极性 和种子模板; 模板对收集器100,收集与名词对共同存在的模板对,并且基于与谓词模板对共同出现的名词对与谓词模板对之间的连接的关系确定谓词模板的兴奋类的极性 对是一样的 模板网络构建器106构建基于谓词模板对连接谓词模板的模板网络,并且其兴奋类别的匹配/不匹配; 以及模板激励值计算器112,使用种子模板的激励值和网络中的每个节点之间的关系来计算要分配给每个节点的激励值。

    PREDICATE TEMPLATE COLLECTING DEVICE, SPECIFIC PHRASE PAIR COLLECTING DEVICE AND COMPUTER PROGRAM THEREFOR
    5.
    发明申请
    PREDICATE TEMPLATE COLLECTING DEVICE, SPECIFIC PHRASE PAIR COLLECTING DEVICE AND COMPUTER PROGRAM THEREFOR 有权
    预测模板收集设备,特定对付收集设备及其计算机程序

    公开(公告)号:US20150039296A1

    公开(公告)日:2015-02-05

    申请号:US14377988

    申请日:2013-01-23

    IPC分类号: G06F17/27

    摘要: A predicate template collector allowing efficient and automatic recognition of predicate templates is adapted to include: a noun pair collector 94 and a noun pair polarity determiner 98 for collecting noun pairs co-occurring with predicate template pairs and determining polarity of relation between nouns, using conjunctions and seed templates; a template pair collector 100, collecting template pairs co-occurring with noun pairs and determining, based on the relation of noun pairs co-occurring with the predicate template pairs and the conjunctions between predicate templates pairs, whether the polarity of excitatory class of predicate template pair is the same or not; a template network builder 106 building a template network connecting predicate templates based on the predicate template pairs and match/mismatch of excitatory class thereof; and a template excitation value calculator 112 calculating excitation value to be assigned to each node, using the excitation value of seed templates and the relation between each of the nodes in the network.

    摘要翻译: 允许对谓词模板进行有效和自动识别的谓词模板收集器适于包括:名词对收集器94和名词对极性确定器98,用于收集与谓词模板对共存的名词对,并使用连词确定名词之间的关系的极性 和种子模板; 模板对收集器100,收集与名词对共同存在的模板对,并且基于与谓词模板对共同出现的名词对与谓词模板对之间的连接的关系确定谓词模板的兴奋类的极性 对是一样的 模板网络构建器106构建基于谓词模板对连接谓词模板的模板网络,并且其兴奋类别的匹配/不匹配; 以及模板激励值计算器112,使用种子模板的激励值和网络中的每个节点之间的关系来计算要分配给每个节点的激励值。

    NON-FACTOID QUESTION-ANSWERING SYSTEM AND COMPUTER PROGRAM
    8.
    发明申请
    NON-FACTOID QUESTION-ANSWERING SYSTEM AND COMPUTER PROGRAM 有权
    非FACTOID问题答案系统和计算机程序

    公开(公告)号:US20150026106A1

    公开(公告)日:2015-01-22

    申请号:US14377999

    申请日:2013-01-23

    摘要: In order to provide a non-factoid question answering system with improved precision, the question answering system (160) includes: a candidate retrieving unit (222), responsive to a question, extracting answer candidates from a corpus storage (178); a feature vector generating unit (232) for generating features from combinations of a question with each of the answer candidates; SVMs (176) trained to calculate a score of how correct a combination of the question with an answer candidate is, upon receiving the feature vector therefor; and an answer ranker unit (234) outputting the answer candidate with the highest calculated score as the answer. The features are generated on the basis of the results of morphological analysis and parsing of the question, a phrase in the question evaluated as being positive or negative as well as its polarity, and the semantic classes of nouns in the features.

    摘要翻译: 为了提供更精确的非事实型问答系统,问答系统(160)包括:候选者检索单元(222),响应于问题,从语料库存储(178)提取答案候选者; 特征向量生成单元,用于从问题的组合中产生每个候选答案的特征; 在接收到该特征向量时,经训练的SVM(176)被训练以计算问题组合与答案候选者的正确度的得分; 以及将具有最高计算得分的答案候选作为答案输出的回答保持单元(234)。 这些特征是基于形态学分析和问题解析的结果生成的,被评价为正或负的一个短语以及其极性,以及特征中名词的语义类别。