Communication assistance device, communication assistance method, and computer readable recording medium
    31.
    发明授权
    Communication assistance device, communication assistance method, and computer readable recording medium 有权
    通信辅助装置,通信辅助方法和计算机可读记录介质

    公开(公告)号:US09244970B2

    公开(公告)日:2016-01-26

    申请号:US13810478

    申请日:2011-07-13

    IPC分类号: G06F7/00 G06F17/30 G06Q50/00

    CPC分类号: G06F17/30386 G06Q50/01

    摘要: A communication assistance device (10) includes a communication level determination unit (11) so as to determine a level of a relationship between users who communicate with each other. The communication level determination unit (11) determines the level (communication level) of the relationship between the users based on similarity between the users obtained from preference information showing preferences of the users, and on user action records showing records of actions taken by a certain user toward a partner user with whom the certain user communicates out of the users.

    摘要翻译: 通信辅助装置(10)包括通信级别确定单元(11),以确定彼此通信的用户之间的关系的级别。 通信级别确定单元(11)基于从显示用户的偏好的偏好信息获得的用户之间的相似性以及显示由某个特定的用户的动作所采取的动作的记录的用户动作记录来确定用户之间的关系的级别(通信级别) 用户向某个用户与用户通信的合作伙伴用户。

    Meaning extraction system, meaning extraction method, and recording medium
    32.
    发明授权
    Meaning extraction system, meaning extraction method, and recording medium 有权
    含义提取系统,含义提取方法和记录介质

    公开(公告)号:US09171071B2

    公开(公告)日:2015-10-27

    申请号:US13636061

    申请日:2011-03-24

    IPC分类号: G06F15/18 G06F17/30

    CPC分类号: G06F17/30705 G06F17/30616

    摘要: A meaning extraction device includes a clustering unit, an extraction rule generation unit and an extraction rule application unit. The clustering unit acquires feature vectors that transform numerical features representing the features of words having specific meanings and the surrounding words into elements, and clusters the acquired feature vectors into a plurality of clusters on the basis of the degree of similarity between feature vectors. The extraction rule generation unit performs machine learning based on the feature vectors within a cluster for each cluster, and generates extraction rules to extract words having specific meanings. The extraction rule application unit receives feature vectors generated from the words in documents which are subject to meaning extraction, specifies the optimum extraction rules for the feature vectors, and extracts the meanings of the words on the basis of which the feature vectors were generated by applying the specified extraction rules to the feature vectors.

    摘要翻译: 意思提取装置包括聚类单元,提取规则生成单元和提取规则应用单元。 聚类单元获取将表示具有特定含义的单词的特征的数字特征和周围单词的数字特征变换为元素的特征向量,并且基于特征向量之间的相似度将所获取的特征向量聚类成多个群集。 提取规则生成单元基于针对每个群集的群集内的特征向量执行机器学习,并且生成提取规则以提取具有特定含义的单词。 提取规则应用单元接收从文本中经过含义提取的单词生成的特征向量,指定特征向量的最优提取规则,并提取基于特征向量生成的单词的含义, 指定的提取规则到特征向量。

    Determining whether text information corresponds to target information
    33.
    发明授权
    Determining whether text information corresponds to target information 有权
    确定文本信息是否对应于目标信息

    公开(公告)号:US08510249B2

    公开(公告)日:2013-08-13

    申请号:US13063231

    申请日:2009-10-06

    IPC分类号: G06F17/27

    摘要: An information analysis apparatus that performs an analysis on text information to determine whether or not the text information corresponds to the target information. The information analysis apparatus includes a storage device that stores the text information; a density estimation unit that estimates, in units of analysis each composed of a plurality of sentences of text information, a density indicating the degree to which the target information is included in the unit of analysis; and a determination unit that obtains an evaluation value indicating the degree to which each sentence included in each unit of analysis corresponds to the target information from the estimated density of the unit of analysis, and determines whether or not the sentence corresponds to the target information based on the evaluation value.

    摘要翻译: 一种对文本信息执行分析以确定文本信息是否对应于目标信息的信息分析装置。 信息分析装置包括存储文本信息的存储装置; 密度估计单元,以分析为单位,以文本信息的多个句子为单位,以表示分析单位包含目标信息的程度的浓度进行估计; 以及确定单元,其从分析单元的估计密度获得指示每个分析单元中包括的每个句子的程度对应于目标信息的评估值,并且确定该句子是否对应于目标信息 对评价值。

    Attribute extraction method, system, and program
    34.
    发明授权
    Attribute extraction method, system, and program 有权
    属性提取方法,系统和程序

    公开(公告)号:US08463738B2

    公开(公告)日:2013-06-11

    申请号:US12866215

    申请日:2009-03-05

    IPC分类号: G06F17/30

    摘要: Sets of strings of which the drawing positions are arranged in one direction are extracted from a document as attribute groups. An attribute name score is calculated for each attribute group to determine an extent to which each attribute group is a set of attribute names. Based on the attribute name scores, an attribute name group is selected out of the attribute groups. From among the attribute groups, an attribute group which includes a string which is the same as at least one string of the attribute name group and of which the drawing position is the same as that of the string of the attribute name group is selected. From the string at the same drawing position, an attribute name is extracted. From the other strings of the selected attribute group than those at the same drawing position, an attribute value corresponding to the attribute name is extracted.

    摘要翻译: 绘图位置在一个方向排列的一组字符串作为属性组从文档中提取出来。 为每个属性组计算属性名称得分,以确定每个属性组是一组属性名称的范围。 根据属性名称分数,从属性组中选出属性名称组。 在属性组中,选择包括与属性名称组的至少一个字符串相同的字符串并且其绘制位置与属性名称组的字符串相同的字符串的属性组。 从相同绘图位置的字符串中提取属性名称。 从所选择的属性组的其他字符串中,与相同的绘图位置相对应的属性值被提取。

    Cooccurrence dictionary creating system, scoring system, cooccurrence dictionary creating method, scoring method, and program thereof
    35.
    发明授权
    Cooccurrence dictionary creating system, scoring system, cooccurrence dictionary creating method, scoring method, and program thereof 有权
    并发词典创建系统,评分系统,同时发生字典创建方法,评分方法及其程序

    公开(公告)号:US08443008B2

    公开(公告)日:2013-05-14

    申请号:US12922320

    申请日:2009-04-01

    IPC分类号: G06F17/30

    CPC分类号: G06F17/2735

    摘要: A cooccurrence dictionary creating system includes: a language analyzing section which subjects a text to a morpheme analysis, a clause specification, and a modification relationship analysis between clauses, a cooccurrence relationship collecting section which collects cooccurrences of nouns in each clause of the text, modification relationships of nouns and declinable words, and modification relationships between declinable words as cooccurrence relationships, a cooccurrence score calculating section which calculates a cooccurrence score of the cooccurrence relationship based on a frequency of the collected cooccurrence relationship, and a cooccurrence dictionary storage section which stores a cooccurrence dictionary in which a correspondence between the calculated cooccurrence score and the cooccurrence relationship is described.

    摘要翻译: 并发词典创建系统包括:语言分析部分,其对文本进行语素分析,子句规范,以及条款之间的修改关系分析,在文本的每个子句中收集名词的一致性的共同关系收集部分,修改 名词和不可否认的词的关系,以及可下降词之间的修饰关系作为共同发生关系,基于收集的同现关系的频率来计算并发关系的同现比分的共同出发分数计算部分,以及存储 描述了计算出的并发分数与共同发生关系之间的对应关系的同时发生词典。

    COMMUNICATION ASSISTANCE DEVICE, COMMUNICATION ASSISTANCE METHOD, AND COMPUTER READABLE RECORDING MEDIUM
    36.
    发明申请
    COMMUNICATION ASSISTANCE DEVICE, COMMUNICATION ASSISTANCE METHOD, AND COMPUTER READABLE RECORDING MEDIUM 有权
    通信辅助设备,通信辅助方法和计算机可读记录介质

    公开(公告)号:US20130117296A1

    公开(公告)日:2013-05-09

    申请号:US13810478

    申请日:2011-07-13

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30386 G06Q50/01

    摘要: A communication assistance device (10) includes a communication level determination unit (11) so as to determine a level of a relationship between users who communicate with each other. The communication level determination unit (11) determines the level (communication level) of the relationship between the users based on similarity between the users obtained from preference information showing preferences of the users, and on user action records showing records of actions taken by a certain user toward a partner user with whom the certain user communicates out of the users.

    摘要翻译: 通信辅助装置(10)包括通信级别确定单元(11),以确定彼此通信的用户之间的关系的级别。 通信级别确定单元(11)基于从显示用户的偏好的偏好信息获得的用户之间的相似性以及显示由某个特定的用户的动作所采取的动作的记录的用户动作记录来确定用户之间的关系的级别(通信级别) 用户向某个用户与用户通信的合作伙伴用户。

    DICTIONARY CREATION DEVICE, WORD GATHERING METHOD AND RECORDING MEDIUM
    37.
    发明申请
    DICTIONARY CREATION DEVICE, WORD GATHERING METHOD AND RECORDING MEDIUM 审中-公开
    词典创作设备,词汇记录方法和记录介质

    公开(公告)号:US20120303359A1

    公开(公告)日:2012-11-29

    申请号:US13515135

    申请日:2010-12-03

    IPC分类号: G06F17/21

    CPC分类号: G06F17/2735 G06F16/353

    摘要: When gathering words through a dictionary growth process, a dictionary growth unit (102) stores information indicating through what process of input and output a word has been gathered in a gathering process memory unit (107). Then, a clustering unit (103) classifies the word that has been gathered by the dictionary growth process into clusters on the basis of information recorded in the gathering process memory unit (107). Next, a type determination unit (104) determines whether a word comprising a cluster is of the same type as a seed word or of a different type, for each cluster into which the word has been classified, on the basis of information recorded in the gather process memory unit (107). In addition, an output unit (105) associates information indicating the gathered word, the cluster to which the word belongs and whether the cluster is of the same type as the seed word or of a different type, and displays such.

    摘要翻译: 当通过字典增长过程收集单词时,词典生成单元(102)存储指示通过什么进程输入和输出一个单词被收集在收集处理存储单元(107)中的信息。 然后,聚类单元(103)根据记录在采集处理存储单元(107)中的信息,将由字典成长处理收集的单词分类成簇。 接下来,类型确定单元(104)基于记录在该文件中的信息,确定包含群集的单词是否与种子单词或不同类型的单词相对应, 收集过程存储单元(107)。 此外,输出单元(105)将指示所收集的单词,单词所属的集群与集群是否与种子单词或不同类型相同的类型的信息相关联,并且显示这样的信息。

    TRAINING DATA GENERATION APPARATUS, CHARACTERISTIC EXPRESSION EXTRACTION SYSTEM, TRAINING DATA GENERATION METHOD, AND COMPUTER-READABLE STORAGE MEDIUM
    38.
    发明申请
    TRAINING DATA GENERATION APPARATUS, CHARACTERISTIC EXPRESSION EXTRACTION SYSTEM, TRAINING DATA GENERATION METHOD, AND COMPUTER-READABLE STORAGE MEDIUM 有权
    培训数据生成装置,特征表达提取系统,培训数据生成方法和计算机可读存储介质

    公开(公告)号:US20120030157A1

    公开(公告)日:2012-02-02

    申请号:US13263280

    申请日:2010-03-17

    IPC分类号: G06F15/18

    摘要: The disclosed apparatus uses a training data generation apparatus 2, which generates training data used for creating characteristic expression extraction rules. The training data generation apparatus 2 includes: a training data candidate clustering unit 21, which clusters a plurality of training data candidates assigned labels indicating annotation classes based on feature values containing respective context information, and a training data generation unit 22 which, by referring to each cluster obtained using the clustering results, obtains the distribution of the labels of the training data candidates within the cluster, identifies training data candidates that meet a preset condition based on the obtained distribution, and generates training data using the identified training data candidates.

    摘要翻译: 所公开的装置使用训练数据生成装置2,其生成用于创建特征表达式提取规则的训练数据。 训练数据产生装置2包括:训练数据候选聚类单元21,其基于包含各个上下文信息的特征值聚集分配了指示注释类别的标签的多个训练数据候选者;训练数据生成单元22,通过参考 使用聚类结果获得的每个聚类获得聚类内的训练数据候选的标签的分布,基于获得的分布来识别满足预设条件的训练数据候选,并使用所识别的训练数据候选来生成训练数据。

    ATTRIBUTE EXTRACTION METHOD, SYSTEM, AND PROGRAM
    39.
    发明申请
    ATTRIBUTE EXTRACTION METHOD, SYSTEM, AND PROGRAM 有权
    属性提取方法,系统和程序

    公开(公告)号:US20100318525A1

    公开(公告)日:2010-12-16

    申请号:US12866215

    申请日:2009-03-05

    IPC分类号: G06F17/30

    摘要: Sets of strings of which the drawing positions are arranged in one direction are extracted from a document as attribute groups. An attribute name score is calculated for each attribute group to determine an extent to which each attribute group is a set of attribute names. Based on the attribute name scores, an attribute name group is selected out of the attribute groups. From among the attribute groups, an attribute group which includes a string which is the same as at least one string of the attribute name group and of which the drawing position is the same as that of the string of the attribute name group is selected. From the string at the same drawing position, an attribute name is extracted. From the other strings of the selected attribute group than those at the same drawing position, an attribute value corresponding to the attribute name is extracted.

    摘要翻译: 绘图位置在一个方向排列的一组字符串作为属性组从文档中提取出来。 为每个属性组计算属性名称得分,以确定每个属性组是一组属性名称的范围。 根据属性名称分数,从属性组中选出属性名称组。 在属性组中,选择包括与属性名称组的至少一个字符串相同的字符串并且其绘制位置与属性名称组的字符串相同的字符串的属性组。 从相同绘图位置的字符串中提取属性名称。 从所选择的属性组的其他字符串中,与相同的绘图位置相对应的属性值被提取。

    ONTOLOGY PROCESSING DEVICE, ONTOLOGY PROCESSING METHOD, AND ONTOLOGY PROCESSING PROGRAM
    40.
    发明申请
    ONTOLOGY PROCESSING DEVICE, ONTOLOGY PROCESSING METHOD, AND ONTOLOGY PROCESSING PROGRAM 有权
    民族处理装置,本土化处理方法和本地化处理方案

    公开(公告)号:US20100121885A1

    公开(公告)日:2010-05-13

    申请号:US12598304

    申请日:2008-05-27

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30734

    摘要: To provide a technique for structuralizing ontology in a prescribed form to a structure to which features of data are reflected. An ontology processing device has a structuralizing device for structuralizing properties of the ontology in the prescribed form generated from a set of instance data containing a combination of a subject, a property, and an object expressed with a character string according to the features of the object, and has a ontology storage device which stores the ontology structuralized by the structuralizing device. With this structure, the properties of the ontology in the prescribed form are corrected or expressed as an ontology structure by reflecting the characteristics of a set of the objects obtained from the data.

    摘要翻译: 提供一种以规定的形式将本体结构化为反映数据特征的结构的技术。 本体处理装置具有结构化装置,用于根据包含对象,属性和根据对象的特征的字符串表示的对象的组合的一组实例数据生成的规定形式来结构化本体的属性 并具有存储由结构化装置构成的本体的本体存储装置。 利用这种结构,通过反映从数据获得的一组对象的特征,以规定形式的本体的属性被校正或表示为本体结构。