Rule learning by updating training example and a remaining feature weights
    1.
    Granted patent
    Status: In force

    Publication number: US08296249B2

    Publication date: 2012-10-23

    Application number: US12505529

    Filing date: 2009-07-20

    IPC classification: G06F15/18

    CPC classification: G06N5/025

    Abstract: A rule learning method for making a computer perform rule learning processing in machine learning includes firstly calculating an evaluation value of respective features in a training example by using data and weights of the training examples; selecting a given number of features in descending order of the evaluation values; secondly calculating a confidence value for one of the given number of selected features; updating the weights of training example, by using the data and weights of the training examples, and the confidence value corresponding to the one feature; firstly repeating the updating for the remaining features of the given number of features; and secondly repeating, for a given number of times, the firstly calculating, the selecting, the secondly calculating, the updating, and the firstly repeating.
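
    The loop described in this abstract can be sketched as follows. This is only an illustration under stated assumptions, not the patented implementation: the abstract does not give formulas, so the evaluation value (a boosting-style gain), the confidence value, and the exponential weight update are borrowed from common boosting practice, and every name (`weight_sums`, `learn_rules`, `top_k`, `eps`) is hypothetical.

```python
import math


def weight_sums(feature, examples, weights):
    """Sum the weights of positive and negative examples that contain `feature`."""
    pos = neg = 0.0
    for (feats, label), w in zip(examples, weights):
        if feature in feats:
            if label > 0:
                pos += w
            else:
                neg += w
    return pos, neg


def learn_rules(examples, rounds, top_k, eps=1e-3):
    """examples: list of (set_of_features, label) pairs with label in {+1, -1}."""
    weights = [1.0] * len(examples)
    features = sorted({f for feats, _ in examples for f in feats})
    rules = []
    for _ in range(rounds):  # outer repetition, a given number of times
        # Evaluation value of each feature (assumed gain: |sqrt(W+) - sqrt(W-)|).
        def gain(f):
            pos, neg = weight_sums(f, examples, weights)
            return abs(math.sqrt(pos) - math.sqrt(neg))

        # Select a given number of features in descending order of evaluation value.
        selected = sorted(features, key=gain, reverse=True)[:top_k]

        # For each selected feature in turn: confidence value, then weight update.
        for f in selected:
            pos, neg = weight_sums(f, examples, weights)
            confidence = 0.5 * math.log((pos + eps) / (neg + eps))  # assumed formula
            rules.append((f, confidence))
            weights = [
                w * math.exp(-label * confidence) if f in feats else w
                for (feats, label), w in zip(examples, weights)
            ]
    return rules, weights
```

    Because the confidence of each later feature is computed from weights already changed by the earlier ones, a single evaluation pass yields several rules per round in this sketch.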

    Rule learning method, program, and device selecting rule for updating weights based on confidence value
    2.
    Granted patent
    Status: In force

    Publication number: US08370276B2

    Publication date: 2013-02-05

    Application number: US12507379

    Filing date: 2009-07-22

    IPC classification: G06F15/18

    CPC classification: G06N5/025

    Abstract: A rule learning method in machine learning includes distributing features to a given number of buckets based on a weight of the features which are correlated with a training example; specifying a feature with a maximum gain value as a rule based on a weight of the training example from each of the buckets; calculating a confidence value of the specified rule based on the weight of the training example; storing the specified rule and the confidence value in a rule data storage unit; updating the weights of the training examples based on the specified rule, the confidence value of the specified rule, data of the training example, and the weight of the training example; and repeating the distributing, the specifying, the calculating, the storing, and the updating, when the rule and the confidence value are to be further generated.
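
    A minimal sketch of the bucket-based variant, under assumptions that the abstract does not state: a feature's weight is taken to be the summed weight of the examples containing it, features are dealt round-robin into buckets in descending weight order, and the gain and confidence formulas follow common boosting practice. The names `distribute`, `learn_rules_bucketed`, `max_rules`, and `eps` are hypothetical, and a plain list stands in for the rule data storage unit.

```python
import math


def weight_sums(feature, examples, weights):
    """Sum the weights of positive and negative examples that contain `feature`."""
    pos = neg = 0.0
    for (feats, label), w in zip(examples, weights):
        if feature in feats:
            if label > 0:
                pos += w
            else:
                neg += w
    return pos, neg


def gain(feature, examples, weights):
    pos, neg = weight_sums(feature, examples, weights)
    return abs(math.sqrt(pos) - math.sqrt(neg))  # assumed gain formula


def distribute(features, examples, weights, n_buckets):
    """Deal features into buckets round-robin, heaviest feature first."""
    def feature_weight(f):
        return sum(weight_sums(f, examples, weights))

    ordered = sorted(features, key=lambda f: (-feature_weight(f), f))
    buckets = [[] for _ in range(n_buckets)]
    for i, f in enumerate(ordered):
        buckets[i % n_buckets].append(f)
    return buckets


def learn_rules_bucketed(examples, n_buckets, max_rules, eps=1e-3):
    weights = [1.0] * len(examples)
    features = {f for feats, _ in examples for f in feats}
    rule_store = []  # stands in for the rule data storage unit
    while features and len(rule_store) < max_rules:
        for bucket in distribute(features, examples, weights, n_buckets):
            if not bucket or len(rule_store) >= max_rules:
                continue
            # Only this bucket is searched for the maximum-gain feature.
            best = max(bucket, key=lambda f: gain(f, examples, weights))
            pos, neg = weight_sums(best, examples, weights)
            confidence = 0.5 * math.log((pos + eps) / (neg + eps))
            rule_store.append((best, confidence))
            weights = [
                w * math.exp(-label * confidence) if best in feats else w
                for (feats, label), w in zip(examples, weights)
            ]
    return rule_store, weights
```

    Searching one bucket at a time keeps each rule search to a fraction of the feature set, which is the apparent motivation for the distribution step.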

    COMPUTER-READABLE RECORD MEDIUM IN WHICH NAMED ENTITY EXTRACTION PROGRAM IS RECORDED, NAMED ENTITY EXTRACTION METHOD AND NAMED ENTITY EXTRACTION APPARATUS
    3.
    Patent application
    Status: Under examination (published)

    Publication number: US20080201134A1

    Publication date: 2008-08-21

    Application number: US12025482

    Filing date: 2008-02-04

    IPC classification: G10L11/06

    Abstract: A named entity extraction apparatus includes an extraction result acquisition unit for acquiring a named entity extraction result obtained as a result of a named entity extraction process; and a lexicon information creation unit for creating lexicon information which is utilized as clues in extracting named entities from text data, on the basis of the named entity extraction result acquired by said extraction result acquisition unit.
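
    One way to picture the lexicon creation step is the sketch below. It assumes, purely for illustration, that an extraction result is a list of (surface string, entity type) pairs and that the lexicon keeps the most frequent type per surface string; the abstract specifies neither. The names `create_lexicon`, `lexicon_clues`, and `min_count` are hypothetical.

```python
from collections import Counter, defaultdict


def create_lexicon(extraction_results, min_count=1):
    """Build a surface -> entity type lexicon from earlier extraction results."""
    counts = defaultdict(Counter)
    for surface, entity_type in extraction_results:
        counts[surface][entity_type] += 1
    lexicon = {}
    for surface, type_counts in counts.items():
        entity_type, n = type_counts.most_common(1)[0]
        if n >= min_count:  # drop entries seen too rarely to trust
            lexicon[surface] = entity_type
    return lexicon


def lexicon_clues(tokens, lexicon):
    """Attach the lexicon entry (or "O" for none) to each token as a clue."""
    return [(token, lexicon.get(token, "O")) for token in tokens]
```

    The clues returned by `lexicon_clues` would then be supplied as extra features to a later extraction pass.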

    Method and apparatus for extracting information, and computer product
    4.
    Patent application
    Status: Under examination (published)

    Publication number: US20050261889A1

    Publication date: 2005-11-24

    Application number: US10963372

    Filing date: 2004-10-12

    Applicant: Tomoya Iwakura

    Inventor: Tomoya Iwakura

    IPC classification: G06F17/27 G06F17/28

    CPC classification: G06F17/278

    Abstract: A generation-target selecting unit selects supervised data from a supervised-data storage unit. A supervised generation unit generates the supervised data to produce new supervised data. A validity determining unit makes a rule learning unit learn the generated data and the supervised data, and makes an extracting unit to extract information using test data to evaluate a result of extracting the information. When the result is improved compared with a result before adding the supervised data generated, the supervised data generated is taken as the correct supervised data.
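
    The validity check in this abstract amounts to keeping a generated example only when it improves the evaluation result. The sketch below shows that acceptance loop with hypothetical names throughout: `select_valid_data` is the loop itself, while `train`, `evaluate`, and `TEST_DATA` are toy stand-ins (a word-to-label dictionary as the "model"), not the rule learning or extracting units of the patent.

```python
def select_valid_data(supervised, candidates, train, evaluate):
    """Keep a generated example only if adding it improves the evaluation score."""
    accepted = list(supervised)
    best = evaluate(train(accepted))
    for candidate in candidates:
        score = evaluate(train(accepted + [candidate]))
        if score > best:  # improved compared with the result before adding
            accepted.append(candidate)
            best = score
    return accepted, best


# Toy stand-ins (hypothetical): a "model" is a word -> label dictionary.
TEST_DATA = [("Tokyo", "LOC"), ("Osaka", "LOC"), ("Kyoto", "LOC")]


def train(data):
    return dict(data)  # later entries override earlier ones


def evaluate(model):
    hits = sum(1 for word, label in TEST_DATA if model.get(word) == label)
    return hits / len(TEST_DATA)
```

    Retraining once per candidate is expensive; the sketch favors clarity over efficiency.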

    Information publication control method and apparatus, and information publication control instruction method, and apparatus
    5.
    Patent application
    Status: Under examination (published)

    Publication number: US20070198683A1

    Publication date: 2007-08-23

    Application number: US11443129

    Filing date: 2006-05-31

    IPC classification: G06F15/173

    CPC classification: G06F16/958

    Abstract: An object of the present invention is to carry out publication control for a portion of contents according to its valid period. This invention includes: reading out publication data including first data whose publication should be controlled, publication control condition data relating to a valid period of the first data, and second data whose publication does not have to be controlled from a publication data storage storing the publication data to judge whether or not a condition defined in the publication control condition data is satisfied; and upon detecting that the condition defined in the publication control condition data is satisfied, generating current publication data including the first data corresponding to the publication control condition data whose condition is judged to be satisfied and the second data and outputting the generated current publication data. In this way, when the publication of the first data is controlled based on the publication control condition data concerning the valid period, it becomes possible to control not to open information whose validity has been lost such as the contact telephone number to inquire the event, to the public, for example, after the event ended or the like.
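
    The valid-period check can be illustrated with a short sketch. The data model is an assumption: each piece of content is an `Item` whose optional `valid_from` and `valid_until` dates play the role of the publication control condition data, and items with neither date correspond to the uncontrolled second data. All names here are hypothetical.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class Item:
    text: str
    valid_from: Optional[date] = None   # None means no lower bound
    valid_until: Optional[date] = None  # None means no upper bound


def is_published(item: Item, today: date) -> bool:
    """Judge whether the item's valid-period condition is satisfied today."""
    if item.valid_from is not None and today < item.valid_from:
        return False
    if item.valid_until is not None and today > item.valid_until:
        return False
    return True


def current_publication(items: List[Item], today: date) -> List[str]:
    """Uncontrolled items always appear; controlled ones only while valid."""
    return [item.text for item in items if is_published(item, today)]
```

    With an event notice left uncontrolled and its inquiry telephone number given a `valid_until` date, the number drops out of the generated page once that date has passed.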

    Option display device, option display method, and computer product
    6.
    Patent application
    Status: Under examination (published)

    Publication number: US20080007484A1

    Publication date: 2008-01-10

    Application number: US11638391

    Filing date: 2006-12-14

    Applicant: Tomoya Iwakura

    Inventor: Tomoya Iwakura

    IPC classification: G09G5/00

    CPC classification: G06F17/2223 G06F3/018

    Abstract: In an information processing device, a display control unit arranges one of options at a predetermined center position, while arranging others radially around the center option, and displays the options in a selectable manner. Before a user provides input through an input unit by operating an arrow key, a cursor is placed on the center option. The options are displayed in a matrix, and the cursor is initially placed and displayed in the center of the matrix.
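
    A rough sketch of the matrix layout follows. It assumes an odd-sized square grid, puts the first option in the center cell, and fills the remaining cells in order of distance from the center; the abstract does not say how the surrounding options are ordered, so that ordering and the names `arrange_options`, `move_cursor`, and `MOVES` are hypothetical.

```python
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def arrange_options(options, size=3):
    """Place the first option at the center, the rest around it, nearest cells first."""
    if size % 2 == 0 or len(options) > size * size:
        raise ValueError("need an odd grid size large enough for all options")
    center = size // 2
    cells = [(r, c) for r in range(size) for c in range(size)]
    cells.sort(key=lambda rc: (abs(rc[0] - center) + abs(rc[1] - center), rc))
    grid = [[None] * size for _ in range(size)]
    for option, (r, c) in zip(options, cells):
        grid[r][c] = option
    return grid, (center, center)  # the cursor starts on the center option


def move_cursor(grid, cursor, key):
    """Move one cell in the arrow-key direction; stay put at edges and empty cells."""
    dr, dc = MOVES[key]
    r, c = cursor[0] + dr, cursor[1] + dc
    if 0 <= r < len(grid) and 0 <= c < len(grid[0]) and grid[r][c] is not None:
        return (r, c)
    return cursor
```

    Starting the cursor in the center means each of the four nearest options is one arrow-key press away.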

    Data conversion method and apparatus to partially hide data
    7.
    Patent application
    Status: In force

    Publication number: US20070220609A1

    Publication date: 2007-09-20

    Application number: US11450403

    Filing date: 2006-06-12

    IPC classification: H04N7/16 H04L9/32 G06F17/00

    CPC classification: G06F21/6245 G06F21/62

    Abstract: This invention provides a technique to correctly inform the human being of content of contents to be published but to prevent machines from collecting part of the contents whose distribution is not desired by the information provider. This invention includes: reading out contents data to be published, which includes text data, and identifying a character string whose output as the text data should be avoided from the contents data; converting the identified character string into substitution data other than the text data so as to maintain content of the identified character string; and generating publication contents data to maintain publication content of the contents data by using data other than the identified character string in the contents data and the substitution data. Thus, by carrying out such a processing, it becomes possible to conceal the character string against machines without changing the publication content for the human being.
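
    As an illustration only, the sketch below treats e-mail addresses in an HTML page as the strings to be hidden and swaps each one for an image reference. Both choices are assumptions, since the abstract names neither e-mail addresses nor HTML. Rendering the image files is left out; the function only returns which string belongs in which (hypothetical) file name. `EMAIL` and `hide_strings` are hypothetical names.

```python
import re

# Simple pattern for e-mail addresses; an assumption for this sketch only.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def hide_strings(html, pattern=EMAIL):
    """Replace each matched string with an <img> tag and list what must be rendered."""
    to_render = []  # (image file name, hidden string) pairs for a separate renderer

    def substitute(match):
        name = f"hidden_{len(to_render)}.png"
        to_render.append((name, match.group(0)))
        # Empty alt text, so the hidden string does not leak back into the markup.
        return f'<img src="{name}" alt="">'

    return pattern.sub(substitute, html), to_render
```

    A reader still sees the address once the images are rendered, while a crawler that collects text finds no address in the page source.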

    Data conversion method and apparatus to partially hide data
    8.
    Granted patent
    Status: In force

    Publication number: US07770112B2

    Publication date: 2010-08-03

    Application number: US11450403

    Filing date: 2006-06-12

    IPC classification: G06F17/00

    CPC classification: G06F21/6245 G06F21/62

    Abstract: This invention provides a technique to correctly inform the human being of content of contents to be published but to prevent machines from collecting part of the contents whose distribution is not desired by the information provider. This invention includes: reading out contents data to be published, which includes text data, and identifying a character string whose output as the text data should be avoided from the contents data; converting the identified character string into substitution data other than the text data so as to maintain content of the identified character string; and generating publication contents data to maintain publication content of the contents data by using data other than the identified character string in the contents data and the substitution data. Thus, by carrying out such a processing, it becomes possible to conceal the character string against machines without changing the publication content for the human being.

    Method and apparatus for creating index, and computer program product
    9.
    Patent application
    Status: Under examination (published)

    Publication number: US20080005151A1

    Publication date: 2008-01-03

    Application number: US11589403

    Filing date: 2006-10-30

    Applicant: Tomoya Iwakura

    Inventor: Tomoya Iwakura

    IPC classification: G06F7/00

    CPC classification: G06F16/319

    Abstract: An index-item extracting unit extracts an index item that forms an index of an electronic document, together with appearing position information of the index item, from the electronic document. An index-list creating unit creates link information that includes the appearing position in the electronic document of the extracted index item as a link, attaches the created link information to the index item, and creates an index list by arranging the index item to which the link information is attached.
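
    The two units in this abstract can be sketched as two small functions. Several details are assumptions made for illustration: the document is a list of paragraphs, the appearing position is the paragraph number, candidate index terms are supplied by the caller (the abstract does not say how index items are recognized), and links are HTML anchors of the form `#pN`. The names `extract_index_items` and `create_index_list` are hypothetical.

```python
def extract_index_items(paragraphs, terms):
    """Return {term: [paragraph numbers where the term appears]}."""
    items = {}
    for position, text in enumerate(paragraphs):
        lowered = text.lower()
        for term in terms:
            if term.lower() in lowered:  # simple case-insensitive substring match
                items.setdefault(term, []).append(position)
    return items


def create_index_list(items):
    """Attach a link for every appearing position and arrange the items alphabetically."""
    lines = []
    for term in sorted(items, key=str.lower):
        links = ", ".join(f'<a href="#p{pos}">{pos}</a>' for pos in items[term])
        lines.append(f"{term}: {links}")
    return lines
```

    Each line of the resulting list carries one link per occurrence, so a reader can jump from the index straight to the paragraph where the term appears.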
