INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD AND INFORMATION RECORDING MEDIUM
    1.
    发明申请
    INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD AND INFORMATION RECORDING MEDIUM 审中-公开
    信息处理设备,信息处理方法和信息记录介质

    公开(公告)号:US20130110499A1

    公开(公告)日:2013-05-02

    申请号:US13656893

    申请日:2012-10-22

    CPC classification number: G06F17/2785

    Abstract: A word string acquirer unit acquires a word string including a plurality of words. An extractor extracts partial strings including words contained in the word string acquired by the word string acquirer. A division pattern generator generates division patterns containing division flags indicating whether or not the word string acquired by the word string acquirer is divided at spaces between the words contained in the partial strings extracted by the extractor. The division probability coefficient acquirer acquires division probability coefficients indicating a degree of a certainty that the word string is divided with a division method indicated by the division patterns generated by the division pattern generator, for each of the partial strings extracted by the extractor. A partitioning unit partitions the word string based on the division probability coefficients acquired by the division probability coefficient acquirer.

    Abstract translation: 字串获取单元获取包括多个单词的单词串。 提取器提取部分字符串,包括由字串获取器获取的字串中包含的字。 分割图案生成器生成包含分割标志的分割模式,指示由字串获取器获取的字串是否被包含在由提取器提取的部分字符串中的单词之间的空格划分。 划分概率系数获取器获取指示由提取器提取的每个部分字符串由分割模式生成器生成的分割模式指示的字串被划分的确定性程度的划分概率系数。 分割单元基于由分割概率系数获取器获取的分割概率系数来划分字串。

    Text search apparatus and text search method
    2.
    发明授权
    Text search apparatus and text search method 有权
    文本搜索装置和文本搜索方法

    公开(公告)号:US08996571B2

    公开(公告)日:2015-03-31

    申请号:US13734174

    申请日:2013-01-04

    Inventor: Katsuhiko Satoh

    CPC classification number: G06F17/30542 G06F17/30622 G06F17/30675

    Abstract: The text search apparatus has an information storage that stores plural transposed indexes associating characters or character strings appearing in a document to be searched with the appearance positions of the characters or character strings. The transposed indexes were generated for a document in which beginning marks are added in front of texts to be subject to forward matching search. The incremental searcher of the text search apparatus adds a beginning mark in front of a search keyword and executes a forward matching search using a set of transposed indexes. The main searcher executes a partial match search using the same set of transposed indexes.

    Abstract translation: 文本搜索装置具有存储将出现在要搜索的文档中的字符或字符串与字符或字符串的出现位置相关联的多个转置索引的信息存储。 为文档生成转置索引,其中在文本前添加开始标记以进行转发匹配搜索。 文本搜索装置的增量搜索器在搜索关键字之前添加开始标记,并使用一组转置索引执行前向匹配搜索。 主搜索器使用相同的转置索引集执行部分匹配搜索。

    Search device, search method and recording medium
    4.
    发明授权
    Search device, search method and recording medium 有权
    搜索设备,搜索方法和记录介质

    公开(公告)号:US09292508B2

    公开(公告)日:2016-03-22

    申请号:US14137319

    申请日:2013-12-20

    Inventor: Katsuhiko Satoh

    CPC classification number: G06F17/30011 G06F17/30619

    Abstract: A search device comprises a memory device for storing document data containing search target character strings to which delimiting characters are appended at both ends; an acquirer for acquiring keywords; a generator for generating a search character string by appending delimiting characters to both ends of the keywords; a designator for designating appearance positions where those extracted partial strings from the search character string appear in the search target character string of the document data; a determiner for determining the frequency with which partial strings common to the partial strings of the search character string appear with a positional relationship similar to the search character string in the search target character string; an evaluator for evaluating the degree of similarity between the search target character string and the search character string; and an output device for outputting the search target character string.

    Abstract translation: 搜索装置包括存储装置,用于存储包含在两端附加有定界字符的搜索目标字符串的文档数据; 获取关键字的收购方; 生成器,用于通过将分隔符附加到关键字的两端来生成搜索字符串; 用于指定来自搜索字符串的那些提取的部分字符串出现在文档数据的搜索目标字符串中的外观位置的指示符; 确定器,用于确定与搜索目标字符串中的搜索字符串类似的位置关系出现搜索字符串的部分字符串共有的部分字符串的频率; 用于评估搜索目标字符串和搜索字符串之间的相似程度的评估器; 以及用于输出搜索目标字符串的输出装置。

Patent Agency Ranking