Information search apparatus, and information search method, and computer product
    1.
    发明授权
    Information search apparatus, and information search method, and computer product 有权
    信息搜索装置,信息搜索方法,计算机产品

    公开(公告)号:US09087118B2

    公开(公告)日:2015-07-21

    申请号:US12507680

    申请日:2009-07-22

    IPC分类号: G06F17/30

    CPC分类号: G06F17/3061 G06F17/30011

    摘要: A computer-readable recording medium stores therein an information search program that causes a computer to search for text items described in a text file. The information search program causes the computer to execute receiving input of a search keyword; searching an index file for a writing keyword that includes the search keyword, the index file including writing keywords described, for respective entries, in an order identical to the order in which the text items are described in the text file; identifying an entry that corresponds to the writing keyword retrieved at the searching; and outputting the identified entry.

    摘要翻译: 计算机可读记录介质中存储有使计算机搜索文本文件中描述的文本项的信息搜索程序。 信息搜索程序使计算机执行接收到搜索关键字的输入; 在与所述文本文件中描述的文本项目的顺序相同的顺序中搜索包括搜索关键字的所述索引文件的所述索引文件,所述索引文件包括针对各个条目描述的描述关键字的所述索引文件; 识别与在搜索中检索到的书写关键字相对应的条目; 并输出所识别的条目。

    Computer product, information processing apparatus, and information search apparatus
    2.
    发明授权
    Computer product, information processing apparatus, and information search apparatus 有权
    计算机产品,信息处理装置和信息搜索装置

    公开(公告)号:US08866647B2

    公开(公告)日:2014-10-21

    申请号:US13486192

    申请日:2012-06-01

    摘要: A recording medium stores an information processing program that causes a computer to execute storing a compression symbol map group having a bit string indicating for each character code, presence or absence of the character code in a file group, and a Huffman tree whose leaf corresponding to the character code has a pointer to a compression symbol map of the character code, the Huffman tree converting the character code into a compression symbol of the character code; compressing sequentially and according to the Huffman tree, a character code to be compressed and described in a file of the file group; detecting access to the leaf at the compressing; identifying by a pointer in the accessed leaf, a compression symbol map of the character code to be compressed; and updating a bit that indicates presence or absence of the character code to be compressed, in the identified compression symbol map.

    摘要翻译: 记录介质存储信息处理程序,使计算机执行存储具有针对每个字符代码指示的位串的压缩符号映射组,文件组中存在或不存在字符代码,以及与叶对应的霍夫曼树 所述字符代码具有指向所述字符代码的压缩符号映射的指针,所述霍夫曼树将所述字符代码转换为所述字符代码的压缩符号; 按照霍夫曼树顺序压缩要在文件组的文件中进行压缩和描述的字符代码; 在压缩时检测叶片的通路; 通过访问的叶中的指针识别要压缩的字符代码的压缩符号图; 并且在所识别的压缩符号图中更新指示要压缩的字符代码的存在或不存在的位。

    Computer product, information retrieval method, and information retrieval apparatus
    3.
    发明授权
    Computer product, information retrieval method, and information retrieval apparatus 失效
    计算机产品,信息检索方法和信息检索装置

    公开(公告)号:US08712977B2

    公开(公告)日:2014-04-29

    申请号:US12623025

    申请日:2009-11-20

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30979

    摘要: A computer-readable recording medium stores therein an information retrieval program that causes a computer to execute a retrieval process in which files to be retrieved are narrowed down by using a bit string for each character in the files to find characters making up a retrieval keyword to retrieve a keyword identical to or related to the retrieval keyword in the files to be retrieved. The bit strings indicate the presence of the characters in the files. The information retrieval program causes the computer to execute extracting, from among the bit strings, a bit string of an arbitrary character; and compressing the extracted bit string, by using a special Huffman tree having leaves of plural types of symbol strings covering patterns represented by a predetermined number of bits and a special symbol string having a number of bits greater than the predetermined number of bits.

    摘要翻译: 一种计算机可读记录介质,其中存储有使计算机执行检索处理的信息检索程序,其中通过使用文件中的每个字符的位串来缩小要检索的文件,以查找构成检索关键字的字符 检索与要检索的文件中的检索关键字相同或相关的关键字。 位串表示文件中存在字符。 信息检索程序使计算机执行从位串中提取任意字符的位串; 并且通过使用具有覆盖由预定位数表示的图案的多种类型的符号串的叶的特殊霍夫曼树和具有大于预定比特数的比特数的特殊符号串来压缩提取的比特串。

    Information retrieval apparatus, information retrieval method and computer product
    4.
    发明授权
    Information retrieval apparatus, information retrieval method and computer product 有权
    信息检索装置,信息检索方法和计算机产品

    公开(公告)号:US07882083B2

    公开(公告)日:2011-02-01

    申请号:US11985101

    申请日:2007-11-14

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30979

    摘要: An information retrieval apparatus includes contents, an index data generating unit, a character frequency management data generating unit, a compressing/encrypting unit, a retrieval initializing unit, a full text retrieving unit, and a retrieval result displaying unit. The character frequency management data generating unit generates character frequency management data based on the contents. The compressing/encrypting unit compresses the contents and encrypts the character frequency management data. The retrieval initializing unit decrypts encrypted character frequency management data. The full text retrieving unit executes full text retrieval for compressed contents using the character frequency management data and index data when receiving a retrieval keyword. The retrieval result displaying unit decompresses a retrieval candidate selected from retrieval candidates and displays as a retrieval result.

    摘要翻译: 信息检索装置包括内容,索引数据生成单元,字符频率管理数据生成单元,压缩/加密单元,检索初始化单元,全文检索单元和检索结果显示单元。 字符频率管理数据生成部基于内容生成字符频率管理数据。 压缩/加密单元压缩内容并加密字符频率管理数据。 检索初始化单元解密加密的字符频率管理数据。 全文检索单元在收到检索关键词时,使用字符频率管理数据和索引数据执行压缩内容的全文检索。 检索结果显示单元将从检索候选和显示中选择的检索候选解压缩作为检索结果。

    Information processing apparatus, information processing method, and computer product
    5.
    发明授权
    Information processing apparatus, information processing method, and computer product 有权
    信息处理装置,信息处理方法和计算机产品

    公开(公告)号:US07880648B2

    公开(公告)日:2011-02-01

    申请号:US12543243

    申请日:2009-08-18

    IPC分类号: H03M7/40

    CPC分类号: H03M7/40

    摘要: A computer-readable recording medium stores therein an information processing program that causes a computer to execute storing an aggregate of layers of nodes respectively having a pointer to an upper node, pointers to a leaf and/or a lower node and branches to lower nodes; obtaining a totaling result of appearance frequencies of character codes described in a file; classifying the character codes by layer, based on appearance probabilities thereof and the totaling result; calculating, based on a quantity of character codes in an ith layer and for the ith layer, a quantity of pointers pointing to leaves, and based on the quantity calculated and for the ith layer, further calculating a number of times nodes are used and a quantity of pointers pointing to lower nodes; generating, based on calculation results, a Huffman tree; and converting the Huffman tree into a node-less Huffman tree and storing the node-less Huffman tree.

    摘要翻译: 计算机可读记录介质存储信息处理程序,使计算机执行存储分别具有指向上位节点的指针,指向叶子和/或下层节点的指针的节点的聚合层,并分支到下层节点; 获得在文件中描述的字符代码的出现频率的总计结果; 根据其出现概率和总计结果逐层分类字符编码; 基于第i层和第i层中的字符代码的数量来计算指向叶的指针的数量,并且基于所计算的数量,并且对于第i层,进一步计算使用节点的次数和 指向下节点的指针数量; 根据计算结果生成霍夫曼树; 并将霍夫曼(Huffman)树转换成无节点霍夫曼(Huffman)树,并存储无节点霍夫曼(Huffman)树。

    INFORMATION SEARCHING APPARATUS, INFORMATION MANAGING APPARATUS, INFORMATION SEARCHING METHOD, INFORMATION MANAGING METHOD, AND COMPUTER PRODUCT
    6.
    发明申请
    INFORMATION SEARCHING APPARATUS, INFORMATION MANAGING APPARATUS, INFORMATION SEARCHING METHOD, INFORMATION MANAGING METHOD, AND COMPUTER PRODUCT 有权
    信息搜索装置,信息管理装置,信息搜索方法,信息管理方法和计算机产品

    公开(公告)号:US20090299973A1

    公开(公告)日:2009-12-03

    申请号:US12361316

    申请日:2009-01-28

    IPC分类号: G06F17/30 G06F12/08

    摘要: A computer-readable recording medium stores therein an information searching program that causes a computer having access to archives including a compressed file group of compressed files that are to be searched and that have described therein character strings, to execute: sorting the compressed files in descending order of access frequency of the compressed files; combining the compressed files in descending order of access frequency after the sorting at the sorting such that a storage capacity of a cache area for a storage area that stores therein the compressed file group is not exceeded by a combined size of the compressed files combined; and writing, from the storage area into the cache area, the compressed files combined at the combining, the compressed files combined being written prior to a search of the compressed files combined.

    摘要翻译: 一种计算机可读记录介质,其中存储有一个信息搜索程序,其使得能够访问存档的计算机包括要搜索的并且已经在其中描述的压缩文件的压缩文件组,其中描述了字符串,以执行:按降序排序压缩文件 压缩文件的访问次数顺序; 在分类排序之后按照访问频率的降序组合压缩文件,使得存储在其中的压缩文件组的存储区域的高速缓存区域的存储容量不被组合的压缩文件的组合大小超过; 以及从所述存储区域到所述高速缓存区域中将所述压缩文件合并在一起,所述压缩文件在组合的压缩文件的搜索之前被写入。

    INFORMATION RETRIEVAL METHOD, INFORMATION RETRIEVAL APPARATUS, AND COMPUTER PRODUCT
    7.
    发明申请
    INFORMATION RETRIEVAL METHOD, INFORMATION RETRIEVAL APPARATUS, AND COMPUTER PRODUCT 有权
    信息检索方法,信息检索装置和计算机产品

    公开(公告)号:US20090193020A1

    公开(公告)日:2009-07-30

    申请号:US12418886

    申请日:2009-04-06

    IPC分类号: G06F17/30

    摘要: An information retrieval apparatus includes an acquiring unit that acquires a numerical value defining a boundary of a numerical range; a detecting unit that detects a number of places in and a head numeral of the numerical value; an extracting unit that extracts from a bit string group, a bit string indicating whether a numerical value in a numerical value group having the number of places and the head numeral is present in files subject to retrieval; a specifying unit that specifies a file corresponding to a bit in the extracted bit string, the bit indicating the presence of a numerical value of the numerical value group; a determining unit that determines whether a numerical value in the specified file meets the boundary condition; and a designating unit that, based on a determination by the determining unit designates the specified file to have a numerical value within the numerical range.

    摘要翻译: 信息检索装置包括获取单元,获取定义数值范围的边界的数值; 检测单元,其检测数字的位置和数字的头数字; 提取单元,从位串组提取表示具有位数的数值组中的数值和头数的位串是否存在于被检索的文件中; 指定单元,其指定与提取的比特串中的比特相对应的文件,表示数值组的数值的存在的比特; 确定单元,确定所述指定文件中的数值是否满足所述边界条件; 以及指定单元,其基于所述确定单元的确定指定所述指定文件以具有所述数值范围内的数值。

    INFORMATION SEARCHING APPARATUS, INFORMATION MANAGING APPARATUS, INFORMATION SEARCHING METHOD, INFORMATION MANAGING METHOD, AND COMPUTER PRODUCT
    10.
    发明申请
    INFORMATION SEARCHING APPARATUS, INFORMATION MANAGING APPARATUS, INFORMATION SEARCHING METHOD, INFORMATION MANAGING METHOD, AND COMPUTER PRODUCT 审中-公开
    信息搜索装置,信息管理装置,信息搜索方法,信息管理方法和计算机产品

    公开(公告)号:US20120005172A1

    公开(公告)日:2012-01-05

    申请号:US13232089

    申请日:2011-09-14

    IPC分类号: G06F7/00 G06F17/30

    摘要: A computer-readable recording medium stores therein an information searching program that causes a computer having access to archives including a compressed file group of compressed files that are to be searched and that have described therein character strings, to execute: sorting the compressed files in descending order of access frequency of the compressed files; combining the compressed files in descending order of access frequency after the sorting at the sorting such that a storage capacity of a cache area for a storage area that stores therein the compressed file group is not exceeded by a combined size of the compressed files combined; and writing, from the storage area into the cache area, the compressed files combined at the combining, the compressed files combined being written prior to a search of the compressed files combined.

    摘要翻译: 一种计算机可读记录介质,其中存储有一个信息搜索程序,其使得能够访问存档的计算机包括要搜索的并且已经在其中描述的压缩文件的压缩文件组,其中描述了字符串,以执行:按降序排序压缩文件 压缩文件的访问次数顺序; 在分类排序之后按照访问频率的降序组合压缩文件,使得存储在其中的压缩文件组的存储区域的高速缓存区域的存储容量不被组合的压缩文件的组合大小超过; 以及从所述存储区域到所述高速缓存区域中将所述压缩文件合并在一起,所述压缩文件在组合的压缩文件的搜索之前被写入。