Apparatus for searching and managing compressed files
    1.
    发明授权
    Apparatus for searching and managing compressed files 有权
    用于搜索和管理压缩文件的设备

    公开(公告)号:US08037035B2

    公开(公告)日:2011-10-11

    申请号:US12361316

    申请日:2009-01-28

    IPC分类号: G06F7/00 G06F17/00 G06F17/30

    摘要: A computer-readable, non-transitory medium stores a program that manages compressed file groups on a plurality of slave servers. The file groups include compressed files that are to be searched and have character strings. Each of the compressed file groups is expanded, using a Huffman tree that was used for compressing the compressed file group. A common compression parameter is generated based on appearance frequency, by summing, for each character, the appearance frequency in each of the compressed file groups. The expanded files are recompressed using the common Huffman tree such that sums of the access frequencies of the compressed files that are origins of the recompressed files are substantially equivalent among various slave servers. New archives including the re-compressed files are transmitted to the respective slave servers.

    摘要翻译: 计算机可读的非暂时介质存储管理多个从属服务器上的压缩文件组的程序。 文件组包括要搜索并具有字符串的压缩文件。 使用用于压缩压缩文件组的霍夫曼树来扩展每个压缩文件组。 基于出现频率,通过对每个字符求和每个压缩文件组中的出现频率,生成公共压缩参数。 使用公共霍夫曼树对扩展的文件进行再压缩,使得作为重新压缩文件的来源的压缩文件的访问频率的总和在各种从属服务器之间基本相同。 包括重新压缩文件的新档案被传送到相应的从属服务器。

    INFORMATION SEARCHING APPARATUS, INFORMATION MANAGING APPARATUS, INFORMATION SEARCHING METHOD, INFORMATION MANAGING METHOD, AND COMPUTER PRODUCT
    2.
    发明申请
    INFORMATION SEARCHING APPARATUS, INFORMATION MANAGING APPARATUS, INFORMATION SEARCHING METHOD, INFORMATION MANAGING METHOD, AND COMPUTER PRODUCT 审中-公开
    信息搜索装置,信息管理装置,信息搜索方法,信息管理方法和计算机产品

    公开(公告)号:US20120005172A1

    公开(公告)日:2012-01-05

    申请号:US13232089

    申请日:2011-09-14

    IPC分类号: G06F7/00 G06F17/30

    摘要: A computer-readable recording medium stores therein an information searching program that causes a computer having access to archives including a compressed file group of compressed files that are to be searched and that have described therein character strings, to execute: sorting the compressed files in descending order of access frequency of the compressed files; combining the compressed files in descending order of access frequency after the sorting at the sorting such that a storage capacity of a cache area for a storage area that stores therein the compressed file group is not exceeded by a combined size of the compressed files combined; and writing, from the storage area into the cache area, the compressed files combined at the combining, the compressed files combined being written prior to a search of the compressed files combined.

    摘要翻译: 一种计算机可读记录介质,其中存储有一个信息搜索程序,其使得能够访问存档的计算机包括要搜索的并且已经在其中描述的压缩文件的压缩文件组,其中描述了字符串,以执行:按降序排序压缩文件 压缩文件的访问次数顺序; 在分类排序之后按照访问频率的降序组合压缩文件,使得存储在其中的压缩文件组的存储区域的高速缓存区域的存储容量不被组合的压缩文件的组合大小超过; 以及从所述存储区域到所述高速缓存区域中将所述压缩文件合并在一起,所述压缩文件在组合的压缩文件的搜索之前被写入。

    INFORMATION SEARCHING APPARATUS, INFORMATION MANAGING APPARATUS, INFORMATION SEARCHING METHOD, INFORMATION MANAGING METHOD, AND COMPUTER PRODUCT
    3.
    发明申请
    INFORMATION SEARCHING APPARATUS, INFORMATION MANAGING APPARATUS, INFORMATION SEARCHING METHOD, INFORMATION MANAGING METHOD, AND COMPUTER PRODUCT 有权
    信息搜索装置,信息管理装置,信息搜索方法,信息管理方法和计算机产品

    公开(公告)号:US20090299973A1

    公开(公告)日:2009-12-03

    申请号:US12361316

    申请日:2009-01-28

    IPC分类号: G06F17/30 G06F12/08

    摘要: A computer-readable recording medium stores therein an information searching program that causes a computer having access to archives including a compressed file group of compressed files that are to be searched and that have described therein character strings, to execute: sorting the compressed files in descending order of access frequency of the compressed files; combining the compressed files in descending order of access frequency after the sorting at the sorting such that a storage capacity of a cache area for a storage area that stores therein the compressed file group is not exceeded by a combined size of the compressed files combined; and writing, from the storage area into the cache area, the compressed files combined at the combining, the compressed files combined being written prior to a search of the compressed files combined.

    摘要翻译: 一种计算机可读记录介质,其中存储有一个信息搜索程序,其使得能够访问存档的计算机包括要搜索的并且已经在其中描述的压缩文件的压缩文件组,其中描述了字符串,以执行:按降序排序压缩文件 压缩文件的访问次数顺序; 在分类排序之后按照访问频率的降序组合压缩文件,使得存储在其中的压缩文件组的存储区域的高速缓存区域的存储容量不被组合的压缩文件的组合大小超过; 以及从所述存储区域到所述高速缓存区域中将所述压缩文件合并在一起,所述压缩文件在组合的压缩文件的搜索之前被写入。

    COMPUTER PRODUCT, INFORMATION RETRIEVING APPARATUS, AND INFORMATION RETRIEVAL METHOD
    4.
    发明申请
    COMPUTER PRODUCT, INFORMATION RETRIEVING APPARATUS, AND INFORMATION RETRIEVAL METHOD 有权
    计算机产品,信息检索设备和信息检索方法

    公开(公告)号:US20100131475A1

    公开(公告)日:2010-05-27

    申请号:US12622902

    申请日:2009-11-20

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30911

    摘要: A recording medium stores therein an information retrieval program that causes a computer to execute generating a Huffman tree based on an XML tag written in an XML file and an appearance frequency of character data exclusive of the XML tag; compressing the XML file using the Huffman tree; receiving a retrieval condition that includes a retrieval keyword and type information concerning the retrieval keyword; setting a decompression start flag for a compression code that is for an XML start tag related to the type information, the decompression start flag instructing commencement of decompression of a compression code string subsequent to the XML start tag; detecting, in the compressed XML file, the compression code for which the decompression start flag has been set; and decompressing, when the compression code for which the decompression start flag has been set is detected, the compression code string, using the Huffman tree.

    摘要翻译: 记录媒体存储信息检索程序,该程序使计算机执行基于XML文件中写入的XML标签和排除XML标签的字符数据的出现频率来生成霍夫曼树; 使用霍夫曼树压缩XML文件; 接收包括检索关键字的检索条件和关于检索关键字的类型信息; 设置用于与类型信息相关的XML开始标签的压缩代码的解压开始标志,所述解压开始标志指示开始解压缩XML开始标签之后的压缩代码串; 在压缩XML文件中检测已经设置了解压开始标志的压缩码; 并且当检测到已经设置了解压开始标志的压缩码时,解压缩压缩码串,使用霍夫曼树。

    CHARACTER SEQUENCE MAP GENERATING APPARATUS, INFORMATION SEARCHING APPARATUS, CHARACTER SEQUENCE MAP GENERATING METHOD, INFORMATION SEARCHING METHOD, AND COMPUTER PRODUCT
    5.
    发明申请
    CHARACTER SEQUENCE MAP GENERATING APPARATUS, INFORMATION SEARCHING APPARATUS, CHARACTER SEQUENCE MAP GENERATING METHOD, INFORMATION SEARCHING METHOD, AND COMPUTER PRODUCT 审中-公开
    特征序列生成装置,信息搜索装置,特征序列生成方法,信息搜索方法和计算机产品

    公开(公告)号:US20090299974A1

    公开(公告)日:2009-12-03

    申请号:US12362183

    申请日:2009-01-29

    IPC分类号: G06F17/30 G10L13/08

    摘要: A computer-readable recording medium stores therein a sequence-map generating program that causes a computer to execute extracting from files that include character strings written therein, a word having q (q≧2) characters; extracting from the word extracted at the extracting the word, consecutive characters from a character position s-th (1≦s≦q−r+1) from a head of the word to a character position determined by a number of characters r (r≦q); and generating, for each character position s-th from the head, a consecutive-character sequence map including a flag row that indicates, for each file, whether a file includes the consecutive characters extracted at the extracting the consecutive characters.

    摘要翻译: 计算机可读记录介质中存储有使计算机执行从其中写入的字符串的文件提取的序列映射生成程序,具有q(q> = 2)个字符的字; 从提取单词中提取的单词中提取出的字符,从单词的头部的字符位置s(1 <= s <= q-r + 1)的连续字符到由字符数r确定的字符位置 (r <= q); 并且对于从头部开始的每个字符位置,生成包括标记行的连续字符序列图,该标志行针对每个文件指示文件是否包括在提取连续字符时提取的连续字符。

    Information retrieval apparatus, information retrieval method and computer product
    6.
    发明申请
    Information retrieval apparatus, information retrieval method and computer product 有权
    信息检索装置,信息检索方法和计算机产品

    公开(公告)号:US20080098024A1

    公开(公告)日:2008-04-24

    申请号:US11985101

    申请日:2007-11-14

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30979

    摘要: An information retrieval apparatus includes contents, an index data generating unit, a character frequency management data generating unit, a compressing/encrypting unit, a retrieval initializing unit, a full text retrieving unit, and a retrieval result displaying unit. The character frequency management data generating unit generates character frequency management data based on the contents. The compressing/encrypting unit compresses the contents and encrypts the character frequency management data. The retrieval initializing unit decrypts encrypted character frequency management data. The full text retrieving unit executes full text retrieval for compressed contents using the character frequency management data and index data when receiving a retrieval keyword. The retrieval result displaying unit decompresses a retrieval candidate selected from retrieval candidates and displays as a retrieval result.

    摘要翻译: 信息检索装置包括内容,索引数据生成单元,字符频率管理数据生成单元,压缩/加密单元,检索初始化单元,全文检索单元和检索结果显示单元。 字符频率管理数据生成部基于内容生成字符频率管理数据。 压缩/加密单元压缩内容并加密字符频率管理数据。 检索初始化单元解密加密的字符频率管理数据。 全文检索单元在收到检索关键词时,使用字符频率管理数据和索引数据执行压缩内容的全文检索。 检索结果显示单元将从检索候选和显示中选择的检索候选解压缩作为检索结果。

    Semantic retrieval method and computer product
    7.
    发明申请
    Semantic retrieval method and computer product 审中-公开
    语义检索方法和计算机产品

    公开(公告)号:US20080082318A1

    公开(公告)日:2008-04-03

    申请号:US11974184

    申请日:2007-10-11

    IPC分类号: G06F17/27

    摘要: A dictionary server includes a retrieval-display processing unit. Upon receipt of a request for retrieval of semantic information related to a term from a client PC, the retrieval-display processing unit acquires the semantic information, header information, and link information related to the semantic information from knowledge reference data, dictionary content data, and dictionary data. Based on the acquired information, the retrieval-display processing unit causes the client PC to display items on webpage related to the semantic information, the header information, and the link information.

    摘要翻译: 字典服务器包括检索显示处理单元。 检索显示处理单元在从客户端PC接收到与术语相关的语义信息的检索请求时,从知识参考数据,词典内容数据,语义信息中获取与语义信息相关的语义信息,头信息和链接信息, 和字典数据。 基于所获取的信息,检索显示处理单元使客户PC显示与语义信息相关的网页上的项目,头信息和链接信息。

    Communication server, mobile communication terminal, communication method, and computer product
    8.
    发明授权
    Communication server, mobile communication terminal, communication method, and computer product 有权
    通信服务器,移动通信终端,通信方式和计算机产品

    公开(公告)号:US09008621B2

    公开(公告)日:2015-04-14

    申请号:US12335322

    申请日:2008-12-15

    CPC分类号: G06F21/10 H04W12/02

    摘要: A method of controlling decompression, wherein the method includes: transmitting, by a first computer that already has stored therein compressed data that are compressed based on compression parameters, identification information for identifying the first computer to a second computer that stores therein the compression parameters; and encrypting, by the second computer, the compression parameters using the identification information received from the first computer. The compression parameters include at least a frequency of appearance and an allocated sign for each piece of character data. The method also includes: transmitting, by the second computer, the encrypted compression parameters to the first computer; decrypting, by the first computer, the encrypted compression parameters received from the second computer using the identification information; and decompressing, by the first computer, the compressed data based on the decrypted compression parameters.

    摘要翻译: 一种控制解压缩的方法,其特征在于,所述方法包括:通过已经存储了压缩数据的第一计算机根据压缩参数压缩的识别信息,用于将第一计算机识别为存储有压缩参数的第二计算机; 以及使用从第一计算机接收的识别信息,由第二计算机加密压缩参数。 压缩参数至少包括出现频率和每个字符数据的分配符号。 该方法还包括:由第二计算机将加密的压缩参数发送到第一计算机; 由第一计算机使用识别信息解密从第二计算机接收的加密压缩参数; 并且由第一计算机基于解密的压缩参数解压缩压缩数据。

    Computer product, information retrieving apparatus, and information retrieval method
    9.
    发明授权
    Computer product, information retrieving apparatus, and information retrieval method 有权
    计算机产品,信息检索装置和信息检索方法

    公开(公告)号:US08595196B2

    公开(公告)日:2013-11-26

    申请号:US12622902

    申请日:2009-11-20

    IPC分类号: G06F17/00 G06F17/30 G06F7/00

    CPC分类号: G06F17/30911

    摘要: A recording medium stores therein an information retrieval program that causes a computer to execute generating a Huffman tree based on an XML tag written in an XML file and an appearance frequency of character data exclusive of the XML tag; compressing the XML file using the Huffman tree; receiving a retrieval condition that includes a retrieval keyword and type information concerning the retrieval keyword; setting a decompression start flag for a compression code that is for an XML start tag related to the type information, the decompression start flag instructing commencement of decompression of a compression code string subsequent to the XML start tag; detecting, in the compressed XML file, the compression code for which the decompression start flag has been set; and decompressing, when the compression code for which the decompression start flag has been set is detected, the compression code string, using the Huffman tree.

    摘要翻译: 记录媒体存储信息检索程序,该程序使计算机执行基于XML文件中写入的XML标签和排除XML标签的字符数据的出现频率来生成霍夫曼树; 使用霍夫曼树压缩XML文件; 接收包括检索关键字的检索条件和关于检索关键字的类型信息; 设置用于与类型信息相关的XML开始标签的压缩代码的解压开始标志,所述解压开始标志指示开始解压缩XML开始标签之后的压缩代码串; 在压缩XML文件中检测已经设置了解压开始标志的压缩码; 并且当检测到已经设置了解压开始标志的压缩码时,解压缩压缩码串,使用霍夫曼树。

    INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND COMPUTER PRODUCT
    10.
    发明申请
    INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND COMPUTER PRODUCT 有权
    信息处理设备,信息处理方法和计算机产品

    公开(公告)号:US20100085222A1

    公开(公告)日:2010-04-08

    申请号:US12543243

    申请日:2009-08-18

    IPC分类号: H03M7/40

    CPC分类号: H03M7/40

    摘要: A computer-readable recording medium stores therein an information processing program that causes a computer to execute storing an aggregate of layers of nodes respectively having a pointer to an upper node, pointers to a leaf and/or a lower node and branches to lower nodes; obtaining a totaling result of appearance frequencies of character codes described in a file; classifying the character codes by layer, based on appearance probabilities thereof and the totaling result; calculating, based on a quantity of character codes in an ith layer and for the ith layer, a quantity of pointers pointing to leaves, and based on the quantity calculated and for the ith layer, further calculating a number of times nodes are used and a quantity of pointers pointing to lower nodes; generating, based on calculation results, a Huffman tree; and converting the Huffman tree into a node-less Huffman tree and storing the node-less Huffman tree.

    摘要翻译: 计算机可读记录介质存储信息处理程序,使计算机执行存储分别具有指向上位节点的指针,指向叶子和/或下层节点的指针的节点的聚合层,并分支到下层节点; 获得文件中描述的字符代码的出现频率的总计结果; 根据其出现概率和总计结果逐层分类字符编码; 基于第i层和第i层中的字符代码的数量来计算指向叶的指针的数量,并且基于所计算的数量,并且对于第i层,进一步计算使用节点的次数和 指向下节点的指针数量; 根据计算结果生成霍夫曼树; 并将霍夫曼(Huffman)树转换成无节点霍夫曼(Huffman)树,并存储无节点霍夫曼(Huffman)树。