-
公开(公告)号:US20190079947A1
公开(公告)日:2019-03-14
申请号:US15701994
申请日:2017-09-12
发明人: Ole Asmussen , Robert Beiderbeck , Erik Rueger , Markus Schäfer
CPC分类号: G06F16/1744 , G06F3/0608 , G06F3/0641 , G06F3/067 , G06F3/0682 , G06F11/1453 , G06F16/1752 , H03M7/3091 , H03M7/3095
摘要: A method and system for improving tape drive memory storage is provided. The method includes receiving, by a storage tape drive, a data stream for storage. The data stream is passed through a non-volatile memory device (NVS2) of the storage tape drive. The data stream is divided into adjacent variable length data chunks and a chunk list file including similarity identifiers for each of the adjacent variable length data chunks is generated and stored within a (non-volatile memory device) NVS1. Duplicate data including duplicated data with respect to a group of data chunks of the adjacent variable length data chunks is identified and deleted from the NVS2 of the storage tape drive such that the group of data chunks remains within NVS2. The group of data chunks is written to a data storage tape cartridge. Pointers identifying each data chunk and an associated storage position are generated and stored.
-
公开(公告)号:US09054730B2
公开(公告)日:2015-06-09
申请号:US14474680
申请日:2014-09-02
申请人: Compart AG
发明人: Jing Huang
CPC分类号: H03M7/3095 , H03M7/3088
摘要: A computer implemented method, computer program product, and system for decompressing compressed data. The system (400) has an interface component (420) configured to access and read a compressed data set (310) encoded with a LZW based compression algorithm. The compressed data set (310) includes a sequence of elements including byte characters (B) and symbols (S1/S2), each symbol corresponding to an encoded subset of the data set, and each symbol having a first portion (S1) including a previous symbol or a byte character and having a second portion (S2) including a byte character. The system further has a decoder (410) configured to generate a decompressed data set (320) by applying a LZW based decompression algorithm to the compressed data set (310) The decoder (410) makes use of a reverse order stack (440) configured for intermediate storage of byte characters decoded by the decoder (410) from the compressed data set (310). In one embodiment the system further has a dictionary which can store the full content values of symbols.
摘要翻译: 计算机实现的方法,计算机程序产品和用于解压缩压缩数据的系统。 系统(400)具有被配置为访问和读取用基于LZW的压缩算法编码的压缩数据集(310)的接口组件(420)。 压缩数据集(310)包括包括字节字符(B)和符号(S1 / S2)的元素序列,每个符号对应于数据集的编码子集,并且每个符号具有第一部分(S1),其包括 先前符号或字节字符,并具有包括字节字符的第二部分(S2)。 该系统还具有一个解码器(410),被配置为通过对压缩数据集(310)应用基于LZW的解压缩算法来生成解压缩数据集(320)。解码器(410)利用配置的反向顺序堆栈(440) 用于中间存储由解码器(410)从压缩数据集(310)解码的字节字符。 在一个实施例中,系统还具有可以存储符号的全部内容值的字典。
-
公开(公告)号:US20120281920A1
公开(公告)日:2012-11-08
申请号:US13384951
申请日:2010-01-29
IPC分类号: G06K9/18
CPC分类号: H03M7/3093 , G11B20/182 , G11B2220/80 , H03M7/3095 , H03M7/40 , H04N1/00045 , H04N1/32256
摘要: A parallel test payload includes a bit sequence configured to be segmented into a plurality of sub-sequences having variable bit length carriers. Respective carriers are represented uniformly in each one of the plurality of sub-sequences.
摘要翻译: 并行测试有效载荷包括被配置为被分段成具有可变位长度载波的多个子序列的比特序列。 各载波在多个子序列的每一个中均匀地表示。
-
公开(公告)号:US12008255B2
公开(公告)日:2024-06-11
申请号:US17538385
申请日:2021-11-30
申请人: PURE STORAGE, INC.
发明人: Ethan Miller , John Colgrove
CPC分类号: G06F3/0641 , G06F3/0608 , G06F3/0673 , G06F16/215 , G06F21/602 , H03M7/3095
摘要: Preparing data for deduplication including: generating, by a storage system for a compressed data block, a padded compressed data block by padding the compressed data block to conform to a fixed block size, wherein the fixed block size is greater than a size of the compressed data block; storing, in the storage system, the padded compressed data block beginning at a block boundary of a storage device in the storage system; and performing block-based deduplication on the storage system, wherein the block-based deduplication determines whether the padded compressed data block matches one or more other padded compressed data blocks stored in the storage system.
-
公开(公告)号:US09753666B2
公开(公告)日:2017-09-05
申请号:US14671929
申请日:2015-03-27
申请人: Intel Corporation
发明人: Vinodh Gopal , Kirk S. Yap , James D. Guilford , Jawad B. Khan
IPC分类号: G06F13/00 , G06F3/06 , H03M7/30 , G06F12/0875 , G06F12/0895
CPC分类号: G06F3/0638 , G06F3/061 , G06F3/0671 , G06F12/0875 , G06F12/0895 , G06F2212/401 , G06F2212/60 , H03M7/3086 , H03M7/3095 , H03M7/6058
摘要: Compression and decompression technology within a solid-state device (SSD) is disclosed that provides a good compression ratio while taking up less on-chip area. An input interface receives an input stream to be compressed. An output interface provides a compressed stream. A history buffer is of a fixed size that is a fraction of a size of a data buffer. Processing logic encodes into the compressed stream element types, literals and pointers, the latter which reference copies of data found elsewhere within the history buffer during compression. The history buffer may be multiple banks in width, where the data is loaded from the input stream sequentially across rows of the banks. The decompression side may be similarly designed, optionally with a different number of banks. The pointers may be a fixed two bytes including four bits for length and eleven bits for offset of back reference to a copy (or other combination).
-
公开(公告)号:US20150130646A1
公开(公告)日:2015-05-14
申请号:US14474680
申请日:2014-09-02
申请人: Compart AG
发明人: Jing HUANG
IPC分类号: H03M7/30
CPC分类号: H03M7/3095 , H03M7/3088
摘要: A computer implemented method, computer program product, and system for decompressing compressed data. The system (400) has an interface component (420) configured to access and read a compressed data set (310) encoded with a LZW based compression algorithm. The compressed data set (310) includes a sequence of elements including byte characters (B) and symbols (S1/S2), each symbol corresponding to an encoded subset of the data set, and each symbol having a first portion (S1) including a previous symbol or a byte character and having a second portion (S2) including a byte character. The system further has a decoder (410) configured to generate a decompressed data set (320) by applying a LZW based decompression algorithm to the compressed data set (310) The decoder (410) makes use of a reverse order stack (440) configured for intermediate storage of byte characters decoded by the decoder (410) from the compressed data set (310). In one embodiment the system further has a dictionary which can store the full content values of symbols.
摘要翻译: 计算机实现的方法,计算机程序产品和用于解压缩压缩数据的系统。 系统(400)具有被配置为访问和读取用基于LZW的压缩算法编码的压缩数据集(310)的接口组件(420)。 压缩数据集(310)包括包括字节字符(B)和符号(S1 / S2)的元素序列,每个符号对应于数据集的编码子集,并且每个符号具有第一部分(S1),其包括 先前符号或字节字符,并具有包括字节字符的第二部分(S2)。 该系统还具有一个解码器(410),被配置为通过对压缩数据集(310)应用基于LZW的解压缩算法来生成解压缩数据集(320)。解码器(410)利用配置的反向顺序堆栈(440) 用于中间存储由解码器(410)从压缩数据集(310)解码的字节字符。 在一个实施例中,系统还具有可以存储符号的全部内容值的字典。
-
公开(公告)号:US20140172795A1
公开(公告)日:2014-06-19
申请号:US14186226
申请日:2014-02-21
发明人: Xindong TENG
CPC分类号: G06F17/30575 , H03M7/30 , H03M7/3091 , H03M7/3095 , H04L69/04
摘要: Embodiments of the present invention provide a data processing method and a data processing device. If to-be-compressed data includes a data fragment that is the same as a first half of a variable block in a duplicate database and is different from a second half of the variable block, a new variable block with granularity smaller than that of the matching variable block can be generated, and the new variable block is added to the duplicate database. The granularity of the new variable block is smaller, which increases the probability of subsequent to-be-compressed data matching the updated duplicate database, thereby improving compression efficiency.
摘要翻译: 本发明的实施例提供一种数据处理方法和数据处理装置。 如果要被压缩的数据包括与重复数据库中的可变块的前半部分相同的数据片段,并且与可变块的后半部分不同,则新的可变块的粒度小于 可以生成匹配变量块,并将新的变量块添加到重复数据库。 新可变块的粒度较小,这增加了后续的被压缩数据与更新的重复数据库匹配的概率,从而提高了压缩效率。
-
公开(公告)号:US20240319901A1
公开(公告)日:2024-09-26
申请号:US18732515
申请日:2024-06-03
申请人: PURE STORAGE, INC.
发明人: ETHAN MILLER , JOHN COLGROVE
IPC分类号: G06F3/06 , G06F16/215 , G06F21/60 , H03M7/30
CPC分类号: G06F3/0641 , G06F3/0608 , G06F3/0673 , G06F16/215 , G06F21/602 , H03M7/3095
摘要: Preparing data for deduplication including: generating, by a storage system for a compressed data block, a padded compressed data block by padding the compressed data block to conform to a fixed block size, wherein the fixed block size is greater than a size of the compressed data block; storing, in the storage system, the padded compressed data block beginning at a block boundary of a storage device in the storage system; and performing block-based deduplication on the storage system, wherein the block-based deduplication determines whether the padded compressed data block matches one or more other padded compressed data blocks stored in the storage system.
-
公开(公告)号:US20170155402A1
公开(公告)日:2017-06-01
申请号:US15321175
申请日:2015-06-26
发明人: Tuomas KARKKAINEN , Ossi KALEVO
IPC分类号: H03M7/30
CPC分类号: H03M7/3095 , H03M7/3084 , H03M7/3091 , H03M7/6005 , H03M7/6011 , H03M7/6058
摘要: An encoder for encoding input data (D1) to generate corresponding encoded data (E2) includes data processing hardware which is operable: (a) to determine at least partial reoccurrences of data blocks or data packets within the input data (D1), wherein the data blocks or data packets include a plurality of bytes; (b) to employ at least one reference symbol to relate reoccurrences of mutually similar data blocks or data packets and/or to indicate whether or not there are reoccurrences of mutually similar data blocks or data packets within the input data (D1); (c) to employ a plurality of change symbols, for example a plurality of mask bits, to indicate changed and unchanged data elements of partial reoccurrences of data blocks or data packets within the input data (D1) and a change of data values of changed data elements; and (d) to encode the at least one reference symbol and the plurality of change symbols into the encoded data (E2). There are provided methods of using the encoder to encode input data (D1) to generate the corresponding encoded data (E2). Moreover, there are provided a corresponding decoder, and a corresponding method of decoding the encoded data (E2) to generate corresponding decoded data (D3).
-
公开(公告)号:US20170026054A1
公开(公告)日:2017-01-26
申请号:US15160105
申请日:2016-05-20
申请人: FUJITSU LIMITED
发明人: Jun-ichi Nagata
IPC分类号: H03M7/30
CPC分类号: H03M7/30 , G06F17/40 , H03M7/3059 , H03M7/3088 , H03M7/3095 , H03M7/50 , H03M7/55
摘要: A data compression device including a processor to perform a procedure comprising: obtaining data of a predetermined number (Z) of digits in a time series; and performing a compression process on the data. The data is obtained by encoding a vibration state of a measurement target. The compression process includes: deleting upper digits when the upper digits do not include significant information; and adding a unique code to a top of the upper digits when the upper digits include significant information. A digit number (X) of the upper digits is smaller than the predetermined number (Z).
摘要翻译: 一种数据压缩装置,包括处理器,用于执行程序,包括:以时间序列获得预定数量(Z)的数字的数据; 并对数据执行压缩处理。 通过编码测量对象的振动状态来获得数据。 压缩过程包括:当高位数字不包含重要信息时,删除高位数字; 并且当高位数字包含重要信息时,将唯一的代码添加到高位数字的顶部。 高位数字的数字(X)小于预定数(Z)。
-
-
-
-
-
-
-
-
-