-
1.
公开(公告)号:US20240361937A1
公开(公告)日:2024-10-31
申请号:US18309463
申请日:2023-04-28
Applicant: DELL PRODUCTS L.P.
Inventor: Smriti Thakkar , Kao-Feng Hsieh , Zachary Tom , Tony Wong
IPC: G06F3/06
CPC classification number: G06F3/0641 , G06F3/0619 , G06F3/0689
Abstract: A system and method for an estimation of error bounds for file size calculations using MinHash in deduplication systems. The system includes one or more processors to determine a similarity score between the first file and the second file. The one or more processors are further to determine a size estimation of a combination of the first and second files based on the similarity score. Finally, the one or more processors are to determine a maximum error for the size estimation of the combination of the first and second files, wherein the first and second file are to be combined via deduplication and have at least one shared data segment.
-
公开(公告)号:US20240361936A1
公开(公告)日:2024-10-31
申请号:US18308035
申请日:2023-04-27
Applicant: Dell Products L.P.
Inventor: Charles William Kaufman , Naizhong Chiu , Jamie Pocas , Saoni Mukherjee , Radia Joy Perlman , James McCoy , Ashok Tamilarasan
IPC: G06F3/06
CPC classification number: G06F3/0641 , G06F3/0623 , G06F3/0665 , G06F3/0679
Abstract: A method for operating a storage array includes: generating a per-chunk key and a deduplication key for a data chunk based on a hash value, wherein the per-chunk key is stored in a metadata file; obtaining a per-volume key for a volume of the storage array; encrypting a metadata folder associated with the volume to generate an encrypted metadata folder using the per-volume key, in which the encrypted metadata folder comprises at least the metadata file; compressing the data chunk to generate a compressed data chunk (CDC); encrypting the CDC to generate an encrypted compressed data chunk (ECDC) using the per-chunk key, in which the ECDC is stored in the volume; obtaining storage location information of the ECDC; updating a hash value lookup table based on the storage location information; and initiating notification of a user of a client about completion of a write request.
-
公开(公告)号:US12105972B2
公开(公告)日:2024-10-01
申请号:US18119162
申请日:2023-03-08
Applicant: Hitachi, Ltd.
Inventor: Ryosuke Tatsumi , Tomohiro Yoshihara , Takaki Matsushita , Yusuke Yamaga , Naoyuki Masuda
IPC: G06F3/06
CPC classification number: G06F3/0638 , G06F3/0611 , G06F3/0641 , G06F3/0679
Abstract: A snapshot virtual device (SS-VDEV) is prepared for each snapshot family (SS-Family) and a deduplication virtual device is prepared apart from the SS-VDEV. When the same data is in a plurality of VOLs of the SS-Family, a storage system maps a plurality of addresses of the same data among the plurality of VOLs to address of the SS-VDEVs of the SS-Family. When duplicated data is in two or more SS-VDEVs, the storage system maps two or more addresses of the duplicated data of the two or more SS-VDEVs to addresses corresponding to the duplicated data among the deduplication virtual devices.
-
公开(公告)号:US12105636B2
公开(公告)日:2024-10-01
申请号:US17715459
申请日:2022-04-07
Applicant: Dell Products L.P.
Inventor: Ashok Tamilarasan , Philippe Armangau , Vamsi K. Vankamamidi
IPC: G06F13/28 , G06F3/06 , G06F12/0871 , G06F12/0888 , G06F12/1045
CPC classification number: G06F12/1054 , G06F3/0611 , G06F3/0641 , G06F3/0683 , G06F12/0871 , G06F12/0888 , G06F13/28 , G06F2212/604
Abstract: A data storage system can include a deduplicated data cache used to store unique deduplicated data portions. Data portions can be promoted to the deduplicated data cache in connection with servicing I/O operations. Servicing the I/O operation that reads data from, or writes data to, a logical address can include determining whether a data portion stored at the logical address meets criteria for promoting the data portion to a deduplicated data cache. The criteria can include a condition that the data portion is a duplicate of content stored at multiple logical addresses, and can include a condition that the data portion has a reference count that is at least a minimum threshold where the reference count denotes a number of logical addresses at which the data portion is stored. Responsive to determining the data portion meets the criteria, the data portion can be stored in the deduplicated data cache.
-
公开(公告)号:US12093550B2
公开(公告)日:2024-09-17
申请号:US17386873
申请日:2021-07-28
Applicant: EMC IP Holding Company LLC
Inventor: John H. Lee , Michael C. Brundage , Nagapraveen V. Seela , Alan L. Taylor , William C. Davenport
CPC classification number: G06F3/0641 , G06F3/061 , G06F3/0631 , G06F3/0659 , G06F3/0683 , G06F16/137 , G06F16/2246
Abstract: A method performed by a data storage apparatus of maintaining filesystem object attributes in connection with a plurality of drivers is provided. Each driver of the plurality is configured to operate on a respective set of filesystem objects. The method includes (a) for each driver of the plurality, creating a respective driver-specific storage volume; (b) storing, within each driver-specific storage volume, attributes specific to respective filesystem objects operated on by the driver of that driver-specific storage volume; and (c) while operating a particular driver of the plurality on a particular filesystem object, using values of the stored attributes specific to that particular filesystem object operated on by that particular driver. An apparatus, system, and computer program product for performing a similar method are also provided.
-
6.
公开(公告)号:US12073102B2
公开(公告)日:2024-08-27
申请号:US18340853
申请日:2023-06-24
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Kun Guan , Shaohui Quan , Liyu Wang , Jianqiang Shen
CPC classification number: G06F3/0641 , G06F3/0608 , G06F3/0673 , H03M7/30 , G06F3/0674 , G06F3/0676
Abstract: In a method for compressing data, a storage system determines whether deduplication can be performed on a target data block. When deduplication cannot be performed on the target data block, the storage system obtaining a similar fingerprint of the target data block, and determines, based on the similar fingerprint, a combined data block group to which the target data block belongs. The storage system then performs similar compression on the target data block based on a reference block in the combined data block group.
-
公开(公告)号:US20240256490A1
公开(公告)日:2024-08-01
申请号:US18432489
申请日:2024-02-05
Applicant: Commvault Systems, Inc.
Inventor: Anand PRAHLAD , Marcus Muller , Rajiv KOTTOMTHARAYIL , Srinivas KAVURI , Parag GOKHALE , Manoj Kumar VIJAYAN
IPC: G06F16/174 , G06F3/06 , G06F11/34 , G06F16/11 , G06F16/182 , G06F16/41 , G06Q30/02 , G06Q30/0201 , G06Q50/18 , H04L9/40 , H04L67/02 , H04L67/06 , H04L67/1095 , H04L67/1097 , H04L67/50 , H04L67/56 , H04L67/5682 , H04L69/08
CPC classification number: G06F16/1748 , G06F3/0605 , G06F3/061 , G06F3/0626 , G06F3/0631 , G06F3/0641 , G06F3/0649 , G06F3/0667 , G06F3/067 , G06F16/122 , G06F16/1827 , G06F16/1844 , G06F16/41 , G06Q30/02 , G06Q30/0206 , G06Q50/188 , H04L63/0428 , H04L67/1095 , H04L67/1097 , H04L67/56 , H04L67/5682 , G06F3/06 , G06F11/3485 , H04L67/02 , H04L67/06 , H04L67/535 , H04L69/08
Abstract: Data storage operations, including content-indexing, containerized deduplication, and policy-driven storage, are performed within a cloud environment. The systems support a variety of clients and cloud storage sites that may connect to the system in a cloud environment that requires data transfer over wide area networks, such as the Internet, which may have appreciable latency and/or packet loss, using various network protocols, including HTTP and FTP. Methods are disclosed for content indexing data stored within a cloud environment to facilitate later searching, including collaborative searching. Methods are also disclosed for performing containerized deduplication to reduce the strain on a system namespace, effectuate cost savings, etc. Methods are disclosed for identifying suitable storage locations, including suitable cloud storage sites, for data files subject to a storage policy. Further, systems and methods for providing a cloud gateway and a scalable data object store within a cloud environment are disclosed, along with other features.
-
公开(公告)号:US20240241761A1
公开(公告)日:2024-07-18
申请号:US18618901
申请日:2024-03-27
Applicant: Intel Corporation
Inventor: Mohan J. Kumar , Murugasamy K. Nachimuthu , Krishna Bhuyan
IPC: G06F9/50 , G06F3/06 , G06F7/06 , G06F8/65 , G06F8/654 , G06F8/656 , G06F8/658 , G06F9/38 , G06F9/4401 , G06F9/455 , G06F9/48 , G06F9/54 , G06F11/07 , G06F11/30 , G06F11/34 , G06F12/02 , G06F12/06 , G06F13/16 , G06F16/174 , G06F21/57 , G06F21/62 , G06F21/73 , G06F21/76 , G06T1/20 , G06T1/60 , G06T9/00 , H01R13/453 , H01R13/631 , H03K19/173 , H03M7/30 , H03M7/40 , H03M7/42 , H04L9/08 , H04L12/28 , H04L12/46 , H04L41/044 , H04L41/0816 , H04L41/0853 , H04L41/12 , H04L43/04 , H04L43/06 , H04L43/08 , H04L43/0894 , H04L47/20 , H04L47/2441 , H04L49/104 , H04L61/5007 , H04L67/10 , H04L67/1014 , H04L67/63 , H04L67/75 , H05K7/14 , G06F11/14 , G06F15/80 , G06F16/28 , H04L9/40 , H04L41/046 , H04L41/0896 , H04L41/142 , H04L47/78 , H04Q11/00
CPC classification number: G06F9/505 , G06F3/0604 , G06F3/0608 , G06F3/0611 , G06F3/0613 , G06F3/0617 , G06F3/0641 , G06F3/0647 , G06F3/065 , G06F3/0653 , G06F3/067 , G06F7/06 , G06F8/65 , G06F8/654 , G06F8/656 , G06F8/658 , G06F9/3851 , G06F9/3891 , G06F9/4401 , G06F9/45533 , G06F9/4843 , G06F9/4881 , G06F9/5005 , G06F9/5038 , G06F9/5044 , G06F9/5083 , G06F9/544 , G06F11/0709 , G06F11/0751 , G06F11/079 , G06F11/3006 , G06F11/3034 , G06F11/3055 , G06F11/3079 , G06F11/3409 , G06F12/0284 , G06F12/0692 , G06F13/1652 , G06F16/1744 , G06F21/57 , G06F21/6218 , G06F21/73 , G06F21/76 , G06T1/20 , G06T1/60 , G06T9/005 , H01R13/453 , H01R13/4536 , H01R13/4538 , H01R13/631 , H03K19/1731 , H03M7/3084 , H03M7/40 , H03M7/42 , H03M7/60 , H03M7/6011 , H03M7/6017 , H03M7/6029 , H04L9/0822 , H04L12/2881 , H04L12/4633 , H04L41/044 , H04L41/0816 , H04L41/0853 , H04L41/12 , H04L43/04 , H04L43/06 , H04L43/08 , H04L43/0894 , H04L47/20 , H04L47/2441 , H04L49/104 , H04L61/5007 , H04L67/10 , H04L67/1014 , H04L67/63 , H04L67/75 , H05K7/1452 , H05K7/1487 , H05K7/1491 , G06F11/1453 , G06F12/023 , G06F15/80 , G06F16/285 , G06F2212/401 , G06F2212/402 , G06F2221/2107 , H04L41/046 , H04L41/0896 , H04L41/142 , H04L47/78 , H04L63/1425 , H04Q11/0005 , H05K7/1447 , H05K7/1492
Abstract: Technologies for composing a managed node with multiple processors on multiple compute sleds to cooperatively execute a workload include a memory, one or more processors connected to the memory, and an accelerator. The accelerator further includes a coherence logic unit that is configured to receive a node configuration request to execute a workload. The node configuration request identifies the compute sled and a second compute sled to be included in a managed node. The coherence logic unit is further configured to modify a portion of local working data associated with the workload on the compute sled in the memory with the one or more processors of the compute sled, determine coherence data indicative of the modification made by the one or more processors of the compute sled to the local working data in the memory, and send the coherence data to the second compute sled of the managed node.
-
公开(公告)号:US20240231622A9
公开(公告)日:2024-07-11
申请号:US17972039
申请日:2022-10-24
Applicant: Dell Products L.P.
Inventor: Alexander Shknevsky , Uri Shabi , Aleksey Kabishcher
IPC: G06F3/06 , G06F16/215
CPC classification number: G06F3/061 , G06F3/0641 , G06F3/068 , G06F16/215
Abstract: A technique for managing deduplication using a non-cryptographic hash function includes obtaining metadata associated with both a candidate block presented for deduplication and a target block having a hash-based match to the candidate block. The technique further includes checking for a mismatch between the candidate block and the target block based on the obtained metadata. In response to the checking determining a mismatch, the technique further includes abandoning deduplication of the candidate block, such that the candidate block is stored independently of the target block.
-
10.
公开(公告)号:US20240193045A1
公开(公告)日:2024-06-13
申请号:US18444536
申请日:2024-02-16
Applicant: EMC IP Holding Company LLC
Inventor: Jagannathdas Rath , Kalyan C. Gunda
IPC: G06F11/14 , G06F3/06 , G06F16/174
CPC classification number: G06F11/1453 , G06F3/0619 , G06F3/0641 , G06F3/0656 , G06F3/067 , G06F11/1435 , G06F11/1464 , G06F11/1469 , G06F16/174
Abstract: One example method includes identifying a cloud object as a potential candidate for defragmentation, evaluating the cloud object to determine what portion of segments of the cloud object are expired, when the portion of expired segments meets or exceeds a threshold, segregating the expired segments and unexpired segments of the cloud object, creating a first new cloud object that includes only unexpired segments, creating a second new cloud object that includes only expired segments, and deleting the cloud object from storage.
-
-
-
-
-
-
-
-
-