Invention Grant
- Patent Title: Deduplicating patterned data in a storage system
- Application No.: US15861279
- Application Date: 2018-01-03
- Publication No.: US10783131B1
- Publication Date: 2020-09-22
- Inventor: Marco Sanvido, Richard Hankins, John Hayes, Steve Hodgson, Feng Wang, Sergey Zhuravlev, Andrew Kleinerman
- Applicant: Pure Storage, Inc.
- Applicant Address: US CA Mountain View
- Assignee: Pure Storage, Inc.
- Current Assignee: Pure Storage, Inc.
- Current Assignee Address: US CA Mountain View
- Main IPC: G06F16/22
- IPC: G06F16/22; G06F3/06

Abstract:
A system and method for efficiently storing data in a storage system. A data storage subsystem includes multiple data storage locations on multiple storage devices in addition to at least one mapping table. A data storage controller determines whether data to store in the storage subsystem has one or more patterns of data intermingled with non-pattern data within an allocated block. Rather than store the one or more patterns on the storage devices, the controller stores information in a header on the storage devices. The information includes at least an offset for the first instance of a pattern, a pattern length, and an identification of the pattern. The data may be reconstructed for a corresponding read request from the information stored in the header.
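
The header scheme described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The pattern table `PATTERNS`, the `MIN_RUN` threshold, the `PatternEntry` record, and the functions `encode_block` and `decode_block` are all hypothetical names introduced here. The sketch also assumes single-byte repeating patterns and interprets "pattern length" as the length of the repeated run, which the abstract does not specify.

```python
from dataclasses import dataclass

# Hypothetical pattern table: pattern ID -> single repeating byte (assumption).
PATTERNS = {0: b"\x00", 1: b"\xff"}
MIN_RUN = 8  # assumed threshold: shorter runs are kept as ordinary data


@dataclass
class PatternEntry:
    offset: int      # where the pattern run starts in the original block
    length: int      # length of the run in bytes (assumed meaning)
    pattern_id: int  # key into PATTERNS


def encode_block(block: bytes):
    """Split a block into header entries for pattern runs plus non-pattern data."""
    header, payload = [], bytearray()
    i = 0
    while i < len(block):
        for pid, unit in PATTERNS.items():
            # Measure how far this single-byte pattern repeats from position i.
            j = i
            while j < len(block) and block[j:j + 1] == unit:
                j += 1
            if j - i >= MIN_RUN:
                header.append(PatternEntry(i, j - i, pid))
                i = j
                break
        else:
            # No pattern run starts here: keep the byte as non-pattern data.
            payload.append(block[i])
            i += 1
    return header, bytes(payload)


def decode_block(header, payload: bytes) -> bytes:
    """Rebuild the original block from header entries and non-pattern data."""
    out = bytearray()
    pos = 0  # read position in payload
    for entry in header:
        # Copy the non-pattern bytes that precede this run.
        gap = entry.offset - len(out)
        out += payload[pos:pos + gap]
        pos += gap
        # Regenerate the run from its pattern ID instead of reading stored bytes.
        out += PATTERNS[entry.pattern_id] * entry.length
    out += payload[pos:]  # trailing non-pattern bytes
    return bytes(out)


block = b"abc" + b"\x00" * 16 + b"xyz" + b"\xff" * 8 + b"!"
header, payload = encode_block(block)
restored = decode_block(header, payload)
```

In this example only the seven non-pattern bytes are kept as data. The two pattern runs are represented by header entries alone, and the read path regenerates them from the header.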