Invention Grant
- Patent Title: Predictive probabilistic deduplication of storage
- Application No.: US14726597
- Application Date: 2015-05-31
- Publication No.: US09940337B2
- Publication Date: 2018-04-10
- Inventor: Wenguang Wang, Tian Luo
- Applicant: VMware, Inc.
- Applicant Address: US CA Palo Alto
- Assignee: VMware, Inc.
- Current Assignee: VMware, Inc.
- Current Assignee Address: US CA Palo Alto
- Main IPC: G06F17/30
- IPC: G06F17/30; G06N7/00; G06F3/06; G06F11/14

Abstract:
Examples perform predictive probabilistic deduplication of storage, such as virtualized or physical disks. Incoming input/output (I/O) commands include data, which is written to storage and tracked in a key-value store. The key-value store uses a hash of the data as the key, and a reference counter and the address of the data as the value. When a certain percentage of sampled incoming data is found to be duplicate, it is predicted that the data in the I/O commands is no longer unique (i.e., it is duplicate). Based on the prediction, subsequent incoming data is not written to storage; instead, the reference counter associated with the hash of the data is incremented. In this manner, predictions about the uniqueness of future data are made based on previous data, and extraneous writes to and deletions from the chunk store are avoided.
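The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class name `PredictiveDedupStore`, the sliding `window`, the `threshold`, the `sample_every` parameter, and the use of SHA-256 are all assumptions made for illustration, and the chunk store is modeled as an in-memory list. While data is predicted to be unique, every block is written and a repeated block simply costs an extra copy; once enough sampled writes turn out to be duplicates, repeated blocks only increment the reference counter.

```python
import hashlib
from collections import deque


class PredictiveDedupStore:
    """Toy model: a chunk store plus a key-value store (hash -> [refcount, address])."""

    def __init__(self, window=4, threshold=0.5, sample_every=1):
        self.chunks = []                     # stand-in for the chunk store; index = address
        self.kv = {}                         # hash -> [reference_count, address]
        self.samples = deque(maxlen=window)  # True where a sampled write was a duplicate
        self.threshold = threshold           # assumed duplicate ratio that flips the prediction
        self.sample_every = sample_every     # assumed sampling rate (every n-th write)
        self.writes_seen = 0
        self.expect_duplicates = False       # current prediction about incoming data

    def _update_prediction(self, was_duplicate):
        self.samples.append(was_duplicate)
        if len(self.samples) == self.samples.maxlen:
            ratio = sum(self.samples) / len(self.samples)
            self.expect_duplicates = ratio >= self.threshold

    def write(self, data: bytes) -> int:
        key = hashlib.sha256(data).hexdigest()
        entry = self.kv.get(key)

        self.writes_seen += 1
        if self.writes_seen % self.sample_every == 0:
            self._update_prediction(entry is not None)

        if self.expect_duplicates and entry is not None:
            entry[0] += 1        # skip the write; count one more reference
            return entry[1]

        # Predicted unique, or genuinely new data: write to the chunk store.
        # Simplification: a repeat seen while predicting "unique" is stored again.
        address = len(self.chunks)
        self.chunks.append(data)
        if entry is None:
            self.kv[key] = [1, address]
        return address
```

With `window=4` and `threshold=0.5`, writing four distinct blocks and then repeating them flips the prediction once half of the sampled window consists of duplicates; from that point on, repeated blocks return the existing address and leave the chunk store untouched.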
Public/Granted literature
- US20160350324A1 PREDICTIVE PROBABILISTIC DEDUPLICATION OF STORAGE Public/Granted day: 2016-12-01