Predictive probabilistic deduplication of storage

Invention Grant

US09940337B2 Predictive probabilistic deduplication of storage 有权

Please log in to see more content

Patent Title: Predictive probabilistic deduplication of storage
Application No.: US14726597

Application Date: 2015-05-31
Publication No.: US09940337B2

Publication Date: 2018-04-10
Inventor: Wenguang Wang , Tian Luo
Applicant: VMware, Inc.
Applicant Address: US CA Palo Alto
Assignee: VMware, Inc.
Current Assignee: VMware, Inc.
Current Assignee Address: US CA Palo Alto
Main IPC: G06F17/30
IPC: G06F17/30 ; G06N7/00 ; G06F3/06 ; G06F11/14

Abstract:

Examples perform predictive probabilistic deduplication of storage, such as virtualized or physical disks. Incoming input/output (I/O) commands include data, which is written to storage and tracked in a key-value store. The key-value store includes a hash of the data as the key, and a reference counter and the address of the data as the value. When a certain percentage of sampled incoming data is found to be duplicate, it is predicted that the I/O commands have become not unique (e.g., duplicate). Based on the prediction, subsequent incoming data is not written to storage, and instead the reference counter associated with the hash of the data is incremented. In this manner, predictions on the uniqueness of future data is made based on previous data, and extraneous writes and deletions from the chunk store are avoided.

Public/Granted literature

US20160350324A1 PREDICTIVE PROBABILISTIC DEDUPLICATION OF STORAGE Public/Granted day:2016-12-01

Information query

Espacenet