Invention Publication
- Patent Title: DATA-AWARE STORAGE TIERING AND LIFETIME DATA VALUATION FOR DEEP LEARNING
-
Application No.: US17971410Application Date: 2022-10-20
-
Publication No.: US20240135162A1Publication Date: 2024-04-25
- Inventor: CONG XU , SUPARNA BHATTACHARYA , RYAN BEETHE , MARTIN FOLTIN
- Applicant: HEWLETT PACKARD ENTERPRISE DEVELOPMENT LP
- Applicant Address: US TX Houston
- Assignee: HEWLETT PACKARD ENTERPRISE DEVELOPMENT LP
- Current Assignee: HEWLETT PACKARD ENTERPRISE DEVELOPMENT LP
- Current Assignee Address: US TX Houston
- Main IPC: G06N3/08
- IPC: G06N3/08

Abstract:
Systems and methods are configured to provide lifetime data valuations for a dataset that evolves across multiple machine learning training tasks by providing and updating path-dependent data valuations for data points in the dataset during each training task. A current machine learning training task may include splitting the dataset into multiple random mini-epochs and training the current machine learning model using a first random mini-epoch and an accuracy mini-epoch, which consists of high value data points from the path-dependent data valuations. The random and accuracy mini-epochs can be, during the training, iterated for a number of times during the training, while a second random mini-epoch is prefetch. During the training, the path-dependent data valuations can be updated based on data valuations during the current training and a similarity between the current machine learning model and prior trained machine learning models.
Public/Granted literature
- US20240232607A9 DATA-AWARE STORAGE TIERING AND LIFETIME DATA VALUATION FOR DEEP LEARNING Public/Granted day:2024-07-11
Information query