-
公开(公告)号:US20230103011A1
公开(公告)日:2023-03-30
申请号:US17485968
申请日:2021-09-27
Applicant: NETFLIX, INC.
Inventor: Puneet ZAROO , Eva TSE
IPC: G06F16/2453 , G06F16/21 , G06F16/22 , G06F11/34 , G06F11/14
Abstract: One embodiment of the present invention sets forth a technique for optimizing data in a dataset. The technique includes determining, based on one or more attributes of a dataset, an optimization that is associated with at least one of a file encoding, a file size, and a sort column. The technique also includes identifying a plurality of candidate configurations associated with the dataset and corresponding to the optimization, and for each candidate configuration, generating a corresponding set of evaluation metrics associated with the first optimization. The technique further includes determining, based on the sets of evaluation metrics corresponding to the plurality of candidate configurations, a set of configurations in the plurality of candidate configurations to be applied to the dataset. Finally, the technique includes modifying the dataset based on the set of configurations.