OPTIMIZING STORAGE-RELATED COSTS WITH COMPRESSION IN A MULTI-TIERED STORAGE DEVICE

    Publication number: US20230418468A1

    Publication date: 2023-12-28

    Application number: US17851443

    Application date: 2022-06-28

    Applicant: Adobe Inc.

    CPC classification number: G06F3/0604 G06F3/0644 G06F3/067

    Abstract: Some techniques described herein relate to determining how to optimally store datasets in a multi-tiered storage device with compression. In one example, a method includes assigning, to a data partition of a dataset, a priority based on access patterns of the data partition. Compression data is accessed describing results of compressing a data sample associated with the data partition using multiple compression schemes. Based both on the priority of the data partition and the compression data, a storage tier is determined for storing the data partition in the multi-tiered storage device. Further, based both on the priority of the data partition and the compression data, a compression scheme is determined for compressing the data partition for storage in the multi-tiered storage device. The data partition is compressed using the compression scheme to produce a compressed data partition, and the compressed data partition is stored in the storage tier.
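The selection step described in this abstract can be illustrated with a minimal sketch. It is not the patented method: the tier names, per-KB storage costs, read penalties, and decompression penalties below are all hypothetical, and the priority is assumed to be an expected access count per billing period. Only the compression ratios are measured, using the standard-library `zlib`, `bz2`, and `lzma` modules on a sample of the partition.

```python
import bz2
import lzma
import zlib

# Hypothetical tiers: (name, storage cost per KB, read penalty per access).
TIERS = [("hot", 3.0, 1.0), ("cool", 1.0, 5.0), ("archive", 0.2, 50.0)]

# Candidate compression schemes; "none" stores the bytes unchanged.
SCHEMES = {
    "none": lambda data: data,
    "zlib": zlib.compress,
    "bz2": bz2.compress,
    "lzma": lzma.compress,
}

# Assumed (not measured) decompression penalty per access for each scheme.
DECOMPRESS_PENALTY = {"none": 0.0, "zlib": 0.5, "bz2": 2.0, "lzma": 1.5}


def compression_data(sample: bytes) -> dict[str, float]:
    """Compress a non-empty sample with every scheme and return size ratios."""
    return {name: len(fn(sample)) / len(sample) for name, fn in SCHEMES.items()}


def choose_placement(priority: float, size_kb: float,
                     ratios: dict[str, float]) -> tuple[str, str]:
    """Return the (tier, scheme) pair with the lowest assumed total cost."""
    best = None
    for tier, store_cost, read_penalty in TIERS:
        for scheme, ratio in ratios.items():
            storage = store_cost * size_kb * ratio
            access = priority * (read_penalty + DECOMPRESS_PENALTY[scheme])
            total = storage + access
            if best is None or total < best[0]:
                best = (total, tier, scheme)
    return best[1], best[2]


def store_partition(partition: bytes, priority: float,
                    storage: dict) -> tuple[str, str]:
    """Pick a tier and scheme from a sample, compress, and store the result."""
    ratios = compression_data(partition[:4096])  # sample = first 4 KB
    tier, scheme = choose_placement(priority, len(partition) / 1024, ratios)
    storage.setdefault(tier, []).append((scheme, SCHEMES[scheme](partition)))
    return tier, scheme
```

Under these assumed costs, a partition that is never read lands in the cheapest tier with the best-compressing scheme, while a very frequently read partition stays uncompressed in the fastest tier. Both decisions come from the same cost comparison, which mirrors the abstract's point that tier and scheme depend jointly on priority and compression data.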

    Optimizing storage-related costs with compression in a multi-tiered storage device

    Publication number: US11907531B2

    Publication date: 2024-02-20

    Application number: US17851443

    Application date: 2022-06-28

    Applicant: Adobe Inc.

    CPC classification number: G06F3/0604 G06F3/067 G06F3/0644

    Abstract: Some techniques described herein relate to determining how to optimally store datasets in a multi-tiered storage device with compression. In one example, a method includes assigning, to a data partition of a dataset, a priority based on access patterns of the data partition. Compression data is accessed describing results of compressing a data sample associated with the data partition using multiple compression schemes. Based both on the priority of the data partition and the compression data, a storage tier is determined for storing the data partition in the multi-tiered storage device. Further, based both on the priority of the data partition and the compression data, a compression scheme is determined for compressing the data partition for storage in the multi-tiered storage device. The data partition is compressed using the compression scheme to produce a compressed data partition, and the compressed data partition is stored in the storage tier.

    RELATING DATA IN DATA LAKES
    Patent application

    Publication number: US20240386002A1

    Publication date: 2024-11-21

    Application number: US18319748

    Application date: 2023-05-18

    Applicant: Adobe Inc.

    Abstract: A dataset comprising tables is received. Embeddings are generated for column titles of a table. Based on the embeddings, similar tables are clustered. The tables are organized into smaller clusters based on statistical similarities. Similarity scores are calculated for tables within the same cluster. A relatedness graph is created based on the similarity scores; similar tables are represented by nodes connected by edges. If the similarity score for a pair of tables exceeds a threshold, a table is deleted.
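As a rough illustration of the scoring and graph-building steps in this abstract, the sketch below substitutes a much simpler technique for the embeddings: each table is reduced to the set of tokens in its column titles, and pairs are scored with Jaccard similarity. The clustering stages are omitted. The function names, both thresholds, and the example tables are hypothetical, and tables are only flagged for deletion, not deleted.

```python
from itertools import combinations


def title_tokens(columns):
    """Split column titles such as 'customer_id' into lowercase tokens."""
    return {
        tok
        for col in columns
        for tok in col.lower().replace("-", "_").split("_")
        if tok
    }


def jaccard(a, b):
    """Jaccard similarity of two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def relatedness_graph(tables, edge_threshold=0.3, duplicate_threshold=0.9):
    """Return (edges, duplicates) for a mapping of table name -> column titles.

    edges maps a pair of table names to its similarity score; duplicates
    lists the second table of every pair scoring above duplicate_threshold.
    """
    tokens = {name: title_tokens(cols) for name, cols in tables.items()}
    edges = {}
    duplicates = []
    for a, b in combinations(sorted(tables), 2):
        score = jaccard(tokens[a], tokens[b])
        if score >= edge_threshold:
            edges[(a, b)] = score
        if score > duplicate_threshold:
            duplicates.append(b)
    return edges, duplicates


tables = {
    "orders": ["order_id", "customer_id", "total"],
    "orders_copy": ["order_id", "customer_id", "total"],
    "customers": ["customer_id", "name", "email"],
    "sensors": ["device", "reading"],
}
edges, duplicates = relatedness_graph(tables)
```

With these example tables, `orders` and `orders_copy` have identical column titles, so the pair is both connected by an edge and flagged as a duplicate, while `sensors` shares no tokens with any other table and stays isolated.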

    COMPUTING RESOURCE ALLOCATION MECHANISM TESTING AND DEPLOYMENT

    Publication number: US20240303176A1

    Publication date: 2024-09-12

    Application number: US18178715

    Application date: 2023-03-06

    Applicant: Adobe Inc.

    CPC classification number: G06F11/3442 G06F9/5077

    Abstract: A computing resource allocation system receives entity resource usage data describing computing resource usage of an executable service platform by an entity as part of a first allocation generated using a first allocation mechanism. The system generates an entity resource model based on that usage data. It then simulates computing resource usage of the executable service platform by the entity under a second allocation mechanism, based on the entity resource model and the entity resource usage data. Finally, the system estimates a second allocation to provide to the entity based on the simulation.
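The model, simulate, and estimate sequence in this abstract can be sketched under strong assumptions. Here the entity resource model is just the mean and standard deviation of observed usage, and the second allocation mechanism is a hypothetical per-period allocation whose unused portion rolls over to the next period. None of these choices, nor the function names or the 5% shortfall target, come from the patent.

```python
import random
import statistics


def fit_entity_model(usage):
    """Entity resource model: mean and sample standard deviation of usage."""
    return {"mean": statistics.fmean(usage), "stdev": statistics.stdev(usage)}


def simulate_demand(model, usage, periods, seed=0):
    """Replay the observed usage, then extend it with model-generated periods."""
    rng = random.Random(seed)
    generated = [
        max(0.0, rng.gauss(model["mean"], model["stdev"]))
        for _ in range(periods)
    ]
    return list(usage) + generated


def shortfall_rate(demand, allocation):
    """Fraction of periods in which demand exceeds allocation plus rollover."""
    credit = 0.0
    shortfalls = 0
    for d in demand:
        available = allocation + credit
        if d > available:
            shortfalls += 1
            credit = 0.0
        else:
            # Unused capacity rolls over, capped at one period's allocation.
            credit = min(allocation, available - d)
    return shortfalls / len(demand)


def estimate_second_allocation(demand, target=0.05):
    """Smallest observed demand level whose shortfall rate meets the target."""
    for candidate in sorted(set(demand)):
        if shortfall_rate(demand, candidate) <= target:
            return candidate
    return max(demand)


usage = [40, 55, 38, 61, 47, 52, 45, 58]  # made-up usage under mechanism one
model = fit_entity_model(usage)
demand = simulate_demand(model, usage, periods=200, seed=1)
allocation = estimate_second_allocation(demand, target=0.05)
```

The candidate allocations are taken from the simulated demand values themselves, so the scan always terminates: an allocation equal to the peak demand never falls short.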
