METHOD AND APPARATUS OF DATA PROCESSING, ELECTRONIC DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20240411478A1

    公开(公告)日:2024-12-12

    申请号:US18734229

    申请日:2024-06-05

    Abstract: Embodiments of the disclosure provide a method and an apparatus of data processing, an electronic device and a storage medium. The method includes receiving first access data transmitted by at least one client, the first access data representing an instruction log of a remote direct data read instruction transmitted by the client for target data cached in a non-uniform memory access structure; obtaining a data popularity of the target data based on the first access data, the data popularity representing a frequency of the target data accessed by the remote direct data read instruction; and based on the data popularity of the target data, caching the target data to a target location in a data storage unit implemented based on the non-uniform memory access structure, or migrating the target data out of the data storage unit, wherein the target location has a data read-write speed corresponding to the data popularity.

    Method, apparatus and device for data shuffling, computer-readable storage medium and product

    公开(公告)号:US12229118B2

    公开(公告)日:2025-02-18

    申请号:US18742351

    申请日:2024-06-13

    Abstract: The embodiments of the disclosure provide a dada shuffling method, apparatus and device, a computer-readable storage medium and product. The method comprises: acquiring a data shuffling request; acquiring a shuffling request parameter linked list associated with the at least one data to be shuffled based on the data shuffling request; performing a merging operation on shuffling request parameters in the shuffling request parameter linked list according to the data amount of the data segment corresponding to the shuffling request parameter and memory buffer information to obtain at least one target request parameter; and caching the data to be shuffled corresponding to the at least one target request parameter to a predetermined remote direct memory access network card; and distributing respectively data segments associated with at least one data to be shuffled cached in the remote direct memory access network card to a target server of the data segment.

    Hash engine for conducting point queries

    公开(公告)号:US12235817B2

    公开(公告)日:2025-02-25

    申请号:US18475695

    申请日:2023-09-27

    Abstract: Systems and methods are provided for improved point querying of a database. The index values are separated from data and retained in cache memory to allow access without requiring a disk input/output (I/O) operation and thereby having less latency resulting from such disk I/O operations. The index values can be compressed using an algorithm such as Crit-Bit-Trie to allow storage of the index values in limited cache memory space. The index values can be selected for storage according to a least recently used approach when cache memory is insufficient to store all index values to maintain a hit rate for the cached portion and reduce the disk I/O operations.

    Data caching based on data popularity

    公开(公告)号:US12299320B2

    公开(公告)日:2025-05-13

    申请号:US18734229

    申请日:2024-06-05

    Abstract: A method, apparatus, electronic device and storage medium for data caching based on data popularity is provided. In the method, first access data transmitted by at least one client is received. The first access data represents an instruction log of a remote direct data read instruction transmitted by the client for target data cached in a non-uniform memory access structure. A data popularity of the target data is obtained based on the first access data. The data popularity represents a frequency of the target data accessed by the remote direct data read instruction. Based on the data popularity of the target data, the target data is cached to a target location in a data storage unit implemented based on the non-uniform memory access structure. Alternatively, the target data out of the data storage unit is migrated. The target location has a data read-write speed corresponding to the data popularity.

Patent Agency Ranking