METHOD FOR PERFORMING RANDOM READ ACCESS TO A BLOCK OF DATA USING PARALLEL LUT READ INSTRUCTION IN VECTOR PROCESSORS

    Publication No.: US20240345842A1

    Publication date: 2024-10-17

    Application No.: US18754455

    Filing date: 2024-06-26

    IPC classification: G06F9/38 G06F9/30

    Abstract: This disclosure addresses the problem of parallelizing random read access within a reasonably sized block of data for a vector SIMD processor. The invention sets up plural parallel look up tables, moves data from main memory to each of the parallel look up tables, and then employs a look up table read instruction to simultaneously move data from each parallel look up table to a corresponding part of a vector destination register. This enables data processing by vector single instruction multiple data (SIMD) operations. The vector destination register load can be repeated if the tables store further data to be used. New data can be loaded into the original tables if appropriate. A level one memory is preferably partitioned as part data cache and part directly addressable memory. The look up tables are stored in the directly addressable memory.
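The parallel look up table read described in the abstract can be modeled in software. The sketch below is only an illustration, not the patented instruction: it assumes each lane's table holds a full copy of the same data block, and the names `setup_parallel_tables` and `lut_read` are hypothetical.

```python
from typing import List, Sequence


def setup_parallel_tables(block: Sequence[int], lanes: int) -> List[List[int]]:
    """Copy the data block into one look up table per vector lane.

    Assumption: every table receives the same block, so each lane can
    reach any element of the block independently.
    """
    return [list(block) for _ in range(lanes)]


def lut_read(tables: List[List[int]], indices: Sequence[int]) -> List[int]:
    """Model a single LUT read: lane i returns tables[i][indices[i]]."""
    if len(indices) != len(tables):
        raise ValueError("need exactly one index per table")
    return [table[idx] for table, idx in zip(tables, indices)]


# A small block standing in for data moved from main memory.
block = [10, 20, 30, 40, 50, 60, 70, 80]
tables = setup_parallel_tables(block, lanes=4)

# Four random accesses resolved in one modeled read; the result plays
# the role of the vector destination register.
vector_register = lut_read(tables, [7, 0, 3, 3])
print(vector_register)  # [80, 10, 40, 40]
```

Replicating the block trades memory for access parallelism: two lanes may request the same index (as lanes 2 and 3 do here) without contending for a single table.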

    Remote descriptor to enable remote direct memory access (RDMA) transport of a serialized object

    Publication No.: US12093208B2

    Publication date: 2024-09-17

    Application No.: US17862222

    Filing date: 2022-07-11

    Abstract: Technologies for enabling remote direct memory access (RDMA) transport of serialized objects in streaming pipelines are described. In one method, a first computing device that stores a serialized object in a first memory generates a remote descriptor associated with the serialized object. The remote descriptor uniquely identifies the location of the serialized object and a reference count token. The first computing device sends the remote descriptor to a second computing device in a data center over a network fabric. The second computing device uses the remote descriptor to obtain the contiguous block from the first memory for storage at a second memory associated with the second computing device. The value of the reference count token can be updated when a message is received from the second computing device, and the remote descriptor can be released once the value of the reference count token satisfies a threshold value.
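The descriptor lifecycle in the abstract can be sketched as a small in-process simulation. Everything here is an assumption made for illustration: the class and field names are hypothetical, a `bytearray` slice copy stands in for the RDMA read, the release message is a direct method call, and the threshold is taken to be zero.

```python
from dataclasses import dataclass


@dataclass
class RemoteDescriptor:
    offset: int        # where the serialized object starts in the first memory
    length: int        # size of the serialized object in bytes
    ref_count: int     # reference count token
    released: bool = False


class FirstDevice:
    """Owns the first memory and hands out descriptors (hypothetical model)."""

    def __init__(self, memory_size: int = 64) -> None:
        self.memory = bytearray(memory_size)
        self.next_free = 0

    def store(self, payload: bytes, consumers: int) -> RemoteDescriptor:
        offset = self.next_free
        if offset + len(payload) > len(self.memory):
            raise ValueError("first memory is full")
        self.memory[offset:offset + len(payload)] = payload
        self.next_free += len(payload)
        return RemoteDescriptor(offset, len(payload), consumers)

    def on_release_message(self, desc: RemoteDescriptor, threshold: int = 0) -> None:
        # Update the token, then release once it satisfies the threshold.
        desc.ref_count -= 1
        if desc.ref_count <= threshold:
            desc.released = True


class SecondDevice:
    """Pulls the object using only the descriptor it was sent."""

    def __init__(self) -> None:
        self.memory = b""

    def fetch(self, owner: FirstDevice, desc: RemoteDescriptor) -> None:
        # Stand-in for an RDMA read of one contiguous block.
        end = desc.offset + desc.length
        self.memory = bytes(owner.memory[desc.offset:end])
        owner.on_release_message(desc)


first = FirstDevice()
descriptor = first.store(b'{"k": 1}', consumers=1)
second = SecondDevice()
second.fetch(first, descriptor)
print(second.memory, descriptor.released)
```

With `consumers=2`, the descriptor in this model would stay live after the first fetch and be released only after a second release message arrives.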

    FRESHNESS AND GRAVITY OF DATA OPERATORS EXECUTING IN NEAR MEMORY COMPUTE IN SCALABLE DISAGGREGATED MEMORY ARCHITECTURES

    Publication No.: US20240303078A1

    Publication date: 2024-09-12

    Application No.: US18181307

    Filing date: 2023-03-09

    IPC classification: G06F9/30

    CPC classification: G06F9/3004 G06F9/5016

    Abstract: The disclosure provides systems and methods for improving bandwidth and latency associated with executing data requests in disaggregated memory by leveraging usage indicators (also referred to as usage values), such as “freshness” of data operators and processing “gravity” of near memory compute functions. Examples of the systems and methods disclosed herein generate data operators comprising near memory compute functions offloaded proximate to disaggregated memory nodes; assign a usage value to each data operator based on at least one of (i) a freshness indicator for each data operator and (ii) a gravity indicator for each near memory compute function; and allocate data operations to the data operators based on the usage value.
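The allocation step in the abstract can be illustrated with a toy scoring model. The abstract does not say how freshness and gravity are combined, so the weighted sum, the 0 to 1 scales, the "highest value wins" rule, and all names below are assumptions chosen only to make the idea concrete.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DataOperator:
    name: str
    freshness: float  # assumed scale 0..1, higher means more recently used
    gravity: float    # assumed scale 0..1, higher means heavier processing demand


def usage_value(op: DataOperator, w_fresh: float = 0.5, w_gravity: float = 0.5) -> float:
    """Hypothetical usage value: a weighted sum of the two indicators."""
    return w_fresh * op.freshness + w_gravity * op.gravity


def allocate(operators: List[DataOperator]) -> DataOperator:
    """Assumed policy: send the data operation to the highest usage value."""
    return max(operators, key=usage_value)


operators = [
    DataOperator("filter@node0", freshness=0.9, gravity=0.2),
    DataOperator("filter@node1", freshness=0.4, gravity=0.9),
    DataOperator("filter@node2", freshness=0.1, gravity=0.3),
]
chosen = allocate(operators)
print(chosen.name)  # filter@node1
```

Changing the weights shifts the policy: with `w_fresh=1.0` and `w_gravity=0.0` this model would rank operators by freshness alone and pick `filter@node0`.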