METHOD FOR PERFORMING RANDOM READ ACCESS TO A BLOCK OF DATA USING PARALLEL LUT READ INSTRUCTION IN VECTOR PROCESSORS

    公开(公告)号:US20240345842A1

    公开(公告)日:2024-10-17

    申请号:US18754455

    申请日:2024-06-26

    CPC classification number: G06F9/383 G06F9/30036 G06F9/3004 G06F9/30043

    Abstract: This disclosure is directed to the problem of paralleling random read access within a reasonably sized block of data for a vector SIMD processor. The invention sets up plural parallel look up tables, moves data from main memory to each plural parallel look up table and then employs a look up table read instruction to simultaneously move data from each parallel look up table to a corresponding part a vector destination register. This enables data processing by vector single instruction multiple data (SIMD) operations. This vector destination register load can be repeated if the tables store more used data. New data can be loaded into the original tables if appropriate. A level one memory is preferably partitioned as part data cache and part directly addressable memory. The look up table memory is stored in the directly addressable memory.

    METHOD FOR PERFORMING RANDOM READ ACCESS TO A BLOCK OF DATA USING PARALLEL LUT READ INSTRUCTION IN VECTOR PROCESSORS

    公开(公告)号:US20230297383A1

    公开(公告)日:2023-09-21

    申请号:US18321037

    申请日:2023-05-22

    CPC classification number: G06F9/383 G06F9/30036 G06F9/3004 G06F9/30043

    Abstract: This disclosure is directed to the problem of paralleling random read access within a reasonably sized block of data for a vector SIMD processor. The invention sets up plural parallel look up tables, moves data from main memory to each plural parallel look up table and then employs a look up table read instruction to simultaneously move data from each parallel look up table to a corresponding part a vector destination register. This enables data processing by vector single instruction multiple data (SIMD) operations. This vector destination register load can be repeated if the tables store more used data. New data can be loaded into the original tables if appropriate. A level one memory is preferably partitioned as part data cache and part directly addressable memory. The look up table memory is stored in the directly addressable memory.

    METHOD FOR EFFICIENT MEDIAN FILTERING
    28.
    发明申请
    METHOD FOR EFFICIENT MEDIAN FILTERING 有权
    有效的中介过滤方法

    公开(公告)号:US20160232641A1

    公开(公告)日:2016-08-11

    申请号:US15040424

    申请日:2016-02-10

    Abstract: A method is disclosed for efficiently calculating a median value of a high-order array in a Single Instruction Multiple Data (SIMD) processor. Values of the high-order array are sorted vertically in each column followed by sorts on each individual row. After the sort, selective diagonal values of the sorted high-order array are used to form a low-order array to calculate the median of the high-order array. The median calculation using selective diagonal values of the high-order array in a low-order array significantly improves SIMD processor efficiency and throughput.

    Abstract translation: 公开了一种用于在单指令多数据(SIMD)处理器中有效计算高阶阵列的中值的方法。 高阶数组的值在每列中垂直排列,后跟排列在每个单独的行上。 排序后,排序高阶数组的选择性对角线值用于形成低阶数组,以计算高阶数组的中位数。 使用低阶阵列中高阶阵列的选择性对角线值的中值计算显着提高了SIMD处理器的效率和吞吐量。

    DYNAMIC FRAME PADDING IN A VIDEO HARDWARE ENGINE
    30.
    发明申请
    DYNAMIC FRAME PADDING IN A VIDEO HARDWARE ENGINE 审中-公开
    视频硬件引擎中的动态框架

    公开(公告)号:US20150271512A1

    公开(公告)日:2015-09-24

    申请号:US14661770

    申请日:2015-03-18

    CPC classification number: H04N19/43 H04N19/51

    Abstract: A video hardware engine which support dynamic frame padding is disclosed. The video hardware engine includes an external memory. The external memory stores a reference frame. The reference frame includes a plurality of reference pixels. A motion estimation (ME) engine receives a current LCU (largest coding unit), and defines a search area around the current LCU for motion estimation. The ME engine receives a set of reference pixels corresponding to the current LCU. The set of reference pixels of the plurality of reference pixels are received from the external memory. The ME engine pads a set of duplicate pixels along an edge of the reference frame when a part area of the search area is outside the reference frame.

    Abstract translation: 公开了支持动态帧填充的视频硬件引擎。 视频硬件引擎包括外部存储器。 外部存储器存储参考帧。 参考帧包括多个参考像素。 运动估计(ME)引擎接收当前LCU(最大编码单元),并且定义当前LCU周围的运动估计周围的搜索区域。 ME引擎接收与当前LCU对应的一组参考像素。 从外部存储器接收多个参考像素的参考像素集合。 当搜索区域的一部分区域在参考帧之外时,ME引擎沿着参考帧的边缘焊接一组重复像素。

Patent Agency Ranking