Methods and apparatus to tile walk a tensor for convolution operations

    公开(公告)号:US12223413B2

    公开(公告)日:2025-02-11

    申请号:US18392761

    申请日:2023-12-21

    Abstract: An example apparatus to perform a convolution on an input tensor includes a parameters generator to: generate a horizontal hardware execution parameter for a horizontal dimension of the input tensor based on a kernel parameter and a layer parameter; and generate a vertical hardware execution parameter for a vertical dimension of the input tensor based on the kernel parameter and the layer parameter; an accelerator interface to configure a hardware accelerator circuitry based on the horizontal and vertical hardware execution parameters; a horizontal Iterator controller to determine when the hardware accelerator circuitry completes the first horizontal iteration of the convolution; and a vertical Iterator controller to determine when the hardware accelerator circuitry completes the first vertical iteration of the convolution.

    METHODS AND APPARATUS TO TILE WALK A TENSOR FOR CONVOLUTION OPERATIONS

    公开(公告)号:US20250045560A1

    公开(公告)日:2025-02-06

    申请号:US18922038

    申请日:2024-10-21

    Abstract: An example apparatus to perform a convolution on an input tensor includes a parameters generator to: generate a horizontal hardware execution parameter for a horizontal dimension of the input tensor based on a kernel parameter and a layer parameter; and generate a vertical hardware execution parameter for a vertical dimension of the input tensor based on the kernel parameter and the layer parameter; an accelerator interface to configure a hardware accelerator circuitry based on the horizontal and vertical hardware execution parameters; a horizontal Iterator controller to determine when the hardware accelerator circuitry completes the first horizontal iteration of the convolution; and a vertical Iterator controller to determine when the hardware accelerator circuitry completes the first vertical iteration of the convolution.

    METHODS AND APPARATUS TO TILE WALK A TENSOR FOR CONVOLUTION OPERATIONS

    公开(公告)号:US20240119255A1

    公开(公告)日:2024-04-11

    申请号:US18392761

    申请日:2023-12-21

    CPC classification number: G06N3/04 G06F8/41 G06F17/15 G06N3/063

    Abstract: An example apparatus to perform a convolution on an input tensor includes a parameters generator to: generate a horizontal hardware execution parameter for a horizontal dimension of the input tensor based on a kernel parameter and a layer parameter; and generate a vertical hardware execution parameter for a vertical dimension of the input tensor based on the kernel parameter and the layer parameter; an accelerator interface to configure a hardware accelerator circuitry based on the horizontal and vertical hardware execution parameters; a horizontal Iterator controller to determine when the hardware accelerator circuitry completes the first horizontal iteration of the convolution; and a vertical Iterator controller to determine when the hardware accelerator circuitry completes the first vertical iteration of the convolution.

    PACKED FINITE IMPULSE RESPONSE (FIR) FILTER PROCESSORS, METHODS, SYSTEMS, AND INSTRUCTIONS
    8.
    发明申请
    PACKED FINITE IMPULSE RESPONSE (FIR) FILTER PROCESSORS, METHODS, SYSTEMS, AND INSTRUCTIONS 有权
    包装有限冲突响应(FIR)过滤器,方法,系统和说明

    公开(公告)号:US20160328233A1

    公开(公告)日:2016-11-10

    申请号:US14704633

    申请日:2015-05-05

    Abstract: A processor includes a decode unit to decode a packed finite impulse response (FIR) filter instruction that indicates one or more source packed data operands, a plurality of FIR filter coefficients, and a destination storage location. The source operand(s) include a first number of data elements and a second number of additional data elements. The second number is one less than a number of FIR filter taps. An execution unit, in response to the packed FIR filter instruction being decoded, is to store a result packed data operand. The result packed data operand includes the first number of FIR filtered data elements that each is to be based on a combination of products of the plurality of FIR filter coefficients and a different corresponding set of data elements from the one or more source packed data operands, which is equal in number to the number of FIR filter taps.

    Abstract translation: 处理器包括解码单元,用于解码指示一个或多个源打包数据操作数,多个FIR滤波器系数和目的地存储位置的压缩有限脉冲响应(FIR)滤波器指令。 源操作数包括第一数量的数据元素和第二数量的附加数据元素。 第二个数字是少于FIR滤波器抽头的数量。 响应于被解码的打包FIR滤波器指令,执行单元是存储结果打包数据操作数。 结果打包数据操作数包括第一数量的FIR滤波数据元素,每个FIR滤波数据元素将基于多个FIR滤波器系数的乘积和来自一个或多个源打包数据操作数的不同对应的数据元素的组合, 其数量与FIR滤波器抽头的数量相等。

    Methods and apparatus to tile walk a tensor for convolution operations

    公开(公告)号:US12112251B2

    公开(公告)日:2024-10-08

    申请号:US17954846

    申请日:2022-09-28

    CPC classification number: G06N3/04 G06F8/41 G06F17/15 G06N3/063

    Abstract: An example apparatus to perform a convolution on an input tensor includes a parameters generator to: generate a horizontal hardware execution parameter for a horizontal dimension of the input tensor based on a kernel parameter and a layer parameter; and generate a vertical hardware execution parameter for a vertical dimension of the input tensor based on the kernel parameter and the layer parameter; an accelerator interface to configure a hardware accelerator circuitry based on the horizontal and vertical hardware execution parameters; a horizontal Iterator controller to determine when the hardware accelerator circuitry completes the first horizontal iteration of the convolution; and a vertical Iterator controller to determine when the hardware accelerator circuitry completes the first vertical iteration of the convolution.

Patent Agency Ranking