DEVICE AND METHOD WITH TRANSFORMER MODEL IMPLEMENTATION

    公开(公告)号:US20230138659A1

    公开(公告)日:2023-05-04

    申请号:US17887145

    申请日:2022-08-12

    IPC分类号: G06N3/04

    摘要: A device and method with transformer model implementation are provided. The electronic device includes a processor configured to perform an inference by implementing a transformer model including a plurality of encoders and a plurality of decoders, and a memory configured to store instructions to be executed by the processor. Each of the encoders and the decoders includes an attention block that determines an attention value. The processor is configured to perform a first sub-softmax tile-wise operation in the attention block, perform a reduction operation to determine an adjustment factor based on a resulting value of the first sub-softmax operation, and perform a second sub-softmax tile-wise operation based on a resulting value of the reduction operation.

    NEURAL PROCESSOR
    5.
    发明申请

    公开(公告)号:US20220283984A1

    公开(公告)日:2022-09-08

    申请号:US17369298

    申请日:2021-07-07

    IPC分类号: G06F15/80 G06N3/063 G06F9/30

    摘要: A neural processor is provided. The neural processor includes a matrix device which is configured to generate an output feature map by processing a standard convolution operation and which has a systolic array architecture, and accelerators with an adder-tree structure which are configured to process depth-wise convolution operations for each of elements of the output feature map corresponding to lanes of the matrix device.