METHOD, ACCELERATOR, AND ELECTRONIC DEVICE WITH TENSOR PROCESSING

    公开(公告)号:US20210406646A1

    公开(公告)日:2021-12-30

    申请号:US17091338

    申请日:2020-11-06

    Abstract: A processor-implemented tensor processing method includes: receiving a request to process a neural network including a normalization layer by an accelerator; and generating an instruction executable by the accelerator in response to the request, wherein, by executing the instruction, the accelerator is configured to determine an intermediate tensor corresponding to a result of a portion of operations of the normalization layer, by performing, in a channel axis direction, a convolution based on an input tensor and a kernel, wherein the input tensor is of the normalization layer and includes a plurality of channels, a number of input channels of the kernel is determined based on the input tensor, and scaling values of elements of the kernel are determined based on the number of input channels.

    APPARATUS AND METHOD WITH MULTI-FORMAT DATA SUPPORT

    公开(公告)号:US20230065528A1

    公开(公告)日:2023-03-02

    申请号:US17883987

    申请日:2022-08-09

    Abstract: An apparatus with multi-format data support includes: a receiver configured to receive a plurality of data corresponding to a plurality of data formats; one or more processors configured to: multiply the plurality of data using one or more multipliers; perform a first alignment on a result of the multiplication based on an exponent value of the plurality of data; add a result of the first alignment; and perform a second alignment on a result of the addition based on the exponent value and an operation result of a previous cycle.

    METHOD AND APPARATUS WITH DATA PROCESSING

    公开(公告)号:US20210097347A1

    公开(公告)日:2021-04-01

    申请号:US16804672

    申请日:2020-02-28

    Abstract: A processor-implemented data processing method includes: predicting whether there will be an inefficient section, of a neural network set to be implemented, during a processing of data, based on a hardware configuration for processing the data; adjusting a layer parameter corresponding to the inefficient section of the neural network; and processing the data using the neural network with the adjusted layer parameter.

Patent Agency Ranking