NEURAL NETWORK METHOD AND APPARATUS

    公开(公告)号:US20210081798A1

    公开(公告)日:2021-03-18

    申请号:US16835532

    申请日:2020-03-31

    Abstract: A method and apparatus for the pruning of a neural network is provided. The method sets a weight threshold value based on a weight distribution of layers included in a neural network, predicts a change of inference accuracy of a neural network by pruning of each layer based on the weight threshold value, determines a current subject layer to be pruned with a weight threshold value among the layers included in the neural network, and prunes a determined current subject layer.

    NEURAL NETWORK METHOD AND APPARATUS
    2.
    发明公开

    公开(公告)号:US20230274140A1

    公开(公告)日:2023-08-31

    申请号:US18144009

    申请日:2023-05-05

    CPC classification number: G06N3/08 G06N20/10

    Abstract: A processor-implemented method of performing a convolution operation is provided. The method includes obtaining input feature map data and kernel data, determine the kernel data based on a number of input channels of the input feature map, a number of output channels of an output feature map, and a number of groups of the input feature map data and a number of groups of the kernel data related to the convolution operation, and performing the convolution operation based on the input feature map data and the determined kernel data.

    METHOD AND APPARATUS WITH NEURAL NETWORK PRUNING

    公开(公告)号:US20220114453A1

    公开(公告)日:2022-04-14

    申请号:US17236503

    申请日:2021-04-21

    Abstract: A neural network pruning method includes: acquiring a first task accuracy of an inference task processed by a pretrained neural network; pruning, based on a channel unit, the neural network by adjusting weights between nodes of channels based on a preset learning weight and based on a channel-by-channel pruning parameter corresponding to a channel of each of a plurality of layers of the pretrained neural network; updating the learning weight based on the first task accuracy and a task accuracy of the pruned neural network; updating the channel-by-channel pruning parameter based on the updated learning weight and the task accuracy of the pruned neural network; and repruning, based on the channel unit, the pruned neural network based on the updated learning weight and based on the updated channel-by-channel pruning parameter.

    NEURAL NETWORK METHOD AND APPARATUS

    公开(公告)号:US20210201132A1

    公开(公告)日:2021-07-01

    申请号:US16897461

    申请日:2020-06-10

    Abstract: A processor-implemented method of performing a convolution operation is provided. The method includes obtaining input feature map data and kernel data, determine the kernel data based on a number of input channels of the input feature map, a number of output channels of an output feature map, and a number of groups of the input feature map data and a number of groups of the kernel data related to the convolution operation, and performing the convolution operation based on the input feature map data and the determined kernel data.

    NEURAL NETWORK METHOD AND APPARATUS
    5.
    发明公开

    公开(公告)号:US20240346317A1

    公开(公告)日:2024-10-17

    申请号:US18752163

    申请日:2024-06-24

    CPC classification number: G06N3/082 G06N3/04

    Abstract: A method and apparatus for the pruning of a neural network is provided. The method sets a weight threshold value based on a weight distribution of layers included in a neural network, predicts a change of inference accuracy of a neural network by pruning of each layer based on the weight threshold value, determines a current subject layer to be pruned with a weight threshold value among the layers included in the neural network, and prunes a determined current subject layer.

    METHOD AND APPARATUS FOR COMPRESSING ARTIFICIAL NEURAL NETWORK

    公开(公告)号:US20220108180A1

    公开(公告)日:2022-04-07

    申请号:US17191954

    申请日:2021-03-04

    Abstract: A method and apparatus for compressing an artificial neural network may acquire weights corresponding to an artificial neural network trained in advance, wherein the artificial neural network includes a plurality of layers, and a processor configured to generate data for acquiring a change of behavior of the artificial neural network due to pruning of the artificial neural network based on the weights, determine a pruning threshold for pruning of the artificial neural network based on the change of the behavior of the artificial neural network, and compress the neural network based on the pruning threshold.

Patent Agency Ranking