Optimal placement of data structures in a hybrid memory based inference computing platform

    Publication No.: US11175844B1

    Publication date: 2021-11-16

    Application No.: US15929618

    Filing date: 2020-05-13

    Abstract: In a deep neural network (DNN), weights represent the strength of connections between different neurons of the DNN, and activations represent the output a neuron produces after its input passes through an activation function, which receives an input and produces an output based on some threshold value. Weight traffic to a hybrid memory is therefore distinguished from activation traffic to the hybrid memory, and one or more data structures may be dynamically allocated in the hybrid memory according to the weights and activations of those data structures in the DNN. The hybrid memory includes at least a first memory and a second memory that differ in their write endurance attributes.
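
The placement idea in the abstract can be illustrated with a minimal sketch. It is not the patented method; it assumes one plausible policy, namely that activations (rewritten on every inference pass) prefer the memory with the higher write endurance, while weights (read-mostly during inference) prefer the lower-endurance memory. The names `Kind`, `Memory`, and `place`, as well as the endurance and capacity figures, are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class Kind(Enum):
    WEIGHT = "weight"          # assumed read-mostly during inference
    ACTIVATION = "activation"  # assumed rewritten on every inference pass


@dataclass
class Memory:
    name: str
    write_endurance: int  # hypothetical figure: tolerated writes per cell
    capacity: int         # bytes
    used: int = 0

    def try_alloc(self, size: int) -> bool:
        """Reserve `size` bytes if they fit; report whether it succeeded."""
        if self.used + size > self.capacity:
            return False
        self.used += size
        return True


def place(kind: Kind, size: int, memories: List[Memory]) -> str:
    """Return the name of the memory chosen for a data structure.

    Assumed policy: activations try the highest-endurance memory first,
    weights try the lowest-endurance memory first. If the preferred
    memory is full, the next one in that order is used as a fallback.
    """
    ordered = sorted(
        memories,
        key=lambda m: m.write_endurance,
        reverse=(kind is Kind.ACTIVATION),
    )
    for mem in ordered:
        if mem.try_alloc(size):
            return mem.name
    raise MemoryError(f"no memory can hold {size} bytes")


if __name__ == "__main__":
    # Hypothetical two-tier hybrid memory.
    hybrid = [
        Memory("high_endurance", write_endurance=10**15, capacity=100),
        Memory("low_endurance", write_endurance=10**6, capacity=1000),
    ]
    print(place(Kind.ACTIVATION, 64, hybrid))
    print(place(Kind.WEIGHT, 512, hybrid))
```

Sorting by endurance rather than hard-coding two tiers keeps the sketch valid for the "at least a first memory and a second memory" wording, since additional memories simply slot into the preference order.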
