Publication No.: US20220012563A1
Publication Date: 2022-01-13
Application No.: US17484226
Filing Date: 2021-09-24
Applicant: Alejandro Castro Gonzalez, Praveen Nair, Somnath Paul, Sudheendra Kadri, Palanivel Guruvareddiar, Aaron Gubrud, Vinodh Gopal
Inventor: Alejandro Castro Gonzalez, Praveen Nair, Somnath Paul, Sudheendra Kadri, Palanivel Guruvareddiar, Aaron Gubrud, Vinodh Gopal
IPC Classification: G06N3/04
Abstract: Methods, apparatus, systems, and articles of manufacture are disclosed for high-throughput compression of neural network weights. An example apparatus includes at least one memory, instructions in the apparatus, and processor circuitry to execute the instructions to: determine sizes of data lanes in a partition of neural network weights; determine a slice size based on a size difference between a first data lane and a second data lane of the data lanes in the partition, the first data lane including first data, the second data lane including second data, and the second data being smaller than the first data; cut a portion of the first data from the first data lane based on the slice size; and append the portion of the first data to the second data lane.
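The lane-balancing steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function name `balance_lanes`, the use of `bytes` to represent a lane, the choice of the largest and smallest lanes as the first and second lanes, and the rule that the slice size is half the size difference are all assumptions made for illustration; the abstract states only that the slice size is based on the size difference.

```python
def balance_lanes(lanes: list[bytes]) -> tuple[list[bytes], int]:
    """Move a slice from the largest lane to the smallest lane.

    Hypothetical helper: the abstract only says the slice size is
    "based on" the size difference; halving it is an assumption.
    """
    # Step 1: determine the sizes of the data lanes in the partition.
    sizes = [len(lane) for lane in lanes]

    # Assumption: the "first" lane is the largest, the "second" the smallest.
    first = max(range(len(lanes)), key=sizes.__getitem__)
    second = min(range(len(lanes)), key=sizes.__getitem__)

    # Step 2: derive the slice size from the size difference (assumed: half).
    slice_size = (sizes[first] - sizes[second]) // 2
    if slice_size == 0:
        # Lanes already differ by at most one byte; nothing to move.
        return list(lanes), 0

    balanced = list(lanes)
    # Step 3: cut a portion of the first data from the first lane.
    portion = balanced[first][-slice_size:]
    balanced[first] = balanced[first][:-slice_size]
    # Step 4: append that portion to the second lane.
    balanced[second] = balanced[second] + portion
    return balanced, slice_size


if __name__ == "__main__":
    partition = [b"A" * 10, b"B" * 2, b"C" * 5]
    balanced, slice_size = balance_lanes(partition)
    print(slice_size, [len(lane) for lane in balanced])
```

In this sketch the two extreme lanes end up the same length, which is one plausible reason to even out lanes so that parallel lanes finish at similar times. A decoder would also need the slice size and the lane indices to undo the move; how that metadata is stored is not described in the abstract.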