-
公开(公告)号:US20220284075A1
公开(公告)日:2022-09-08
申请号:US17686409
申请日:2022-03-04
Applicant: Shanghai Biren Technology Co.,Ltd
Inventor: YuFei ZHANG , Zhu LIANG , Min GAO
Abstract: The embodiments of the disclosure relate to a computing device, a computing apparatus, and a method of warp accumulation and relate to the field of computers. The computing device includes a storage unit and an accumulation computing unit coupled to the storage unit. The accumulation computing unit is configured to receive, from a vector processing unit coupled to the computing device, a first warp accumulation instruction, a plurality of first values corresponding to a warp lane number, and a first storage address; generate a current accumulation result based on the plurality of first values in response to the first warp accumulation instruction; and store the current accumulation result in the first storage address in the storage unit to be read by the vector processing unit. In this way, accumulation in a warp may be decoupled to dedicated hardware for processing, and overall accumulation performance may thus be significantly improved.
-
2.
公开(公告)号:US20230117626A1
公开(公告)日:2023-04-20
申请号:US17958441
申请日:2022-10-03
Applicant: Shanghai Biren Technology Co.,Ltd
IPC: G06F17/15
Abstract: A convolution apparatus including a data memory, a matrix unknit-knit device, and a convolution operation device, a convolution method, a matrix unknit-knit device, and a matrix unknit-knit method are provided. The matrix unknit-knit device unknits a first matrix stored in the data memory into s*s second matrices (or knits the s*s second matrices into the first matrix), where s is greater than 1. Pixels in each of s*s subblocks in the first matrix serve one-to-one as pixels of the s*s second matrices. A convolution operation device unknits a convolution kernel of a convolution operation with a stride of s into s*s sub-kernels, uses any one of the sub-kernels to perform a convolution operation with a stride of 1 on one corresponding second matrix, and accumulates the operation results the second matrices as the operation result of performing the convolution operation with a stride of s on the first matrix.
-
公开(公告)号:US20220292632A1
公开(公告)日:2022-09-15
申请号:US17692198
申请日:2022-03-11
Applicant: Shanghai Biren Technology Co.,Ltd
Inventor: YuFei ZHANG , Zhu LIANG , ChengKun SUN
Abstract: A method for computing, a computing device, and a computer-readable storage medium are provided. The method includes determining a first pixel block in a cache. The first pixel block is composed of a 2m row×2n column pixel matrix and includes original pixel data and pixel data related to the original pixel data. The first pixel block is read from the cache. At least part of the pixel data related to the original pixel data is used for padding related to the original pixel data. The original pixel data includes pixel data from the (n+1)th column to the 2nth column in the (m+1)th row to the 2mth row in the 2m row×2 n column pixel matrix. When reading data from the cache, pixel data that needs to be obtained after insert-zero and padding operations on the original pixel data in back propagation can be read at one time.
-
-