-
公开(公告)号:US11775611B2
公开(公告)日:2023-10-03
申请号:US16816247
申请日:2020-03-11
Applicant: Samsung Electronics Co., Ltd.
Inventor: Jun Fang , Joseph H. Hassoun , Ali Shafiee Ardestani , Hamzah Ahmed Ali Abdelaziz , Georgios Georgiadis , Hui Chen , David Philip Lloyd Thorsley
Abstract: In some embodiments, a method of quantizing an artificial neural network includes dividing a quantization range for a tensor of the artificial neural network into a first region and a second region, and quantizing values of the tensor in the first region separately from values of the tensor in the second region. In some embodiments, linear or nonlinear quantization are applied to values of the tensor in the first region and the second region. In some embodiments, the method includes locating a breakpoint between the first region and the second region by substantially minimizing an expected quantization error over at least a portion of the quantization range. In some embodiments, the expected quantization error is minimized by solving analytically and/or searching numerically.