-
1.
Publication number: US11461614B2
Publication date: 2022-10-04
Application number: US15838552
Filing date: 2017-12-12
Applicant: Hailo Technologies Ltd.
Inventor: Avi Baum, Or Danon, Daniel Ciubotariu, Mark Grobman, Alex Finkelstein
IPC: G06N3/04, G06N3/063, G06N20/00, G06F17/10, G06F5/01, G06N3/08, G06N3/02, G06F12/02, G06F12/06, G06F30/30, G06F30/27, G06V10/40, G06F7/501, G06F7/523, G06F9/50, G06F13/16, G06F9/30, G06K9/62
Abstract: A novel and useful system and method of data driven quantization optimization of weights and input data in an artificial neural network (ANN). The system reduces quantization implications (i.e. error) in a limited resource system by employing the information available in the data actually observed by the system. Data counters in the layers of the network observe the data input thereto. The distribution of the data is used to determine an optimum quantization scheme to apply to the weights, input data, or both. The mechanism is sensitive to the data observed at the input layer of the network. As a result, the network auto-tunes to optimize the instance specific representation of the network. The network becomes customized (i.e. specialized) to the inputs it observes and better fits itself to the subset of the sample space that is applicable to its actual data flow. As a result, nominal process noise is reduced and detection accuracy improves. In addition, the mechanism enables the reduction of the representation space and further reduces the memory (and energy thereof) needed to represent the network properties.
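The data-driven idea in this abstract can be illustrated with a small sketch. It is not the patented mechanism: the `DataCounter` class, its fixed-width histogram bins, the coverage-based clipping rule, and the symmetric uniform quantizer are all assumptions chosen for illustration. The sketch only shows how a range derived from observed data can yield a lower quantization error than a fixed nominal range.

```python
from collections import Counter


class DataCounter:
    """Hypothetical per-layer counter: histograms the magnitudes it observes."""

    def __init__(self, bin_width=0.1):
        self.bin_width = bin_width
        self.counts = Counter()
        self.total = 0

    def observe(self, values):
        # Record each value's magnitude in a fixed-width histogram bin.
        for v in values:
            self.counts[int(abs(v) // self.bin_width)] += 1
            self.total += 1

    def clip_threshold(self, coverage=0.99):
        # Smallest bin edge that covers `coverage` of the observed values
        # (an assumed rule, not taken from the patent).
        seen = 0
        for b in sorted(self.counts):
            seen += self.counts[b]
            if seen >= coverage * self.total:
                return (b + 1) * self.bin_width
        return self.bin_width


def quantize(values, max_abs, bits=8):
    # Symmetric uniform quantizer over [-max_abs, max_abs], with clipping.
    levels = 2 ** (bits - 1) - 1
    scale = max_abs / levels
    out = []
    for v in values:
        q = max(-levels, min(levels, round(v / scale)))
        out.append(q * scale)
    return out


def mean_sq_error(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)
```

Feeding the counter a sample of layer inputs and passing `clip_threshold()` as `max_abs` narrows the quantization grid to the range the data actually occupies, which is the effect the abstract attributes to observing the input distribution.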
-
2.
Publication number: US20180285678A1
Publication date: 2018-10-04
Application number: US15669933
Filing date: 2017-08-06
Applicant: Hailo Technologies Ltd.
Inventor: Avi Baum, Or Danon, Mark Grobman, Hadar Zeitlin
CPC classification number: G06F12/0207, G06F5/01, G06F7/501, G06F7/523, G06F9/30054, G06F9/5016, G06F9/5027, G06F12/0646, G06F12/0692, G06F13/1663, G06F17/10, G06K9/46, G06K9/62, G06N3/02, G06N3/04, G06N3/0454, G06N3/063, G06N3/08, G06N3/082, G06N3/084, Y02D10/14
Abstract: A novel and useful artificial neural network that incorporates emphasis and focus techniques to extract more information from one or more portions of an input image compared to the rest of the image. The ANN recognizes that valuable information in an input image is typically not distributed throughout the image but rather is concentrated in one or more regions. Rather than implement CNN layers sequentially (i.e. row by row) on the input domain of each layer, the present invention leverages the fact that valuable information is focused in one or more regions of the image where it is desirable to apply more attention and for which it is desired to apply more elaborate evaluation. Precision dilution can be applied to those portions of the input image that are not the center of focus and emphasis. A spatially aware function determines the location(s) of the areas of focus and is applied to the first convolutional layer. Dilution of precision is performed before and/or after the first convolutional layer, thereby significantly reducing computation and power requirements.
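A minimal sketch of the focus and precision-dilution idea follows. It makes several assumptions that are not in the abstract: the image is a 2D list of 8-bit integers, the focus region is a square window, `find_focus` is a hypothetical stand-in for the spatially aware function (it simply picks the window with the largest pixel spread), and dilution is modeled as zeroing low-order bits outside the focus region.

```python
def find_focus(image, size):
    """Hypothetical stand-in for the spatially aware function.

    Returns (top, left, height, width) of the size x size window whose
    pixels have the largest max-min spread.
    """
    rows, cols = len(image), len(image[0])
    best, best_pos = -1, (0, 0)
    for top in range(rows - size + 1):
        for left in range(cols - size + 1):
            window = [image[r][c]
                      for r in range(top, top + size)
                      for c in range(left, left + size)]
            spread = max(window) - min(window)
            if spread > best:
                best, best_pos = spread, (top, left)
    return (best_pos[0], best_pos[1], size, size)


def dilute_precision(image, focus, keep_bits=4, total_bits=8):
    """Keep full precision inside `focus`; zero the low-order bits elsewhere."""
    top, left, h, w = focus
    shift = total_bits - keep_bits
    out = []
    for r, row in enumerate(image):
        new_row = []
        for c, px in enumerate(row):
            inside = top <= r < top + h and left <= c < left + w
            new_row.append(px if inside else (px >> shift) << shift)
        out.append(new_row)
    return out
```

Pixels outside the focus window keep only their top `keep_bits` bits, so later layers could process them with narrower arithmetic, while the focus window is passed through unchanged.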
-
3.
Publication number: US10387298B2
Publication date: 2019-08-20
Application number: US15669933
Filing date: 2017-08-06
Applicant: Hailo Technologies Ltd.
Inventor: Avi Baum, Or Danon, Mark Grobman, Hadar Zeitlin
IPC: G06F12/02, G06N3/063, G06F12/06, G06F7/501, G06F7/523, G06F9/50, G06N3/04, G06F17/10, G06F5/01, G06N3/08, G06F13/16, G06F9/30, G06K9/46, G06K9/62, G06N3/02
Abstract: A novel and useful artificial neural network that incorporates emphasis and focus techniques to extract more information from one or more portions of an input image compared to the rest of the image. The ANN recognizes that valuable information in an input image is typically not distributed throughout the image but rather is concentrated in one or more regions. Rather than implement CNN layers sequentially (i.e. row by row) on the input domain of each layer, the present invention leverages the fact that valuable information is focused in one or more regions of the image where it is desirable to apply more attention and for which it is desired to apply more elaborate evaluation. Precision dilution can be applied to those portions of the input image that are not the center of focus and emphasis. A spatially aware function determines the location(s) of the areas of focus and is applied to the first convolutional layer. Dilution of precision is performed before and/or after the first convolutional layer, thereby significantly reducing computation and power requirements.
-
4.
Publication number: US20180285736A1
Publication date: 2018-10-04
Application number: US15838552
Filing date: 2017-12-12
Applicant: Hailo Technologies Ltd.
Inventor: Avi Baum, Or Danon, Daniel Ciubotariu, Mark Grobman, Alex Finkelstein
Abstract: A novel and useful system and method of data driven quantization optimization of weights and input data in an artificial neural network (ANN). The system reduces quantization implications (i.e. error) in a limited resource system by employing the information available in the data actually observed by the system. Data counters in the layers of the network observe the data input thereto. The distribution of the data is used to determine an optimum quantization scheme to apply to the weights, input data, or both. The mechanism is sensitive to the data observed at the input layer of the network. As a result, the network auto-tunes to optimize the instance specific representation of the network. The network becomes customized (i.e. specialized) to the inputs it observes and better fits itself to the subset of the sample space that is applicable to its actual data flow. As a result, nominal process noise is reduced and detection accuracy improves. In addition, the mechanism enables the reduction of the representation space and further reduces the memory (and energy thereof) needed to represent the network properties.