-
公开(公告)号:US11734568B2
公开(公告)日:2023-08-22
申请号:US16274599
申请日:2019-02-13
Applicant: Google LLC
Inventor: Jyrki Alakuijala , Ruud van Asseldonk , Robert Obryk , Krzysztof Potempa
Abstract: The present disclosure provides systems and methods for modification (e.g., pruning, compression, quantization, etc.) of artificial neural networks based on estimations of the utility of network connections (also known as “edges”). In particular, the present disclosure provides novel techniques for estimating the utility of one or more edges of a neural network in a fashion that requires far less expenditure of resources than calculation of the actual utility. Based on these estimated edge utilities, a computing system can make intelligent decisions regarding network pruning, network quantization, or other modifications to a neural network. In particular, these modifications can reduce resource requirements associated with the neural network. By making these decisions with knowledge of and based on the utility of various edges, this reduction in resource requirements can be achieved with only a minimal, if any, degradation of network performance (e.g., prediction accuracy).
-
2.
公开(公告)号:US20190251444A1
公开(公告)日:2019-08-15
申请号:US16274599
申请日:2019-02-13
Applicant: Google LLC
Inventor: Jyrki Alakuijala , Ruud van Asseldonk , Robert Obryk , Krzysztof Potempa
CPC classification number: G06N3/082 , G06N3/0454 , G06N3/084 , G06N20/20
Abstract: The present disclosure provides systems and methods for modification (e.g., pruning, compression, quantization, etc.) of artificial neural networks based on estimations of the utility of network connections (also known as “edges”). In particular, the present disclosure provides novel techniques for estimating the utility of one or more edges of a neural network in a fashion that requires far less expenditure of resources than calculation of the actual utility. Based on these estimated edge utilities, a computing system can make intelligent decisions regarding network pruning, network quantization, or other modifications to a neural network. In particular, these modifications can reduce resource requirements associated with the neural network. By making these decisions with knowledge of and based on the utility of various edges, this reduction in resource requirements can be achieved with only a minimal, if any, degradation of network performance (e.g., prediction accuracy).
-