-
公开(公告)号:US11651192B2
公开(公告)日:2023-05-16
申请号:US16788261
申请日:2020-02-11
Applicant: Apple Inc.
Inventor: James C. Gabriel , Mohammad Rastegari , Hessam Bagherinezhad , Saman Naderiparizi , Anish Prabhu , Sophie Lebrecht , Jonathan Gelsey , Sayyed Karen Khatamifard , Andrew L. Chronister , David Bakin , Andrew Z. Luo
Abstract: Systems and processes for training and compressing a convolutional neural network model include the use of quantization and layer fusion. Quantized training data is passed through a convolutional layer of a neural network model to generate convolutional results during a first iteration of training the neural network model. The convolutional results are passed through a batch normalization layer of the neural network model to update normalization parameters of the batch normalization layer. The convolutional layer is fused with the batch normalization layer to generate a first fused layer and the fused parameters of the fused layer are quantized. The quantized training data is passed through the fused layer using the quantized fused parameters to generate output data, which may be quantized for a subsequent layer in the training iteration.
-
公开(公告)号:US12165337B2
公开(公告)日:2024-12-10
申请号:US17068750
申请日:2020-10-12
Applicant: Apple Inc.
Inventor: Anish Prabhu , Sayyed Karen Khatamifard , Hessam Bagherinezhad
IPC: G06T7/20 , G06F18/21 , G06F18/214 , G06F18/24 , G06F18/25 , G06N5/04 , G06N20/00 , G06T7/254 , G06T7/73 , G06V10/28 , G06V10/44 , G06V10/764 , G06V10/778 , G06V10/82 , G06V20/52 , G06V40/20
Abstract: Aspects of the subject technology relate to machine learning based object recognition using pixel difference information. A difference image generated by subtraction of a current image from one or more previous images can be provided, as input, to a machine-learning engine. The machine-learning may output a detected object or a detected action based, at least in part, on the difference image. In this way, temporal information about the object can be provided to, and used by, a machine-learning model that is structured to accept image input.
-