-
公开(公告)号:US20220414441A1
公开(公告)日:2022-12-29
申请号:US17902776
申请日:2022-09-02
Applicant: Google LLC
IPC: G06N3/063 , G06F17/16 , G06F17/15 , G06F30/20 , G06N3/04 , G06F30/27 , G06N3/08 , G06N3/10 , G06F30/367 , G06F30/18
Abstract: Methods and systems, including computer programs encoded on a computer storage medium. In one aspect, a method includes the actions of receiving a request to perform convolutional computations for a neural network on a hardware circuit having a matrix computation unit, the request specifying the convolutional computation to be performed on a feature tensor and a filter and padding applied to the feature tensor prior to performing the convolutional computation; and generating instructions that when executed by the hardware circuit cause the hardware circuit to perform operations comprising: transferring feature tensor data from a main memory of the hardware circuit to a scratchpad memory of the hardware circuit; and repeatedly performing the following operations: identifying a current subset of the feature tensor; and determining whether a memory view into the scratchpad memory for the current subset is consistent with a memory view of the current subset in the main memory.
-
公开(公告)号:US20210056396A1
公开(公告)日:2021-02-25
申请号:US16548555
申请日:2019-08-22
Applicant: Google LLC
Abstract: Methods and systems, including computer programs encoded on a computer storage medium. In one aspect, a method includes the actions of receiving a request to perform convolutional computations for a neural network on a hardware circuit having a matrix computation unit, the request specifying the convolutional computation to be performed on a feature tensor and a filter and padding applied to the feature tensor prior to performing the convolutional computation; and generating instructions that when executed by the hardware circuit cause the hardware circuit to perform operations comprising: transferring feature tensor data from a main memory of the hardware circuit to a scratchpad memory of the hardware circuit; and repeatedly performing the following operations: identifying a current subset of the feature tensor; and determining whether a memory view into the scratchpad memory for the current subset is consistent with a memory view of the current subset in the main memory.
-
公开(公告)号:US11449739B2
公开(公告)日:2022-09-20
申请号:US16548555
申请日:2019-08-22
Applicant: Google LLC
IPC: G06N3/063 , G06N3/08 , G06N3/04 , G06N3/10 , G06F17/15 , G06F17/16 , G06F30/18 , G06F30/20 , G06F30/27 , G06F30/367
Abstract: Methods and systems, including computer programs encoded on a computer storage medium. In one aspect, a method includes the actions of receiving a request to perform convolutional computations for a neural network on a hardware circuit having a matrix computation unit, the request specifying the convolutional computation to be performed on a feature tensor and a filter and padding applied to the feature tensor prior to performing the convolutional computation; and generating instructions that when executed by the hardware circuit cause the hardware circuit to perform operations comprising: transferring feature tensor data from a main memory of the hardware circuit to a scratchpad memory of the hardware circuit; and repeatedly performing the following operations: identifying a current subset of the feature tensor; and determining whether a memory view into the scratchpad memory for the current subset is consistent with a memory view of the current subset in the main memory.
-
公开(公告)号:US20210390410A1
公开(公告)日:2021-12-16
申请号:US17347416
申请日:2021-06-14
Applicant: Google LLC
Inventor: Ashish Teku Vaswani , Prajit Ramachandran , Aravind Srinivas Lakshminarayanan , Blake Alan Hechtman , Niki J. Parmar
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing images using a computer vision neural network that has one or more local self-attention layers. Each local self-attention layer is configured to apply or more local self-attention mechanisms to the layer input to the local self-attention layer.
-
公开(公告)号:US20240232598A1
公开(公告)日:2024-07-11
申请号:US18469272
申请日:2023-09-18
Applicant: Google LLC
IPC: G06N3/063 , G06F17/15 , G06F17/16 , G06F30/18 , G06F30/20 , G06F30/27 , G06F30/367 , G06N3/045 , G06N3/086 , G06N3/10
CPC classification number: G06N3/063 , G06F17/15 , G06F17/16 , G06F30/18 , G06F30/20 , G06F30/27 , G06F30/367 , G06N3/045 , G06N3/086 , G06N3/10
Abstract: Methods and systems, including computer programs encoded on a computer storage medium. In one aspect, a method includes the actions of receiving a request to perform convolutional computations for a neural network on a hardware circuit having a matrix computation unit, the request specifying the convolutional computation to be performed on a feature tensor and a filter and padding applied to the feature tensor prior to performing the convolutional computation; and generating instructions that when executed by the hardware circuit cause the hardware circuit to perform operations comprising: transferring feature tensor data from a main memory of the hardware circuit to a scratchpad memory of the hardware circuit; and repeatedly performing the following operations: identifying a current subset of the feature tensor; and determining whether a memory view into the scratchpad memory for the current subset is consistent with a memory view of the current subset in the main memory.
-
公开(公告)号:US20230206126A1
公开(公告)日:2023-06-29
申请号:US18088229
申请日:2022-12-23
Applicant: Google LLC
Inventor: Blake Alan Hechtman
IPC: G06N20/00
CPC classification number: G06N20/00
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for transforming patterns of operations on tensors in a computational graph to reduce the memory burden incurred when reshape operations are performed, in particular when deployed to hardware platforms that have vector instructions or vector memory requiring alignment of operands.
-
公开(公告)号:US20200349465A1
公开(公告)日:2020-11-05
申请号:US16402981
申请日:2019-05-03
Applicant: Google LLC
Inventor: Blake Alan Hechtman
IPC: G06N20/00
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for transforming patterns of operations on tensors in a computational graph to reduce the memory burden incurred when reshape operations are performed, in particular when deployed to hardware platforms that have vector instructions or vector memory requiring alignment of operands.
-
公开(公告)号:US11537939B2
公开(公告)日:2022-12-27
申请号:US16402981
申请日:2019-05-03
Applicant: Google LLC
Inventor: Blake Alan Hechtman
IPC: G06N20/00
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for transforming patterns of operations on tensors in a computational graph to reduce the memory burden incurred when reshape operations are performed, in particular when deployed to hardware platforms that have vector instructions or vector memory requiring alignment of operands.
-
9.
公开(公告)号:US11500959B2
公开(公告)日:2022-11-15
申请号:US16543282
申请日:2019-08-16
Applicant: Google LLC
Inventor: David Alexander Majnemer , Blake Alan Hechtman
Abstract: Methods, systems, and apparatus, including instructions encoded on storage media, for performing reduction of gradient vectors and similarly structured data that are generated in parallel, for example, on nodes organized in a mesh or torus topology defined by connections in at least two dimension between the nodes. The methods provide parallel computation and communication between nodes in the topology.
-
10.
公开(公告)号:US20210049231A1
公开(公告)日:2021-02-18
申请号:US16543282
申请日:2019-08-16
Applicant: Google LLC
Inventor: David Alexander Majnemer , Blake Alan Hechtman
Abstract: Methods, systems, and apparatus, including instructions encoded on storage media, for performing reduction of gradient vectors and similarly structured data that are generated in parallel, for example, on nodes organized in a mesh or torus topology defined by connections in at least two dimension between the nodes. The methods provide parallel computation and communication between nodes in the topology.
-
-
-
-
-
-
-
-
-