Patent search ap:("FACEBOOK TECHNOLOGIES Page LLC") AND inv:"LAI

1.

发明申请
COUNTER BASED MULTIPLY-AND-ACCUMULATE CIRCUIT FOR NEURAL NETWORK 审中-公开

公开(公告)号：WO2021003032A1

公开(公告)日：2021-01-07

申请号：PCT/US2020/038782

申请日：2020-06-19

Applicant: FACEBOOK TECHNOLOGIES, LLC

Inventor： LAI, Liangzhen , CHUANG, Pierce I-Jen

IPC: G06F7/544

Abstract: Disclosed herein includes a system, a method, and a device for improving computation efficiency of a neural network. In one aspect, adder circuitry is configured to add input data from processing of the neural network and a first number of bits of accumulated data for the neural network to generate summation data. In one aspect, according to a carry value of the adding from the adder circuitry, a multiplexer is configured to select between i) a second number of bits of the accumulated data and ii) incremented data comprising the second number of bits of the accumulated data incremented by a predetermined value. The summation data appended with the selected one of the second number of bits of the accumulated data or the incremented data may form appended data.

2.

发明申请
SYSTEM AND METHOD FOR SUPPORTING ALTERNATE NUMBER FORMAT FOR EFFICIENT MULTIPLICATION 审中-公开

公开(公告)号：WO2021011316A1

公开(公告)日：2021-01-21

申请号：PCT/US2020/041454

申请日：2020-07-09

Applicant: FACEBOOK TECHNOLOGIES, LLC

Inventor： VENKATESH, Ganesh , LAI, Liangzhen , CHUANG, Pierce, I-Jen , LI, Meng , CHANDRA, Vikas

IPC: G06F7/544 , G06F7/523

Abstract: Disclosed herein includes a system, a method, and a device including shift circuitry and add circuitry for performing multiplication of a first value and a second value for a neural network. The first value has a predetermined format including a first bit, and two or more second bits to represent a value of zero or 2 n where n is an integer greater than or equal to 0. The device shifts, when the two or more second bits represent the value of 2 n , the second value by (n+1) bits via the shift circuitry to provide a first result, selectively outputs zero or the second value, based on a value of the first bit of the first value, to provide a second result, and adds the first result and the second results via the add circuitry to provide a result of the multiplication of the first and second values.

3.

发明申请
SYSTEM AND METHOD FOR PERFORMING SMALL CHANNEL COUNT CONVOLUTIONS IN ENERGY-EFFICIENT INPUT OPERAND STATIONARY ACCELERATOR 审中-公开

公开(公告)号：WO2021011314A8

公开(公告)日：2021-01-21

申请号：PCT/US2020/041449

申请日：2020-07-09

Applicant: FACEBOOK TECHNOLOGIES, LLC

Inventor： VENKATESH, Ganesh , LAI, Liangzhen , CHUANG, Pierce, I-Jen , LI, Meng

IPC: G06N3/04 , G06N3/063 , G06N3/08 , G06N3/0454

Abstract: Disclosed herein includes a system, a method, and a device for receiving input data to generate a plurality of outputs for a layer of a neural network. The plurality of outputs are arranged in a first array. Dimensions of the first array may be compared with dimensions of a processing unit (PE) array including a plurality of PEs. According to a result of the comparing, the first array is partitioned into subarrays by the processor. Each of the subarrays has dimensions less than or equal to the dimensions of the PE array. A first group of PEs in the PE array is assigned to a first one of the subarrays. A corresponding output of the plurality of outputs is generated using a portion of the input data by each PE of the first group of PEs assigned to the first one of the subarrays.

4.

发明申请
SYSTEMS AND METHODS FOR PIPELINED PARALLELISM TO ACCELERATE DISTRIBUTED PROCESSING 审中-公开

公开(公告)号：WO2021007333A1

公开(公告)日：2021-01-14

申请号：PCT/US2020/041218

申请日：2020-07-08

Applicant: FACEBOOK TECHNOLOGIES, LLC

Inventor： VENKATESH, Ganesh , LAI, Liangzhen

IPC: G06N3/04 , G06N3/063 , G06F9/50 , G06F9/5038 , G06F9/5061 , G06N3/0454 , G06N3/0635 , G06N3/08 , D10

Abstract: Disclosed herein includes a system, a method, and a device for pipelined parallelism to accelerate distributed learning network graph. First data for a first layer of a neural network may be stored in memory. First circuitry including a first plurality of processing element (PE) circuits may read the first data from the memory and perform computation for the first layer of the neural network using the first data to generate second data. The first circuitry includes a plurality of buffers for outputting the generated second data as input to second circuitry to perform computation for a second layer of the neural network. The second circuitry includes a second plurality of PE circuits configured to perform computation for the second layer of the neural network using the second data.

5.

发明申请
SYSTEMS AND METHODS FOR READING AND WRITING SPARSE DATA IN A NEURAL NETWORK ACCELERATOR 审中-公开

公开(公告)号：WO2021006965A1

公开(公告)日：2021-01-14

申请号：PCT/US2020/034848

申请日：2020-05-28

Applicant: FACEBOOK TECHNOLOGIES, LLC

Inventor： VENKATESH, Ganesh , LAI, Liangzhen , CHUANG, Pierce I-Jen , LI, Meng

IPC: G06F12/02 , G06F12/04 , G06N3/063 , G06F1/3234 , G06F12/0886 , G06N3/04 , G06F13/16

Abstract: Disclosed herein includes a system, a method, and a device for reading and writing sparse data in a neural network accelerator. A plurality of slices can be established to access a memory having an access size of a data word. A first slice can be configured to access a first side of the data word in memory. Circuitry can access a mask identifying byte positions within the data word having non-zero values. The circuitry can modify the data word to have non-zero byte values stored starting at an end of the first side, and any zero byte values stored in a remainder of the data word. A determination can be made whether a number of non-zero byte values is less than or equal to a first access size of the first slice. The circuitry can write the modified data word to the memory via at least the first slice.

6.

发明申请
SYSTEM AND METHOD FOR PERFORMING SMALL CHANNEL COUNT CONVOLUTIONS IN ENERGY-EFFICIENT INPUT STATIONARY ACCELERATOR 审中-公开

公开(公告)号：WO2021011314A1

公开(公告)日：2021-01-21

申请号：PCT/US2020/041449

申请日：2020-07-09

Applicant: FACEBOOK TECHNOLOGIES, LLC

Inventor： VENKATESH, Ganesh , LAI, Liangzhen , CHUANG, Pierce, I-Jen , LI, Meng

IPC: G06N3/04 , G06N3/063 , G06N3/08

Abstract: Disclosed herein includes a system, a method, and a device for receiving input data to generate a plurality of outputs for a layer of a neural network. The plurality of outputs are arranged in a first array. Dimensions of the first array may be compared with dimensions of a processing unit (PE) array including a plurality of PEs. According to a result of the comparing, the first array is partitioned into subarrays by the processor. Each of the subarrays has dimensions less than or equal to the dimensions of the PE array. A first group of PEs in the PE array is assigned to a first one of the subarrays. A corresponding output of the plurality of outputs is generated using a portion of the input data by each PE of the first group of PEs assigned to the first one of the subarrays.

7.

发明申请
SYSTEMS, METHODS, AND DEVICES FOR EARLY-EXIT FROM CONVOLUTION 审中-公开

公开(公告)号：WO2021007337A1

公开(公告)日：2021-01-14

申请号：PCT/US2020/041226

申请日：2020-07-08

Applicant: FACEBOOK TECHNOLOGIES, LLC

Inventor： VENKATESH, Ganesh , LAI, Liangzhen , CHUANG, Pierce, I-Jen

IPC: G06N3/04 , G06N3/063 , G06N3/08

Abstract: Disclosed herein includes a system, a method, and a device for early-exit from convolution. In some embodiments, at least one processing element (PE) circuit is configured to perform, for a node of a neural network corresponding to a dot-product operation with a set of operands, computation using a subset of the set of operands to generate a dot-product value of the subset of the set of operands. The at least one PE circuit can compare the dot-product value of the subset of the set of operands, to a threshold value. The at least one PE circuit can determine whether to activate the node of the neural network, based at least on a result of the comparing.

8.

发明申请
POWER EFFICIENT MULTIPLY-ACCUMULATE CIRCUITRY 审中-公开

公开(公告)号：WO2021007325A1

公开(公告)日：2021-01-14

申请号：PCT/US2020/041207

申请日：2020-07-08

Applicant: FACEBOOK TECHNOLOGIES, LLC

Inventor： LAI, Liangzhen

IPC: G06F9/30 , G06N3/04 , G06N3/063 , G06N3/08

Abstract: Disclosed herein includes a system, a method, and a device for multiply- accumulate operation. In one aspect, an input operand is received by control circuitry. In one aspect, the control circuitry determines a sparsity of the input operand, where the sparsity may indicate whether a value of the input operand has a predetermined value or not. In one aspect, the control circuitry determines a stationarity of the input operand, where the stationarity may indicate whether the value of the input operand changes over one or more clock cycles. In one aspect, the input operand is provided to multiply- accumulate circuitry as an input, according to the determined sparsity and stationarity of the input operand.

9.

发明申请
SYSTEMS AND METHODS FOR DISTRIBUTING A NEURAL NETWORK ACROSS MULTIPLE COMPUTING DEVICES 审中-公开

公开(公告)号：WO2021007257A1

公开(公告)日：2021-01-14

申请号：PCT/US2020/041077

申请日：2020-07-07

Applicant: FACEBOOK TECHNOLOGIES, LLC

Inventor： VENKATESH, Ganesh , LAI, Liangzhen , CHUANG, Pierce I-Jen , CHANDRA, Vikas

IPC: G06N3/04 , G02B27/01 , G06N3/063 , G06N3/08

Abstract: Disclosed herein is a method for using a neural network across multiple devices. The method can include receiving, by a first device configured with a first one or more layers of a neural network, input data for processing via the neural network implemented across the first device and a second device. The method can include outputting, by the first one or more layers of the neural network implemented on the first device, a data set that is reduced in size relative to the input data while identifying one or more features of the input data for processing by a second one or more layers of the neural network. The method can include communicating, by the first device, the data set to the second device for processing via the second one or more layers of the neural network implemented on the second device.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification