DEEP LEARNING OPTIMIZATION THROUGH ZERO TILE MANIPULATION

    公开(公告)号:US20250053803A1

    公开(公告)日:2025-02-13

    申请号:US18448390

    申请日:2023-08-11

    Abstract: Processing zero weights within a data structure when performing a multiply and accumulate operation in a deep learning network as the result is itself a zero. Avoiding this step may save time and reduce power consumption in the training and operation of deep learning networks. An approach to zero-tile manipulation may be presented herein. An approach to permute and pack weighted data structures into zero-tile data structures may be presented. The zero-tiles may be configured in a structure which is optimized for the architecture of a parallel processing unit. The zero tile data structures may comprise vectors which instruct a the components in processing element to operate in a manner which prevents the element from expending energy when processing the zero tiles. An apparatus may also be presented in the immediate disclosure which can be configured to accept a zero-tile data structure.

    ENERGY-EFFICIENT ANALOG-TO-DIGITAL CONVERSION IN MIXED SIGNAL CIRCUITRY

    公开(公告)号:US20230188146A1

    公开(公告)日:2023-06-15

    申请号:US17550493

    申请日:2021-12-14

    CPC classification number: H03M1/002

    Abstract: An apparatus comprises at least one processor and at least one memory including instruction code configured to, with the at least one processor, cause the apparatus at least to perform a successive approximation analog-to-digital conversion of an analog input, representing a result of multiplication of first and second vectors, to a digital output by determining an upper bound on the result of multiplication of the first and second vectors, identifying, based at least in part on the determined upper bound, at least a portion of the successive approximation analog-to-digital conversion to be skipped, and skipping the identified portion of the successive approximation analog-to-digital conversion.

    POWER-EFFICIENT MIXED-SIGNAL CIRCUIT INCLUDING ANALOG MULTIPLY AND ACCUMULATE ENGINES

    公开(公告)号:US20250077804A1

    公开(公告)日:2025-03-06

    申请号:US18458141

    申请日:2023-08-29

    Abstract: A first circuit is configured to split a first integer value into a first coarse value and a first fine value, and split a second integer value into a second coarse value and a second fine value. A second circuit performs an analog multiply and accumulate (MAC) operation on the first and second coarse values to produce a first analog output, perform an analog MAC operation on the first coarse value and the second fine value to produce a second analog output, perform an analog MAC operation on the first fine value and the second coarse value to produce a third analog output, and perform an analog MAC operation on the first and second fine values together to produce a fourth analog output. A third circuit is configured to perform analog-to-digital (A/D) conversion on and combine the analog output signals to produce a reconstructed digital output signal.

    Scalable Switch Capacitor Computation Cores for Accurate and Efficient Deep Learning Inference

    公开(公告)号:US20240176584A1

    公开(公告)日:2024-05-30

    申请号:US18071230

    申请日:2022-11-29

    CPC classification number: G06F7/523

    Abstract: An apparatus comprising: a first plurality of inputs representing an activation input vector; a second plurality of inputs representing a weight input vector; an analog multiplier-and-accumulator to generate a first analog voltage representing a first multiply-and-accumulate result for the said first inputs and the second inputs; a voltage multiplier that takes the said first analog voltage and produces a second analog voltage representing, a second multiply-and-accumulate result by multiplying at least one scaling factor to the first analog voltage; an analog to digital converter configured to convert the said second analog voltage multiply-and-accumulate result into a digital signal using a limited-precision operation during a neural network inference operation; and a hardware controller configured to determine the at least one scaling factor based on the first multiply-and-accumulate result, or a software controller configured to determine the at least one scaling factor based on the first multiply-and-accumulate result.

    Energy-efficient analog-to-digital conversion in mixed signal circuitry

    公开(公告)号:US11811416B2

    公开(公告)日:2023-11-07

    申请号:US17550493

    申请日:2021-12-14

    CPC classification number: H03M1/002

    Abstract: An apparatus comprises at least one processor and at least one memory including instruction code configured to, with the at least one processor, cause the apparatus at least to perform a successive approximation analog-to-digital conversion of an analog input, representing a result of multiplication of first and second vectors, to a digital output by determining an upper bound on the result of multiplication of the first and second vectors, identifying, based at least in part on the determined upper bound, at least a portion of the successive approximation analog-to-digital conversion to be skipped, and skipping the identified portion of the successive approximation analog-to-digital conversion.

Patent Agency Ranking