Approximation of matrices for matrix multiply operations

    公开(公告)号:US12197533B2

    公开(公告)日:2025-01-14

    申请号:US17214784

    申请日:2021-03-26

    Abstract: A processing device is provided which comprises memory configured to store data and a processor configured to receive a portion of data of a first matrix comprising a first plurality of elements and receive a portion of data of a second matrix comprising a second plurality of elements. The processor is also configured to determine values for a third matrix by dropping a number of products from products of pairs of elements of the first and second matrices based on approximating the products of the pairs of elements as a sum of the exponents of the pairs of elements and performing matrix multiplication on remaining products of the pairs of elements of the first and second matrices.

    Speculative dram request enabling and disabling

    公开(公告)号:US12189953B2

    公开(公告)日:2025-01-07

    申请号:US17956417

    申请日:2022-09-29

    Abstract: Methods, devices, and systems for retrieving information based on cache miss prediction. It is predicted, based on a history of cache misses at a private cache, that a cache lookup for the information will miss a shared victim cache. A speculative memory request is enabled based on the prediction that the cache lookup for the information will miss the shared victim cache. The information is fetched based on the enabled speculative memory request.

    Cache blocking for dispatches
    34.
    发明授权

    公开(公告)号:US12189534B2

    公开(公告)日:2025-01-07

    申请号:US17564474

    申请日:2021-12-29

    Abstract: A processing system divides successive dispatches of work items into portions. The successive dispatches are separated from each other by barriers, each barrier indicating that the work items of the previous dispatch must complete execution before work items of a subsequent dispatch can begin execution. In some embodiments, the processing system interleaves execution of portions of a first dispatch with portions of subsequent dispatches that consume data produced by the first dispatch. The processing system thereby reduces the amount of data written to the local cache by a producer dispatch while preserving data locality for a subsequent consumer (or consumer/producer) dispatch and facilitating processing efficiency.

    Suppressing cache line modification

    公开(公告)号:US12189530B2

    公开(公告)日:2025-01-07

    申请号:US18621799

    申请日:2024-03-29

    Inventor: Paul J. Moyer

    Abstract: Disclosed is a system and method for use in a cache for suppressing modification of cache line. The system and method includes a processor and a memory operating cooperatively with a cache controller. The memory includes a coherence directory stored within a cache created to track at least one cache line in the cache via the cache controller. The processor instructs a cache controller to store a first data in a cache line in the cache. The cache controller tags the cache line based on the first data. The processor instructs the cache controller to store a second data in the cache line in the cache causing eviction of the first data from the cache line. The processor compares based on the tagging the first data and the second data and suppresses modification of the cache line based on the comparing of the first data and the second data.

    HOST-TO-DEVICE INTERFACE CIRCUITRY TESTING

    公开(公告)号:US20250006290A1

    公开(公告)日:2025-01-02

    申请号:US18343377

    申请日:2023-06-28

    Inventor: Nehal Patel

    Abstract: Some implementations provide systems methods and devices for integrated circuit self-test. The integrated circuit includes interface circuitry configured to read test data and to write the test data into memory of the integrated circuit. The integrated circuit also includes test circuitry configured to test the interface circuitry based on the test data written into memory of the integrated circuit. Some implementations provide an integrated circuit configured for storing and reading data. The integrated circuit includes circuitry configured to write or read a first portion of data to or from a first memory via BIST circuitry of the first memory until a first BIST counter saturates. The integrated circuit also includes circuitry configured to write or read a second portion of the data to or from a second memory via BIST circuitry of the second memory until a second BIST counter saturates.

    RC-TUNED WORDLINE UNDERDRIVE CIRCUIT

    公开(公告)号:US20250006246A1

    公开(公告)日:2025-01-02

    申请号:US18344812

    申请日:2023-06-29

    Abstract: An apparatus and method for both reducing power consumption and increasing read access stability of a memory array. An integrated circuit includes a memory array with memory bit cells arranged as multiple rows and multiple columns. The array also includes multiple word line driver circuits configured to generate a corresponding word line for multiple rows. The array includes an underdrive circuit configured to adjust, via a configurable resistor-capacitor circuit, a rate of change of a voltage level of a word line. The configurable resistor-capacitor circuit controls the store data rate of the charging of the selected word line and allows the selected word line to charge to the power supply voltage. The configurable resistor-capacitor circuit controls the rate of charging without creating a direct current path between the power supply voltage and the ground reference level that would increase power consumption.

    CONTENT-AWARE PARTITIONING OF VIDEO FRAMES

    公开(公告)号:US20250005929A1

    公开(公告)日:2025-01-02

    申请号:US18217250

    申请日:2023-06-30

    Abstract: A portable electronic device includes a video encoder. The video encoder is configured to receive a video frame. Responsive to the video frame including at least one of a salient object or a region of interest, the video encoder is further configured to partition the video frame into one or more tiles based on a location of the at least one of the salient object or the region of interest.

    RUNNING AVERAGE CACHE HIT RATE
    40.
    发明申请

    公开(公告)号:US20250004943A1

    公开(公告)日:2025-01-02

    申请号:US18345974

    申请日:2023-06-30

    Abstract: The disclosed device includes a first register that stores a cumulative delta value and a second register that stores an average cache hit rate. The device also includes a control circuit that calculates a cache hit rate and updates the cumulative delta value based on the cache hit rate and the average cache hit rate. The control circuit also updates the average cache hit rate based on the updated cumulative delta value, and can update a cache allocation policy based on the updated average cache hit rate. Various other methods, systems, and computer-readable media are also disclosed.

Patent Agency Ranking