Cache access measurement deskew
    172.
    发明授权

    公开(公告)号:US11880310B2

    公开(公告)日:2024-01-23

    申请号:US17553044

    申请日:2021-12-16

    CPC classification number: G06F12/12 G06F2212/601

    Abstract: A processor includes a cache having two or more test regions and a larger non-test region. The processor further includes a cache controller that applies different cache replacement policies to the different test regions of the cache, and a performance monitor that measures performance metrics for the different test regions, such as a cache hit rate at each test region. Based on the performance metrics, the cache controller selects a cache replacement policy for the non-test region, such as selecting the replacement policy associated with the test region having the better performance metrics among the different test regions. The processor deskews the memory access measurements in response to a difference in the amount of accesses to the different test regions exceeding a threshold.

    Management of thrashing in a GPU
    174.
    发明授权

    公开(公告)号:US11875197B2

    公开(公告)日:2024-01-16

    申请号:US17136738

    申请日:2020-12-29

    CPC classification number: G06F9/52 G06F9/30141 G06F9/3836 G06T1/20

    Abstract: Systems, apparatuses, and methods for managing a number of wavefronts permitted to concurrently execute in a processing system. An apparatus includes a register file with a plurality of registers and a plurality of compute units configured to execute wavefronts. A control unit of the apparatus is configured to allow a first number of wavefronts to execute concurrently on the plurality of compute units. The control unit is configured to allow no more than a second number of wavefronts to execute concurrently on the plurality of compute units, wherein the second number is less than the first number, in response to detection that thrashing of the register file is above a threshold. The control unit is configured to detect said thrashing based at least in part on a number of registers in use by executing wavefronts that spill to memory.

    OVERSTRESS DESIGN FOR VERIFICATION
    176.
    发明公开

    公开(公告)号:US20240012970A1

    公开(公告)日:2024-01-11

    申请号:US17861623

    申请日:2022-07-11

    CPC classification number: G06F30/3308 G06F2119/18

    Abstract: Techniques for implementing an overstress design for verification that reduce production and verification time by enabling a verification system to perform verification of components of a circuit design selectively, accurately, and exhaustively under extreme stress scenarios are disclosed. Circuit nodes in an emulation model are selected and overstress is provided to the nodes such that behavior of the circuit under such extreme stress scenarios is readily observable, enabling designers to produce circuits that are more secure, reliable, and resilient in case of failures. Overstress is provided to the node to enable verification of the emulation model without having to design complex test signal representations to produce extreme stress conditions. A request for manufacture is generated including aspects of the emulation model to enable verification of a fabricated circuit in a similar or identical manner to those used to verify the emulation model.

    Lock address contention predictor
    178.
    发明授权

    公开(公告)号:US11868818B2

    公开(公告)日:2024-01-09

    申请号:US15273304

    申请日:2016-09-22

    CPC classification number: G06F9/52 G06F9/50

    Abstract: Techniques for selectively executing a lock instruction speculatively or non-speculatively based on lock address prediction and/or temporal lock prediction. including methods an devices for locking an entry in a memory device. In some techniques, a lock instruction executed by a thread for a particular memory entry of a memory device is detected. Whether contention occurred for the particular memory entry during an earlier speculative lock is detected on a condition that the lock instruction comprises a speculative lock instruction. The lock is executed non-speculatively if contention occurred for the particular memory entry during an earlier speculative lock. The lock is executed speculatively if contention did not occur for the particular memory entry during an earlier speculative lock.

    CHANNEL AND SUB-CHANNEL THROTTLING FOR MEMORY CONTROLLERS

    公开(公告)号:US20240005971A1

    公开(公告)日:2024-01-04

    申请号:US17853418

    申请日:2022-06-29

    CPC classification number: G11C7/222 G11C7/1069 G11C7/1096 G11C7/109

    Abstract: An arbiter is operable to pick commands from a command queue for dispatch to a memory. The arbiter includes a traffic throttle circuit for mitigating excess power usage increases in coordination with one or more additional arbiters. The traffic throttle circuit includes a monitoring circuit and a throttle circuit. The monitoring circuit is for measuring a number of read and write commands picked by the arbiter and the one or more additional arbiters over a first predetermined period of time. The throttle circuit, responsive to a low activity state, limits a number of read and write commands issued by the arbiter during a second predetermined period of time.

Patent Agency Ranking