Split read port latch array bit cell

    公开(公告)号:US12033721B2

    公开(公告)日:2024-07-09

    申请号:US17359446

    申请日:2021-06-25

    CPC classification number: G11C8/16 G06F30/392 G11C11/418 G11C11/419

    Abstract: An apparatus and method for providing efficient floor planning, power, and performance tradeoffs of memory accesses. Adjacent bit cells in a column of an array use a split read port such that the bit cells do not share a read bit line while sharing a write bit line. The adjacent bit cells include asymmetrical read access circuits that convey data stored by latch circuitry of a corresponding bit cell to a corresponding read bit line. The layout of adjacent bit cells provides a number of contacted gate pitches per bit cell that is less than a sum of the maximum number of metal gates in layout of each of the adjacent bit cells divided by the number of adjacent bit cells.

    Register compaction with early release

    公开(公告)号:US12033238B2

    公开(公告)日:2024-07-09

    申请号:US17030852

    申请日:2020-09-24

    CPC classification number: G06T1/60 G06T1/20

    Abstract: Systems, apparatuses, and methods for implementing register compaction with early release are disclosed. A processor includes at least a command processor, a plurality of compute units, a plurality of registers, and a control unit. Registers are statically allocated to wavefronts by the control unit when wavefronts are launched by the command processor on the compute units. In response to determining that a first set of registers, previously allocated to a first wavefront, are no longer needed, the first wavefront executes an instruction to release the first set of registers. The control unit detects the executed instruction and releases the first set of registers to the available pool of registers to potentially be used by other wavefronts. Then, the control unit can allocate the first set of registers to a second wavefront for use by threads of the second wavefront while the first wavefront is still active.

    Method and apparatus for predicting kernel tuning parameters

    公开(公告)号:US12033035B2

    公开(公告)日:2024-07-09

    申请号:US16560954

    申请日:2019-09-04

    CPC classification number: G06N20/00 G06F11/3409 G06N3/02

    Abstract: A processing device, which improves processing performance, is provided which comprises memory configured to store data and a processor, in communication with the memory. The processor is configured to receive tuning parameters, each having a numeric value, for executing a portion of a program on an identified hardware device and convert the numeric values of the tuning parameters to words. The processor is also configured to predict, using one or more machine language learning algorithms, which combination of the words to execute the portion of the program on the identified hardware device based on performance efficiency and convert the predicted combination of the words to corresponding numeric values for executing the portion of the program on the identified hardware device.

    Processing Element-Centric All-to-All Communication

    公开(公告)号:US20240220336A1

    公开(公告)日:2024-07-04

    申请号:US18147081

    申请日:2022-12-28

    CPC classification number: G06F9/54 G06F9/5044 G06F15/17356

    Abstract: In accordance with described techniques for PE-centric all-to-all communication, a distributed computing system includes processing elements, such as graphics processing units, distributed in clusters. An all-to-all communication procedure is performed by the processing elements that are each configured to generate data packets in parallel for all-to-all data communication between the clusters. The all-to-all communication procedure includes a first stage of intra-cluster parallel data communication between respective processing elements of each of the clusters; a second stage of inter-cluster data exchange for all-to-all data communication between the clusters; and a third stage of intra-cluster data distribution to the respective processing elements of each of the clusters.

    INTERRUPT CONTROL USING A GUEST OWNED BACKING PAGE

    公开(公告)号:US20240220297A1

    公开(公告)日:2024-07-04

    申请号:US18090740

    申请日:2022-12-29

    Abstract: Techniques for implementing programmable control by a guest virtual machine (VM) of interrupts at a processing system using a guest owned backing page are disclosed. The VM programs a guest owned backing page (e.g., a data structure in memory) that designates particular interrupts that are to be blocked. In response to detecting a designated interrupt, system hardware or software blocks the interrupt, rather than executing an interrupt handler to process the interrupt. The VM is thereby able to protect confidential information and program behavior with less risk of a malicious hypervisor failing to protect the VM from, e.g., unexpected or unwanted interrupts, thereby improving overall system security and predictability.

    Permute Instructions for Register-Based Lookups

    公开(公告)号:US20240220247A1

    公开(公告)日:2024-07-04

    申请号:US18148873

    申请日:2022-12-30

    CPC classification number: G06F9/30127 G06F9/30134

    Abstract: Permute instructions for register-based lookups is described. In accordance with the described techniques, a processor is configured to perform a register-based lookup by retrieving a first result from a first lookup table based on a subset of bits included in an index of a destination register, retrieving a second result from a second lookup table based on the subset of bits included in the index of the destination register, selecting the first result or the second result based on a bit in the index of the destination register that is excluded from the subset of bits, and overwriting data included in the index of the destination register using a selected one of the first result or the second result.

Patent Agency Ranking