Cache support for indirect loads and indirect stores in graph applications

    公开(公告)号:US12204901B2

    公开(公告)日:2025-01-21

    申请号:US17359305

    申请日:2021-06-25

    Abstract: Techniques for operating on an indirect memory access instruction, where the instruction accesses a memory location via at least one indirect address. A pipeline processes the instruction and a memory operation engine generates a first access to the at least one indirect address and a second access to a target address determined by the at least one indirect address. A cache memory used with the pipeline and the memory operation engine caches pointers. In response to a cache hit when executing the indirect memory access instruction, operations dereference a pointer to obtain the at least one indirect address, not set a cache bit, and return data for the instruction without storing the data in the cache memory; and in response to a cache miss, operations set the cache bit, obtain, and store a cache line for a missed pointer, and return data without storing the data in the cache memory.

    MICRO LIGHT EMITTING DIODE WITH METASURFACE REFLECTORS

    公开(公告)号:US20250160059A1

    公开(公告)日:2025-05-15

    申请号:US18508617

    申请日:2023-11-14

    Abstract: A micro-LED includes a light emitter between a pair of reflective metasurfaces formed from nanostructures. The metasurfaces have different levels of reflectivity, with one metasurface reflecting nearly all light, and the other metasurface allowing some light to pass through. The reflections of the light within the micro-LED result in an improved radiation recombination rate, which results in an increased modulation speed. In addition, the light emitted from the micro-LED has a relatively narrow divergence angle and narrow emission linewidth, making the micro-LED suitable for optical communications.

    PHOTONIC INTEGRATED CIRCUIT PACKAGES WITH SCALABLE HETEROGENEOUS INTEGRATION

    公开(公告)号:US20240355980A1

    公开(公告)日:2024-10-24

    申请号:US18305525

    申请日:2023-04-24

    CPC classification number: H01L33/62 H01L33/58

    Abstract: An IC package may include a stack of microelectronic units capable of horizontal and vertical optical communications. A microelectronic unit includes one or more power delivery pillars, two light source layers, an optical interconnect layer between the light source layers, and one or more IC devices arranged on the optical interconnect layer. A light source layer includes micro-LEDs that emit light used for generating optical signals. The optical interconnect layer includes one or more optical interconnects that enable horizontal optical communication, e.g., transmission of optical signals between the IC devices. A light source layer in the microelectronic unit can facilitate optical communications with another microelectronic unit that is below or above the microelectronic unit. A channel may exist above or below the light source layer to promote dissipation of heat generated by the IC devices. Light from the light source layer may pass through the channel for vertical optical communication.

    Array broadcast and reduction systems and methods

    公开(公告)号:US10983793B2

    公开(公告)日:2021-04-20

    申请号:US16369846

    申请日:2019-03-29

    Abstract: The present disclosure is directed to systems and methods of performing one or more broadcast or reduction operations using direct memory access (DMA) control circuitry. The DMA control circuitry executes a modified instruction set architecture (ISA) that facilitates the broadcast distribution of data to a plurality of destination addresses in system memory circuitry. The broadcast instruction may include broadcast of a single data value to each destination address. The broadcast instruction may include broadcast of a data array to each destination address. The DMA control circuitry may also execute a reduction instruction that facilitates the retrieval of data from a plurality of source addresses in system memory and performing one or more operations using the retrieved data. Since the DMA control circuitry, rather than the processor circuitry performs the broadcast and reduction operations, system speed and efficiency is beneficially enhanced.

    Circuitry and methods for direct memory access instruction set architecture support for flexible dense compute using a reconfigurable spatial array

    公开(公告)号:US12158852B2

    公开(公告)日:2024-12-03

    申请号:US17358832

    申请日:2021-06-25

    Abstract: Systems, methods, and apparatuses for direct memory access instruction set architecture support for flexible dense compute using a reconfigurable spatial array are described. In one embodiment, a processor includes a first type of hardware processor core that includes a two-dimensional grid of compute circuits, a memory, and a direct memory access circuit coupled to the memory and the two-dimensional grid of compute circuits; and a second different type of hardware processor core that includes a decoder circuit to decode a single instruction into a decoded single instruction, the single instruction including a first field to identify a base address of two-dimensional data in the memory, a second field to identify a number of elements in each one-dimensional array of the two-dimensional data, a third field to identify a number of one-dimensional arrays of the two-dimensional data, a fourth field to identify an operation to be performed by the two-dimensional grid of compute circuits, and a fifth field to indicate the direct memory access circuit is to move the two-dimensional data indicated by the first field, the second field, and the third field into the two-dimensional grid of compute circuits and the two-dimensional grid of compute circuits is to perform the operation on the two-dimensional data according to the fourth field, and an execution circuit to execute the decoded single instruction according to the fields.

    CACHE SUPPORT FOR INDIRECT LOADS AND INDIRECT STORES IN GRAPH APPLICATIONS

    公开(公告)号:US20220413855A1

    公开(公告)日:2022-12-29

    申请号:US17359305

    申请日:2021-06-25

    Abstract: Techniques for operating on an indirect memory access instruction, where the instruction accesses a memory location via at least one indirect address. A pipeline processes the instruction and a memory operation engine generates a first access to the at least one indirect address and a second access to a target address determined by the at least one indirect address. A cache memory used with the pipeline and the memory operation engine caches pointers. In response to a cache hit when executing the indirect memory access instruction, operations dereference a pointer to obtain the at least one indirect address, not set a cache bit, and return data for the instruction without storing the data in the cache memory; and in response to a cache miss, operations set the cache bit, obtain, and store a cache line for a missed pointer, and return data without storing the data in the cache memory.

    ARRAY BROADCAST AND REDUCTION SYSTEMS AND METHODS

    公开(公告)号:US20200310795A1

    公开(公告)日:2020-10-01

    申请号:US16369846

    申请日:2019-03-29

    Abstract: The present disclosure is directed to systems and methods of performing one or more broadcast or reduction operations using direct memory access (DMA) control circuitry. The DMA control circuitry executes a modified instruction set architecture (ISA) that facilitates the broadcast distribution of data to a plurality of destination addresses in system memory circuitry. The broadcast instruction may include broadcast of a single data value to each destination address. The broadcast instruction may include broadcast of a data array to each destination address. The DMA control circuitry may also execute a reduction instruction that facilitates the retrieval of data from a plurality of source addresses in system memory and performing one or more operations using the retrieved data. Since the DMA control circuitry, rather than the processor circuitry performs the broadcast and reduction operations, system speed and efficiency is beneficially enhanced.

Patent Agency Ranking