-
91.
公开(公告)号:US11797201B2
公开(公告)日:2023-10-24
申请号:US17745278
申请日:2022-05-16
CPC分类号: G06F3/0631 , G06F3/0604 , G06F3/0673 , G06F12/0207 , G06F12/0223 , G06F12/0607
摘要: Approaches are provided for implementing hardware-software collaborative address mapping schemes that enable mapping data elements which are accessed together in the same row of one bank or over the same rows of different banks to achieve higher performance by reducing row conflicts. Using an intra-bank frame striping policy (IBFS), corresponding subsets of data elements are interleaved into a single row of a bank. Using an intra-channel frame striping policy (ICFS), corresponding subsets of data elements are interleaved into a single channel row of a channel. A memory controller utilizes ICFS and/or IBFS to efficiently store and access data elements in memory, such as processing-in-memory (PIM) enabled memory.
-
公开(公告)号:US11742259B2
公开(公告)日:2023-08-29
申请号:US17353115
申请日:2021-06-21
发明人: Manish Arora , Nuwan Jayasena
IPC分类号: H01L23/427 , H05K1/02 , H01L23/00 , H05K3/46 , H05K3/34
CPC分类号: H01L23/4275 , H05K1/0203 , H01L24/13 , H01L24/16 , H01L2224/131 , H01L2224/16225 , H01L2924/15192 , H01L2924/15311 , H01L2924/15312 , H01L2924/15313 , H01L2924/19041 , H01L2924/19042 , H01L2924/19043 , H01L2924/19105 , H05K3/3436 , H05K3/4697 , H01L2224/131 , H01L2924/014
摘要: Various circuit board embodiments are disclosed. In one aspect, an apparatus is provided that includes a circuit board and a first phase change material pocket positioned on or in the circuit board and contacting a surface of the circuit board.
-
公开(公告)号:US20230095461A1
公开(公告)日:2023-03-30
申请号:US17491478
申请日:2021-09-30
发明人: Nuwan Jayasena
IPC分类号: G06F12/0802
摘要: An electronic device includes a cache, a memory, and a controller. The controller stores an epoch counter value in metadata for a location in the memory when a cache block evicted from the cache is stored in the location. The controller also controls how the cache block is retained in the cache based at least in part on the epoch counter value when the cache block is subsequently retrieved from the location and stored in the cache.
-
公开(公告)号:US20230004491A1
公开(公告)日:2023-01-05
申请号:US17364854
申请日:2021-06-30
发明人: Shaizeen Aga , Nuwan Jayasena
IPC分类号: G06F12/084 , G06F12/0811 , G06F12/0862 , G06F12/02
摘要: A technical solution to the technical problem of how to reduce the undesirable side effects of offloading computations to memory uses read hints to preload results of memory-side processing into a processor-side cache. A cache controller, in response to identifying a read hint in a memory-side processing instruction, causes results of the memory-side processing to be preloaded into a processor-side cache. Implementations include, without limitation, enabling or disabling the preloading based upon cache thrashing levels, preloading results, or portions of results, of memory-side processing to particular destination caches, preloading results based upon priority and/or degree of confidence, and/or during periods of low data bus and/or command bus utilization, last stores considerations, and enforcing an ordering constraint to ensure that preloading occurs after memory-side processing results are complete.
-
公开(公告)号:US20220239315A1
公开(公告)日:2022-07-28
申请号:US17722931
申请日:2022-04-18
摘要: A data processing platform, method, and program product perform compression and decompression of a set of data items. Suffix data and a prefix are selected for each respective data item in the set of data items based on data content of the respective data item. The set of data items is sorted based on the prefixes. The prefixes are encoded by querying multiple encoding tables to create a code word containing compressed information representing values of all prefixes for the set of data items. The code word and suffix data for each of the data items are stored in memory. The code word is decompressed to recover the prefixes. The recovered prefixes are paired with their respective suffix data.
-
公开(公告)号:US20220206899A1
公开(公告)日:2022-06-30
申请号:US17133843
申请日:2020-12-24
摘要: Methods and processing devices are provided for error protection to support instruction replay for executing idempotent instructions at a processing in memory PIM device. The processing apparatus includes a PIM device configured to execute an idempotent instruction. The processing apparatus also includes a processor, in communication with the PIM device, configured to issue the idempotent instruction to the PIM device for execution at the PIM device and reissue the idempotent instruction to the PIM device when one of execution of the idempotent instruction at the PIM device results in an error and a predetermined latency period expires from when the idempotent instruction is issued.
-
公开(公告)号:US20220206839A1
公开(公告)日:2022-06-30
申请号:US17135381
申请日:2020-12-28
发明人: Jonathan Alsop , Shaizeen Aga , Nuwan Jayasena
摘要: An Address Mapping-Aware Tasking (AMAT) mechanism manages compute task data and issues compute tasks on behalf of threads that created the compute task data. The AMAT mechanism stores compute task data generated by host threads in a set of partitions, where each partition is designated for a particular memory module. The AMAT mechanism maintains address mapping data that maps address information to partitions. Threads push compute task data to the AMAT mechanism instead of generating and issuing their own compute tasks. The AMAT mechanism uses address information included in the compute task data and the address mapping data to determine partitions in which to store the compute task data. The AMAT mechanism then issues compute tasks to be executed near the corresponding memory modules (i.e., in PIM execution units or NUMA compute nodes) based upon the compute task data stored in the partitions.
-
98.
公开(公告)号:US20220066662A1
公开(公告)日:2022-03-03
申请号:US17006646
申请日:2020-08-28
IPC分类号: G06F3/06
摘要: Approaches are provided for implementing hardware-software collaborative address mapping schemes that enable mapping data elements which are accessed together in the same row of one bank or over the same rows of different banks to achieve higher performance by reducing row conflicts. Using an intra-bank frame striping policy (IBFS), corresponding subsets of data elements are interleaved into a single row of a bank. Using an intra-channel frame striping policy (ICFS), corresponding subsets of data elements are interleaved into a single channel row of a channel. A memory controller utilizes ICFS and/or IBFS to efficiently store and access data elements in memory, such as processing-in-memory (PIM) enabled memory.
-
公开(公告)号:US20210373805A1
公开(公告)日:2021-12-02
申请号:US16885677
申请日:2020-05-28
发明人: Johnathan Alsop , Shaizeen Aga , Nuwan Jayasena
IPC分类号: G06F3/06
摘要: An approach is provided for reducing command bus traffic between memory controllers and PIM-enabled memory modules using special PIM commands. The term “special PIM command” is used herein to describe embodiments and refers to a PIM command for which the corresponding module-specific command information is provided to memory modules via a non-command bus data path. A memory controller generates and issues a special PIM command to multiple PIM-enabled memory modules via a command bus and provides module-specific command information (e.g., address information) for the special PIM command to the PIM-enabled memory modules via the non-command bus data path that is shared by the PIM-enabled memory modules and the memory controller.
-
公开(公告)号:US11119923B2
公开(公告)日:2021-09-14
申请号:US15440979
申请日:2017-02-23
IPC分类号: G06F3/06 , G06F12/00 , G06F13/00 , G06F12/0813 , G06F12/0837 , G06F12/0811
摘要: A cache coherence technique for operating a multi-processor system including shared memory includes allocating a cache line of a cache memory of a processor to a memory address in the shared memory in response to execution of an instruction of a program executing on the processor. The technique includes encoding a shared information state of the cache line to indicate whether the memory address is a shared memory address shared by the processor and a second processor, or a private memory address private to the processor, in response to whether the instruction is included in a critical section of the program, the critical section being a portion of the program that confines access to shared, writeable data.
-
-
-
-
-
-
-
-
-