-
公开(公告)号:US20240354260A1
公开(公告)日:2024-10-24
申请号:US18762987
申请日:2024-07-03
Applicant: Texas Instruments Incorporated
Inventor: Timothy David Anderson , Mujibur Rahman
IPC: G06F12/1045 , G06F7/24 , G06F7/487 , G06F7/499 , G06F7/53 , G06F7/57 , G06F9/30 , G06F9/32 , G06F9/345 , G06F9/38 , G06F9/48 , G06F11/00 , G06F11/10 , G06F12/0862 , G06F12/0875 , G06F12/0897 , G06F12/1009 , G06F15/78 , G06F17/16 , H03H17/06
CPC classification number: G06F12/1045 , G06F7/24 , G06F7/487 , G06F7/4876 , G06F7/49915 , G06F7/53 , G06F7/57 , G06F9/3001 , G06F9/30014 , G06F9/30021 , G06F9/30032 , G06F9/30036 , G06F9/30065 , G06F9/30072 , G06F9/30098 , G06F9/30112 , G06F9/30145 , G06F9/30149 , G06F9/3016 , G06F9/32 , G06F9/345 , G06F9/3802 , G06F9/3818 , G06F9/383 , G06F9/3836 , G06F9/3851 , G06F9/3856 , G06F9/3867 , G06F9/3887 , G06F9/48 , G06F11/00 , G06F11/1048 , G06F12/0862 , G06F12/0875 , G06F12/0897 , G06F12/1009 , G06F17/16 , H03H17/0664 , G06F9/30018 , G06F9/325 , G06F9/381 , G06F9/3822 , G06F11/10 , G06F15/7807 , G06F15/781 , G06F2212/452 , G06F2212/60 , G06F2212/602 , G06F2212/68
Abstract: A method for sorting of a vector in a processor is provided that includes performing, by the processor in response to a vector sort instruction, sorting of values stored in lanes of the vector to generate a sorted vector, wherein the values are sorted in an order indicated by the vector sort instruction, and storing the sorted vector in a storage location.
-
公开(公告)号:US12124373B2
公开(公告)日:2024-10-22
申请号:US18185058
申请日:2023-03-16
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: David A. Roberts
IPC: G06F12/08 , G06F12/0815 , G06F12/0846 , G06F12/0862
CPC classification number: G06F12/0848 , G06F12/0815 , G06F12/0862 , G06F2212/282 , G06F2212/602 , G06F2212/621
Abstract: A method and apparatus physically partitions clean and dirty cache lines into separate memory partitions, such as one or more banks, so that during low power operation, a cache memory controller reduces power consumption of the cache memory containing the clean only data. The cache memory controller controls refresh operation so that data refresh does not occur for clean data only banks or the refresh rate is reduced for clean data only banks. Partitions that store dirty data can also store clean data, however other partitions are designated for storing only clean data so that the partitions can have their refresh rate reduced or refresh stopped for periods of time. When multiple DRAM dies or packages are employed, the partition can occur on a die or package level as opposed to a bank level within a die.
-
公开(公告)号:US12111766B2
公开(公告)日:2024-10-08
申请号:US17765485
申请日:2019-10-02
Applicant: Telefonaktiebolaget LM Ericsson (publ)
Inventor: Amir Roozbeh , Dejan Kostic , Gerald Q. Maguire, Jr. , Alireza Farshin
IPC: G06F12/08 , G06F12/06 , G06F12/0862
CPC classification number: G06F12/0862 , G06F12/06 , G06F2212/1024 , G06F2212/602
Abstract: Embodiments herein relates e.g., to a method performed by a first entity, for handling memory operations of an application in a computer environment, is provided. The first entity obtains position data associated with data of the application being fragmented into a number of positions in a physical memory. The position data indicates one or more positions of the number of positions in the physical memory. The first entity then provides, to a second entity, one or more indications of the one or more positions indicated by the position data for prefetching data from the second entity, using the one or more indications.
-
公开(公告)号:US12106104B2
公开(公告)日:2024-10-01
申请号:US17133328
申请日:2020-12-23
Applicant: Intel Corporation
Inventor: Zhe Wang , Alaa R. Alameldeen , Christopher J. Hughes
IPC: G06F9/30 , G06F12/0862 , H03M7/30
CPC classification number: G06F9/30047 , G06F9/30145 , G06F12/0862 , H03M7/30 , G06F2212/602
Abstract: A processor that includes compression instructions to compress multiple adjacent data blocks of uncompressed read-only data stored in memory into one compressed read-only data block and store the compressed read-only data block in multiple adjacent blocks in the memory is provided. During execution of an application to operate on the read-only data, one of the multiple adjacent blocks storing the compressed read-only block is read from memory, stored in a prefetch buffer and decompressed in the memory controller. In response to a subsequent request during execution of the application for an adjacent data block in the compressed read-only data block, the uncompressed adjacent block is read directly from the prefetch buffer.
-
公开(公告)号:US12099400B2
公开(公告)日:2024-09-24
申请号:US18164688
申请日:2023-02-06
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Joseph Zbiciak , Timothy D. Anderson , Duc Bui , Kai Chirca
IPC: G06F11/00 , G06F9/30 , G06F9/345 , G06F9/38 , G06F11/07 , G06F11/27 , G06F11/30 , G06F11/36 , G06F12/0862 , G06F12/0875 , G06F13/16 , G06F11/10
CPC classification number: G06F11/0772 , G06F9/30014 , G06F9/30036 , G06F9/30112 , G06F9/30145 , G06F9/345 , G06F9/383 , G06F9/3867 , G06F11/00 , G06F11/0721 , G06F11/073 , G06F11/27 , G06F11/3037 , G06F11/3648 , G06F12/0862 , G06F12/0875 , G06F13/1673 , G06F11/10 , G06F2212/452 , G06F2212/602
Abstract: This invention is a streaming engine employed in a digital signal processor. A fixed data stream sequence is specified by a control register. The streaming engine fetches stream data ahead of use by a central processing unit and stores it in a stream buffer. Upon occurrence of a fault reading data from memory, the streaming engine identifies the data element triggering the fault preferably storing this address in a fault address register. The streaming engine defers signaling the fault to the central processing unit until this data element is used as an operand. If the data element is never used by the central processing unit, the streaming engine never signals the fault. The streaming engine preferably stores data identifying the fault in a fault source register. The fault address register and the fault source register are preferably extended control registers accessible only via a debugger.
-
公开(公告)号:US12079127B2
公开(公告)日:2024-09-03
申请号:US17208825
申请日:2021-03-22
Applicant: CoreSecure Technologies, LLC
Inventor: Ruby B. Lee , Fangfei Liu
IPC: G06F12/08 , G06F12/0802 , G06F12/0862 , G06F12/0868 , G06F12/14
CPC classification number: G06F12/0862 , G06F12/0802 , G06F12/0868 , G06F12/14 , G06F2212/1052 , G06F2212/602
Abstract: Systems and methods for random fill caching and prefetching for secure cache memories are provided. The system dynamically de-correlates fetching a cache line to the processor from filling the cache with this cache line, due to a demand memory access, in order to provide greater security from information leakage due to cache side-channel attacks on cache memories. The system includes a random fill engine which includes a random number generator and an adjustable random fill window. Also provided is an adaptive random fill caching system which dynamically adapts the random fill window to a wide variety of computational workloads. Systems and methods for cache prefetching to improve system performance using adaptive random fill prefetching and random fill prefetching are also provided.
-
公开(公告)号:US12072812B2
公开(公告)日:2024-08-27
申请号:US17237391
申请日:2021-04-22
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Timothy D. Anderson , Joseph Zbiciak , Duc Quang Bui , Abhijeet Chachad , Kai Chirca , Naveen Bhoria , Matthew D. Pierson , Daniel Wu , Ramakrishnan Venkatasubramanian
IPC: G06F9/30 , G06F7/24 , G06F7/487 , G06F7/499 , G06F7/53 , G06F7/57 , G06F9/32 , G06F9/345 , G06F9/38 , G06F9/48 , G06F11/00 , G06F11/10 , G06F12/0862 , G06F12/0875 , G06F12/0897 , G06F12/1009 , G06F12/1045 , G06F17/16 , H03H17/06 , G06F15/78
CPC classification number: G06F12/1045 , G06F7/24 , G06F7/487 , G06F7/4876 , G06F7/49915 , G06F7/53 , G06F7/57 , G06F9/3001 , G06F9/30014 , G06F9/30021 , G06F9/30032 , G06F9/30036 , G06F9/30065 , G06F9/30072 , G06F9/30098 , G06F9/30112 , G06F9/30145 , G06F9/30149 , G06F9/3016 , G06F9/32 , G06F9/345 , G06F9/3802 , G06F9/3818 , G06F9/383 , G06F9/3836 , G06F9/3851 , G06F9/3856 , G06F9/3867 , G06F9/3887 , G06F9/48 , G06F11/00 , G06F11/1048 , G06F12/0862 , G06F12/0875 , G06F12/0897 , G06F12/1009 , G06F17/16 , H03H17/0664 , G06F9/30018 , G06F9/325 , G06F9/381 , G06F9/3822 , G06F11/10 , G06F15/7807 , G06F15/781 , G06F2212/452 , G06F2212/60 , G06F2212/602 , G06F2212/68
Abstract: Disclosed embodiments include an electronic device having a processor core, a memory, a register, and a data load unit to receive a plurality of data elements stored in the memory in response to an instruction. All of the data elements hare the same data size, which is specified by one or more coding bits. The data load unit includes an address generator to generate addresses corresponding to locations in the memory at which the data elements are located, and a formatting unit to format the data elements. The register is configured to store the formatted data elements, and the processor core is configured to receive the formatted data elements from the register.
-
公开(公告)号:US20240256456A1
公开(公告)日:2024-08-01
申请号:US18391346
申请日:2023-12-20
Applicant: Intel Corporation
Inventor: Vikranth Vemulapalli , Lakshminarayanan Striramassarma , Mike MacPherson , Aravindh Anantaraman , Ben Ashbaugh , Murali Ramadoss , William B. Sadler , Jonathan Pearce , Scott Janus , Brent Insko , Vasanth Ranganathan , Kamal Sinha , Arthur Hunter, Jr. , Prasoonkumar Surti , Nicolas Galoppo von Borries , Joydeep Ray , Abhishek R. Appu , ElMoustapha Ould-Ahmed-Vall , Altug Koker , Sungye Kim , Subramaniam Maiyuran , Valentin Andrei
IPC: G06F12/0862 , G06T1/20 , G06T1/60
CPC classification number: G06F12/0862 , G06T1/20 , G06T1/60 , G06F2212/602 , G06F2212/608
Abstract: Embodiments are generally directed to data prefetching for graphics data processing. An embodiment of an apparatus includes one or more processors including one or more graphics processing units (GPUs); and a plurality of caches to provide storage for the one or more GPUs, the plurality of caches including at least an L1 cache and an L3 cache, wherein the apparatus to provide intelligent prefetching of data by a prefetcher of a first GPU of the one or more GPUs including measuring a hit rate for the Li cache; upon determining that the hit rate for the L1 cache is equal to or greater than a threshold value, limiting a prefetch of data to storage in the L3 cache, and upon determining that the hit rate for the L1 cache is less than a threshold value, allowing the prefetch of data to the L1 cache.
-
9.
公开(公告)号:US12045170B2
公开(公告)日:2024-07-23
申请号:US17545121
申请日:2021-12-08
Applicant: Arm Limited
Inventor: Alexander Cole Shulyak , Karthik Sundaram
IPC: G06F12/08 , G06F12/0862 , G06N20/00
CPC classification number: G06F12/0862 , G06N20/00 , G06F2212/602
Abstract: Prefetch generation circuitry generates requests to prefetch data to a cache, where the prefetch generation circuitry is configured to initiate a producer prefetch to request return of producer data having a producer address and to initiate at least one consumer prefetch to request prefetching of consumer data to the cache, the consumer data having an address derived from the producer data returned in response to the producer prefetch. Training circuitry updates, based on executed load operations, a training table indicating candidate producer-consumer relationships being trained for use by the prefetch generation circuitry in generating the producer/consumer prefetches. Replacement control circuitry controls replacement of candidate producer-consumer relationships based on a producer-data-consumer-operand (PD-CO) match-based replacement policy criterion, which depends on whether a PD-CO match condition, indicative of the producer data for a producer load matching an address operand of a consumer load, is satisfied for existing/new candidate producer-consumer relationships.
-
公开(公告)号:US20240176742A1
公开(公告)日:2024-05-30
申请号:US18059076
申请日:2022-11-28
Applicant: QUALCOMM Incorporated
Inventor: Suryanarayana Murthy Durbhakula
IPC: G06F12/0862 , G06F12/0804 , G06F12/123
CPC classification number: G06F12/0862 , G06F12/0804 , G06F12/123 , G06F2212/1024 , G06F2212/602
Abstract: Providing memory region prefetching in processor-based devices is disclosed. In some aspects, a processor-based device comprises a region prefetcher circuit that comprises a plurality of access bitmaps corresponding to a plurality of contiguous memory regions of a system memory device. Each access bitmap comprises a plurality of bits corresponding to a plurality of memory blocks of a contiguous memory region. The region prefetcher circuit detects a memory access request to a memory block of a contiguous memory region, identifies an access bitmap corresponding to the contiguous memory region, and identifies a bit corresponding to the memory block. The region prefetcher circuit sets the bit to indicate the memory access request to the memory block. The region prefetcher circuit subsequently detects a prefetch trigger event, and, in response, identifies one or more unset bits of the access bitmap, and prefetches one or more memory blocks corresponding to the unset bits.
-
-
-
-
-
-
-
-
-