-
公开(公告)号:US11269774B2
公开(公告)日:2022-03-08
申请号:US16601947
申请日:2019-10-15
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Kai Chirca , Timothy David Anderson
IPC: G06F12/084 , G06F12/0811 , G06F12/1009 , G06F12/0875 , G06F12/10 , G06F13/16 , G06F13/40 , G06F12/0855 , G06F12/06 , G06F12/0817 , G06F12/0831 , G06F13/12 , G06F3/06 , G06F12/0815 , H03M13/01 , H03M13/09 , H03M13/15 , H03M13/27 , G06F9/30 , G06F9/38 , G06F9/48 , G06F9/50 , G06F12/0891 , G06F12/0846 , G06F12/0862
Abstract: Techniques for maintaining cache coherency comprising storing data blocks associated with a main process in a cache line of a main cache memory, storing a first local copy of the data blocks in a first local cache memory of a first processor, storing a second local copy of the set of data blocks in a second local cache memory of a second processor executing a first child process of the main process to generate first output data, writing the first output data to the first data block of the first local copy as a write through, writing the first output data to the first data block of the main cache memory as a part of the write through, transmitting an invalidate request to the second local cache memory, marking the second local copy of the set of data blocks as delayed, and transmitting an acknowledgment to the invalidate request.
-
公开(公告)号:US11256508B2
公开(公告)日:2022-02-22
申请号:US16420467
申请日:2019-05-23
Applicant: Texas Instruments Incorporated
Inventor: Asheesh Bhardwaj , William Franklin Leven , Son Hung Tran , Timothy David Anderson
IPC: G06F9/318 , G06F9/34 , G06F11/00 , G06F12/0875 , G06F17/16 , G06F9/30 , G06F9/38 , G06F11/10 , G06F9/32 , G06F12/0897 , G06F9/345
Abstract: Software instructions are executed on a processor within a computer system to configure a steaming engine with stream parameters to define a multidimensional array. The stream parameters define a size for each dimension of the multidimensional array, a null vector count (N), and a selected dimension. Data is fetched from a memory coupled to the streaming engine responsive to the stream parameters. A stream of vectors is formed for the multidimensional array responsive to the stream parameters from the data fetched from memory. N null stream vectors are inserted into the stream of vectors for the selected dimension without fetching respective null data from the memory.
-
公开(公告)号:US20220027275A1
公开(公告)日:2022-01-27
申请号:US17492776
申请日:2021-10-04
Applicant: Texas Instruments Incorporated
Inventor: Abhijeet Ashok Chachad , Timothy David Anderson , Kai Chirca , David Matthew Thompson
IPC: G06F12/0842 , G06F12/0811 , G06F12/0888 , G06F1/14 , G06F9/54
Abstract: In described examples, a processor system includes a processor core generating memory transactions, a lower level cache memory with a lower memory controller, and a higher level cache memory with a higher memory controller having a memory pipeline. The higher memory controller is connected to the lower memory controller by a bypass path that skips the memory pipeline. The higher memory controller: determines whether a memory transaction is a bypass write, which is a memory write request indicated not to result in a corresponding write being directed to the higher level cache memory; if the memory transaction is determined a bypass write, determines whether a memory transaction that prevents passing is in the memory pipeline; and if no transaction that prevents passing is determined to be in the memory pipeline, sends the memory transaction to the lower memory controller using the bypass path.
-
264.
公开(公告)号:US20210406190A1
公开(公告)日:2021-12-30
申请号:US17472811
申请日:2021-09-13
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Naveen Bhoria , Timothy David Anderson , Pete Michael Hippleheuser
IPC: G06F12/0888 , G06F12/0891 , G06F9/54 , G06F12/02 , G06F12/0811 , G06F12/128 , G06F12/0817 , G06F12/0804 , G06F9/30 , G11C7/10 , G11C29/42 , G11C29/44 , G06F11/10 , G06F12/0855 , G06F12/12 , G06F12/0806 , G06F12/0815 , G06F12/0853 , G06F13/16 , G06F12/121 , G06F12/0884 , G06F12/0897 , G06F12/0895 , G06F12/0864 , G11C7/22 , G11C5/06
Abstract: Methods, apparatus, systems and articles of manufacture to facilitate atomic compare and swap in cache for a coherent level 1 data cache system are disclosed. An example system includes a cache storage; a cache controller coupled to the cache storage wherein the cache controller is operable to: receive a memory operation that specifies a key, a memory address, and a first set of data; retrieve a second set of data corresponding to the memory address; compare the second set of data to the key; based on the second set of data corresponding to the key, cause the first set of data to be stored at the memory address; and based on the second set of data not corresponding to the key, complete the memory operation without causing the first set of data to be stored at the memory address.
-
公开(公告)号:US20210357218A1
公开(公告)日:2021-11-18
申请号:US17387260
申请日:2021-07-28
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Timothy David Anderson , Mujibur Rahman
IPC: G06F9/30 , G06F17/16 , G06F9/38 , G06F7/487 , G06F7/499 , G06F7/24 , H03H17/06 , G06F7/53 , G06F9/48 , G06F7/57
Abstract: A method for sorting of a vector in a processor is provided that includes performing, by the processor in response to a vector sort instruction, sorting of values stored in lanes of the vector to generate a sorted vector, wherein the values are sorted in an order indicated by the vector sort instruction, and storing the sorted vector in a storage location.
-
公开(公告)号:US11138117B2
公开(公告)日:2021-10-05
申请号:US16879264
申请日:2020-05-20
Applicant: Texas Instruments Incorporated
Inventor: Abhijeet Ashok Chachad , Timothy David Anderson , Kai Chirca , David Matthew Thompson
IPC: G06F12/00 , G06F12/0842 , G06F12/0811 , G06F12/0888 , G06F1/14 , G06F9/54
Abstract: In described examples, a processor system includes a processor core generating memory transactions, a lower level cache memory with a lower memory controller, and a higher level cache memory with a higher memory controller having a memory pipeline. The higher memory controller is connected to the lower memory controller by a bypass path that skips the memory pipeline. The higher memory controller: determines whether a memory transaction is a bypass write, which is a memory write request indicated not to result in a corresponding write being directed to the higher level cache memory; if the memory transaction is determined a bypass write, determines whether a memory transaction that prevents passing is in the memory pipeline; and if no transaction that prevents passing is determined to be in the memory pipeline, sends the memory transaction to the lower memory controller using the bypass path.
-
267.
公开(公告)号:US11119935B2
公开(公告)日:2021-09-14
申请号:US16882272
申请日:2020-05-22
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Naveen Bhoria , Timothy David Anderson , Pete Michael Hippleheuser
IPC: G06F12/08 , G06F12/02 , G06F12/0893 , G06F12/0888 , G06F12/0891 , G06F9/54 , G06F12/0811 , G06F12/128 , G06F12/0817 , G06F12/0804 , G06F9/30 , G11C7/10 , G11C29/42 , G11C29/44 , G06F11/10 , G06F12/0855 , G06F12/12 , G06F12/0806 , G06F12/0815 , G06F12/0853 , G06F13/16 , G06F12/121 , G06F12/0884 , G06F12/0897 , G06F12/0895 , G06F12/0864 , G11C7/22 , G11C5/06
Abstract: Methods, apparatus, systems and articles of manufacture to facilitate atomic compare and swap in cache for a coherent level 1 data cache system are disclosed. An example system includes a cache storage; a cache controller coupled to the cache storage wherein the cache controller is operable to: receive a memory operation that specifies a key, a memory address, and a first set of data; retrieve a second set of data corresponding to the memory address; compare the second set of data to the key; based on the second set of data corresponding to the key, cause the first set of data to be stored at the memory address; and based on the second set of data not corresponding to the key, complete the memory operation without causing the first set of data to be stored at the memory address.
-
268.
公开(公告)号:US11086967B2
公开(公告)日:2021-08-10
申请号:US15907356
申请日:2018-02-28
Applicant: Texas Instruments Incorporated
Inventor: Arthur John Redfern , Timothy David Anderson , Kai Chirca , Chenchi Luo , Zhenhua Yu
Abstract: A method for performing a fundamental computational primitive in a device is provided, where the device includes a processor and a matrix multiplication accelerator (MMA). The method includes configuring a streaming engine in the device to stream data for the fundamental computational primitive from memory, configuring the MMA to format the data, and executing the fundamental computational primitive by the device.
-
公开(公告)号:US20210109753A1
公开(公告)日:2021-04-15
申请号:US17126156
申请日:2020-12-18
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Timothy David Anderson , Duc Quang Bui
IPC: G06F9/30 , G06F1/3206 , G06F1/3287 , G06F9/38
Abstract: In one disclosed embodiment, a processor includes a first execution unit and a second execution unit, a register file, and a data path including a plurality of lanes. The data path and the register file are arranged so that writing to the register file by the first execution unit and by the second execution unit is allowed over the data path, reading from the register file by the first execution unit is allowed over the data path, and reading from the register file by the second execution unit is not allowed over the data path. The processor also includes a power control circuit configured to, when a transfer of data between the register file and either of the first and second execution units uses less than all of the lanes, power down the lanes of the data path not used for the transfer of the data.
-
公开(公告)号:US20200371946A1
公开(公告)日:2020-11-26
申请号:US16882234
申请日:2020-05-22
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Naveen Bhoria , Timothy David Anderson , Pete Michael Hippleheuser
IPC: G06F12/0888 , G06F12/0891 , G06F12/02 , G06F9/54
Abstract: Methods, apparatus, systems and articles of manufacture are disclosed to reduce read-modify-write cycles for non-aligned writes. An example apparatus includes a memory that includes a plurality of memory banks, an interface configured to be coupled to a central processing unit, the interface to obtain a write operation from the central processing unit, wherein the write operation is to write a subset of the plurality of memory banks, and bank processing logic coupled to the interface and to the memory, the bank processing logic to determine the subset of the plurality of memory banks to write based on the write operation, and determine whether to cause a read operation to be performed in response to the write operation based on whether a number of addresses in the subset of the plurality of memory banks to write satisfies a threshold.
-
-
-
-
-
-
-
-
-