-
公开(公告)号:US20240403055A1
公开(公告)日:2024-12-05
申请号:US18800249
申请日:2024-08-12
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Joseph Zbiciak , Timothy Anderson
IPC: G06F9/32 , G06F9/30 , G06F9/345 , G06F9/38 , G06F11/00 , G06F11/10 , G06F12/02 , G06F12/0875 , G06F12/0897 , G06F13/16 , G06F13/40
Abstract: A streaming engine employed in a digital data processor specifies fixed first and second read only data streams. Corresponding stream address generator produces address of data elements of the two streams. Corresponding steam head registers stores data elements next to be supplied to functional units for use as operands. The two streams share two memory ports. A toggling preference of stream to port ensures fair allocation. The arbiters permit one stream to borrow the other's interface when the other interface is idle. Thus one stream may issue two memory requests, one from each memory port, if the other stream is idle. This spreads the bandwidth demand for each stream across both interfaces, ensuring neither interface becomes a bottleneck.
-
32.
公开(公告)号:US12061908B2
公开(公告)日:2024-08-13
申请号:US17472852
申请日:2021-09-13
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Joseph Zbiciak , Timothy Anderson
IPC: G06F9/32 , G06F9/30 , G06F9/345 , G06F9/38 , G06F11/00 , G06F12/02 , G06F12/0875 , G06F12/0897 , G06F13/16 , G06F13/40 , G06F11/10
CPC classification number: G06F9/321 , G06F9/30014 , G06F9/30036 , G06F9/30043 , G06F9/30047 , G06F9/30098 , G06F9/30112 , G06F9/30145 , G06F9/3016 , G06F9/32 , G06F9/345 , G06F9/3802 , G06F9/383 , G06F9/3867 , G06F11/00 , G06F12/0207 , G06F12/0875 , G06F12/0897 , G06F13/1605 , G06F13/4068 , G06F9/3836 , G06F11/10 , G06F2212/452 , G06F2212/60
Abstract: A streaming engine employed in a digital data processor specifies fixed first and second read only data streams. Corresponding stream address generator produces address of data elements of the two streams. Corresponding steam head registers stores data elements next to be supplied to functional units for use as operands. The two streams share two memory ports. A toggling preference of stream to port ensures fair allocation. The arbiters permit one stream to borrow the other's interface when the other interface is idle. Thus one stream may issue two memory requests, one from each memory port, if the other stream is idle. This spreads the bandwidth demand for each stream across both interfaces, ensuring neither interface becomes a bottleneck.
-
公开(公告)号:US12045614B2
公开(公告)日:2024-07-23
申请号:US17467550
申请日:2021-09-07
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Joseph Zbiciak
IPC: G06F9/30 , G06F9/32 , G06F9/345 , G06F9/38 , G06F11/00 , G06F11/10 , G06F12/0811 , G06F12/0815 , G06F12/0875
CPC classification number: G06F9/30043 , G06F9/30014 , G06F9/30021 , G06F9/30036 , G06F9/30112 , G06F9/30145 , G06F9/3016 , G06F9/32 , G06F9/345 , G06F9/3802 , G06F9/383 , G06F9/3867 , G06F11/00 , G06F11/1064 , G06F12/0811 , G06F12/0815 , G06F12/0875 , G06F9/30047 , G06F11/10 , G06F2212/452 , G06F2212/621
Abstract: A streaming engine employed in a digital data processor specifies a fixed read only data stream defined by plural nested loops. An address generator produces address of data elements. A steam head register stores data elements next to be supplied to functional units for use as operands. The streaming engine fetches stream data ahead of use by the central processing unit core in a stream buffer constructed like a cache. The stream buffer cache includes plural cache lines, each includes tag bits, at least one valid bit and data bits. Cache lines are allocated to store newly fetched stream data. Cache lines are deallocated upon consumption of the data by a central processing unit core functional unit. Instructions preferably include operand fields with a first subset of codings corresponding to registers, a stream read only operand coding and a stream read and advance operand coding.
-
公开(公告)号:US11994949B2
公开(公告)日:2024-05-28
申请号:US17408561
申请日:2021-08-23
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Joseph Zbiciak , Timothy Anderson
IPC: G06F11/00 , G06F9/30 , G06F9/345 , G06F9/38 , G06F11/10 , G06F11/14 , G06F12/0817 , G06F12/0875 , G06F12/0897 , H03M13/00 , G06F13/38 , G06F13/40
CPC classification number: G06F11/1076 , G06F9/30036 , G06F9/30047 , G06F9/345 , G06F9/383 , G06F9/3836 , G06F9/3891 , G06F11/00 , G06F11/1405 , G06F12/0817 , G06F12/0875 , G06F12/0897 , G06F9/30043 , G06F9/30141 , G06F9/382 , G06F9/3824 , G06F9/3881 , G06F11/10 , G06F13/38 , G06F13/40 , G06F2212/1021 , G06F2212/452 , G06F2212/60
Abstract: Disclosed embodiments relate to a streaming engine employed in, for example, a digital signal processor. A fixed data stream sequence including plural nested loops is specified by a control register. The streaming engine includes an address generator producing addresses of data elements and a steam head register storing data elements next to be supplied as operands. The streaming engine fetches stream data ahead of use by the central processing unit core in a stream buffer. Parity bits are formed upon storage of data in the stream buffer which are stored with the corresponding data. Upon transfer to the stream head register a second parity is calculated and compared with the stored parity. The streaming engine signals a parity fault if the parities do not match. The streaming engine preferably restarts fetching the data stream at the data element generating a parity fault.
-
公开(公告)号:US11860790B2
公开(公告)日:2024-01-02
申请号:US17462105
申请日:2021-08-31
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Joseph Zbiciak
IPC: G06F9/30 , G06F9/345 , G06F12/0875 , G06F12/0897 , G06F9/32 , G06F9/38 , G06F11/10 , G06F12/0846
CPC classification number: G06F12/0875 , G06F9/30003 , G06F9/3012 , G06F9/30014 , G06F9/3016 , G06F9/30032 , G06F9/30036 , G06F9/30047 , G06F9/30098 , G06F9/30112 , G06F9/30145 , G06F9/32 , G06F9/345 , G06F9/3802 , G06F9/383 , G06F9/3822 , G06F9/3853 , G06F9/3867 , G06F12/0897 , G06F11/10 , G06F12/0848 , G06F2212/452 , G06F2212/60
Abstract: A streaming engine employed in a digital data processor specifies a fixed read only data stream defined by plural nested loops. An address generator produces address of data elements. A steam head register stores data elements next to be supplied to functional units for use as operands. An element duplication unit optionally duplicates data element an instruction specified number of times. A vector masking unit limits data elements received from the element duplication unit to least significant bits within an instruction specified vector length. If the vector length is less than a stream head register size, the vector masking unit stores all 0's in excess lanes of the stream head register (group duplication disabled) or stores duplicate copies of the least significant bits in excess lanes of the stream head register.
-
公开(公告)号:US11836494B2
公开(公告)日:2023-12-05
申请号:US17387450
申请日:2021-07-28
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Timothy David Anderson , Duc Quang Bui , Joseph Zbiciak , Kai Chirca
CPC classification number: G06F9/30123 , G06F9/30101 , G06F9/30134
Abstract: A digital signal processor having a CPU with a program counter register and, optionally, an event context stack pointer register for saving and restoring the event handler context when higher priority event preempts a lower priority event handler. The CPU is configured to use a minimized set of addressing modes that includes using the event context stack pointer register and program counter register to compute an address for storing data in memory. The CPU may also eliminate post-decrement, pre-increment and post-decrement addressing and rely only on post-increment addressing.
-
公开(公告)号:US11789742B2
公开(公告)日:2023-10-17
申请号:US17688260
申请日:2022-03-07
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Timothy D. Anderson , Duc Bui , Joseph Zbiciak , Reid E. Tatge
IPC: G06F9/38
CPC classification number: G06F9/3867 , G06F9/3838
Abstract: Techniques related to executing a plurality of instructions by a processor comprising a method for executing a plurality of instructions by a processor. The method comprises detecting a pipeline hazard based on one or more instructions provided for execution by an instruction execution pipeline, beginning execution of an instruction, of the one or more instructions on the instruction execution pipeline, stalling a portion of the instruction execution pipeline based on the detected pipeline hazard, storing a register state associated with the execution of the instruction based on the stalling, determining that the pipeline hazard has been resolved, and restoring the register state to the instruction execution pipeline based on the determination.
-
公开(公告)号:US11604648B2
公开(公告)日:2023-03-14
申请号:US17353908
申请日:2021-06-22
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Joseph Zbiciak , Dheera Balasubramanian Samudrala , Duc Bui
Abstract: A method to transpose source data in a processor in response to a vector bit transpose instruction includes specifying, in respective fields of the vector bit transpose instruction, a source register containing the source data and a destination register to store transposed data. The method also includes executing the vector bit transpose instruction by interpreting N×N bits of the source data as a two-dimensional array having N rows and N columns, creating transposed source data by transposing the bits by reversing a row index and a column index for each bit, and storing the transposed source data in the destination register.
-
公开(公告)号:US11449429B2
公开(公告)日:2022-09-20
申请号:US17234874
申请日:2021-04-20
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Joseph Zbiciak
Abstract: A streaming engine employed in a digital data processor specifies a fixed read only data stream defined by plural nested loops. An address generator produces address of data elements for the nested loops. A steam head register stores data elements next to be supplied to functional units for use as operands. A stream template register specifies a circular address mode for the loop, first and second block size numbers and a circular address block size selection. For a first circular address block size selection the block size corresponds to the first block size number. For a first circular address block size selection the block size corresponds to the first block size number. For a second circular address block size selection the block size corresponds to a sum of the first block size number and the second block size number.
-
公开(公告)号:US11327761B2
公开(公告)日:2022-05-10
申请号:US16881327
申请日:2020-05-22
Applicant: Texas Instruments Incorporated
Inventor: Mujibur Rahman , Timothy David Anderson , Joseph Zbiciak
Abstract: An integrated circuit, comprising an instruction pipeline that includes instruction fetch phase circuitry, instruction decode phase circuitry, and instruction execution circuitry. The instruction execution circuitry includes transformation circuitry for receiving an interleaved dual vector operand as an input and for outputting a first natural order vector including a first set of data values from the interleaved dual vector operand and a second natural order vector including a second set of data values from the interleaved dual vector operand.
-
-
-
-
-
-
-
-
-