-
公开(公告)号:US20240354259A1
公开(公告)日:2024-10-24
申请号:US18732865
申请日:2024-06-04
Applicant: Texas Instruments Incorporated
Inventor: Asheesh Bhardwaj , Mujibur Rahman , Timothy David Anderson
IPC: G06F12/1045 , G06F7/24 , G06F7/487 , G06F7/499 , G06F7/53 , G06F7/57 , G06F9/30 , G06F9/32 , G06F9/345 , G06F9/38 , G06F9/48 , G06F11/00 , G06F11/10 , G06F12/0862 , G06F12/0875 , G06F12/0897 , G06F12/1009 , G06F15/78 , G06F17/16 , H03H17/06
CPC classification number: G06F12/1045 , G06F7/24 , G06F7/487 , G06F7/4876 , G06F7/49915 , G06F7/53 , G06F7/57 , G06F9/3001 , G06F9/30014 , G06F9/30021 , G06F9/30032 , G06F9/30036 , G06F9/30065 , G06F9/30072 , G06F9/30098 , G06F9/30112 , G06F9/30145 , G06F9/30149 , G06F9/3016 , G06F9/32 , G06F9/345 , G06F9/3802 , G06F9/3818 , G06F9/383 , G06F9/3836 , G06F9/3851 , G06F9/3856 , G06F9/3867 , G06F9/3887 , G06F9/48 , G06F11/00 , G06F11/1048 , G06F12/0862 , G06F12/0875 , G06F12/0897 , G06F12/1009 , G06F17/16 , H03H17/0664 , G06F9/30018 , G06F9/325 , G06F9/381 , G06F9/3822 , G06F11/10 , G06F15/7807 , G06F15/781 , G06F2212/452 , G06F2212/60 , G06F2212/602 , G06F2212/68
Abstract: A method is provided that includes performing, by a processor in response to a vector matrix multiply instruction, multiplying an m×n matrix (A matrix) and a n×p matrix (B matrix) to generate elements of an m×p matrix (R matrix), and storing the elements of the R matrix in a storage location specified by the vector matrix multiply instruction.
-
公开(公告)号:US12073105B2
公开(公告)日:2024-08-27
申请号:US17877518
申请日:2022-07-29
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Arthur John Redfern , Asheesh Bhardwaj
CPC classification number: G06F3/0647 , G06F3/0613 , G06F3/0683 , G06F15/00 , G06F17/16 , G06N3/045
Abstract: A matrix transfer accelerator (MTA) system/method that coordinates data transfers between an external data memory (EDM) and a local data memory (LDM) using matrix tiling and/or grouping is disclosed. The system utilizes foreground/background buffering that overlaps compute and data transfer operations and permits EDM-to-LDM data transfers with or without zero pad peripheral matrix filling. The system may incorporate an automated zero-fill direct memory access (DMA) controller (ZDC) that transfers data from the EDM to the LDM based on a set of DMA controller registers including data width register (DWR), transfer count register (TCR), fill count register (FCR), EDM source address register (ESR), and LDM target address register (LTR). The ZDC transfers matrix data from the EDM[ESR] to the LDM[LTR] such that EDM matrix data of DWR row data width is automatically zero-filled around a periphery of a matrix written to the LDM matrix based on the FCR value.
-
公开(公告)号:US20230333848A1
公开(公告)日:2023-10-19
申请号:US18320625
申请日:2023-05-19
Applicant: Texas Instruments Incorporated
Inventor: Mujibur Rahman , Asheesh Bhardwaj , Timothy David Anderson
IPC: G06F9/30 , G06F17/16 , G06F7/487 , G06F7/499 , G06F7/24 , H03H17/06 , G06F7/53 , G06F9/38 , G06F7/57 , G06F9/48
CPC classification number: G06F9/30036 , G06F9/3001 , G06F17/16 , G06F7/4876 , G06F7/49915 , G06F7/24 , G06F9/30032 , H03H17/0664 , G06F7/53 , G06F9/3836 , G06F9/30021 , G06F7/487 , G06F9/3818 , G06F7/57 , G06F9/30145 , G06F9/30149 , G06F9/3851 , G06F9/48
Abstract: A method includes executing, by a processor a vector finite impulse response (VFIR) filter instruction that specifies coefficients, data elements, and a storage location. The executing includes reordering a subset of the data elements to provide each data element of the reordered subset of the data elements to a respective slice multiply component of a vector multiplier of the processor, generating, by the vector multiplier, filter outputs based on the coefficients and data elements, and storing the filter outputs in the storage location.
-
公开(公告)号:US20230297377A1
公开(公告)日:2023-09-21
申请号:US18164806
申请日:2023-02-06
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: Asheesh Bhardwaj , Burton Adrik Copeland , Tim Anderson
CPC classification number: G06F9/30098 , G06F9/3816
Abstract: A method is described herein. The method generally includes fetching a set of data from a memory coupled to a memory controller. The method generally includes determining a first subset of data from the set of data. The method generally includes determining a second subset of data from the set of data. The method generally includes determining a first element from the set of data. The method generally includes providing a vector including the first subset, the first element, and the second subset, wherein each element of the first subset is disposed in one portion of the vector and each element of the second subset is disposed in another portion of the vector. The method generally includes storing the vector into a register of the memory controller.
-
公开(公告)号:US11347503B2
公开(公告)日:2022-05-31
申请号:US16878607
申请日:2020-05-20
Applicant: Texas Instruments Incorporated
Inventor: Asheesh Bhardwaj , Mujibur Rahman , Timothy David Anderson
IPC: G06F9/30 , G06F17/16 , G06F9/38 , G06F7/487 , G06F7/499 , G06F7/24 , H03H17/06 , G06F7/53 , G06F9/48 , G06F7/57
Abstract: A method is provided that includes performing, by a processor in response to a vector matrix multiply instruction, multiplying an m×n matrix (A matrix) and a n×p matrix (B matrix) to generate elements of an m×p matrix (R matrix), and storing the elements of the R matrix in a storage location specified by the vector matrix multiply instruction.
-
公开(公告)号:US11249759B2
公开(公告)日:2022-02-15
申请号:US16420457
申请日:2019-05-23
Applicant: Texas Instruments Incorporated
Inventor: William Franklin Leven , Asheesh Bhardwaj , Son Hung Tran , Timothy David Anderson
IPC: G06F9/34 , G06F11/00 , G06F12/0875 , G06F17/16 , G06F9/30 , G06F9/38 , G06F11/10 , G06F9/32 , G06F12/0897 , G06F9/345
Abstract: Software instructions are executed on a processor within a computer system to configure a steaming engine with stream parameters to define a multidimensional array. The stream parameters define a size for each dimension of the multidimensional array and a specified width for two selected dimensions of the array. Data is fetched from a memory coupled to the streaming engine responsive to the stream parameters. A stream of vectors is formed for the multidimensional array responsive to the stream parameters from the data fetched from memory. When either selected dimension in the stream of vectors exceeds a respective specified width, the streaming engine inserts null elements into each portion of a respective vector for the selected dimension that exceeds the specified width in the stream of vectors. Stream vectors that are completely null are formed by the streaming engine without accessing the system memory for respective data.
-
公开(公告)号:US12045617B2
公开(公告)日:2024-07-23
申请号:US17670611
申请日:2022-02-14
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: William Franklin Leven , Asheesh Bhardwaj , Son Hung Tran , Timothy David Anderson
IPC: G06F9/30 , G06F9/32 , G06F9/345 , G06F9/38 , G06F11/00 , G06F11/10 , G06F12/0875 , G06F12/0897 , G06F17/16
CPC classification number: G06F9/3016 , G06F9/30014 , G06F9/30036 , G06F9/30098 , G06F9/30112 , G06F9/30145 , G06F9/32 , G06F9/345 , G06F9/3802 , G06F9/383 , G06F9/3867 , G06F11/00 , G06F11/1048 , G06F12/0875 , G06F12/0897 , G06F9/3822 , G06F11/10 , G06F17/16 , G06F2212/452 , G06F2212/60
Abstract: Software instructions are executed on a processor within a computer system to configure a steaming engine with stream parameters to define a multidimensional array. The stream parameters define a size for each dimension of the multidimensional array and a specified width for two selected dimensions of the array. Data is fetched from a memory coupled to the streaming engine responsive to the stream parameters. A stream of vectors is formed for the multidimensional array responsive to the stream parameters from the data fetched from memory. When either selected dimension in the stream of vectors exceeds a respective specified width, the streaming engine inserts null elements into each portion of a respective vector for the selected dimension that exceeds the specified width in the stream of vectors. Stream vectors that are completely null are formed by the streaming engine without accessing the system memory for respective data.
-
公开(公告)号:US20220164188A1
公开(公告)日:2022-05-26
申请号:US17670611
申请日:2022-02-14
Applicant: TEXAS INSTRUMENTS INCORPORATED
Inventor: William Franklin Leven , Asheesh Bhardwaj , Son Hung Tran , Timothy David Anderson
IPC: G06F9/30 , G06F9/38 , G06F11/10 , G06F9/32 , G06F12/0875 , G06F12/0897 , G06F11/00 , G06F9/345
Abstract: Software instructions are executed on a processor within a computer system to configure a steaming engine with stream parameters to define a multidimensional array. The stream parameters define a size for each dimension of the multidimensional array and a specified width for two selected dimensions of the array. Data is fetched from a memory coupled to the streaming engine responsive to the stream parameters. A stream of vectors is formed for the multidimensional array responsive to the stream parameters from the data fetched from memory. When either selected dimension in the stream of vectors exceeds a respective specified width, the streaming engine inserts null elements into each portion of a respective vector for the selected dimension that exceeds the specified width in the stream of vectors. Stream vectors that are completely null are formed by the streaming engine without accessing the system memory for respective data.
-
公开(公告)号:US11113062B2
公开(公告)日:2021-09-07
申请号:US16420480
申请日:2019-05-23
Applicant: Texas Instruments Incorporated
Inventor: Asheesh Bhardwaj , Timothy David Anderson , Son Hung Tran
IPC: G06F9/302 , G06F9/315 , G06F11/00 , G06F12/08 , G06F17/16 , G06F7/74 , G06F9/30 , G06F9/38 , G06F11/10 , G06F9/32 , G06F12/0875 , G06F12/0897 , G06F9/345
Abstract: Software instructions are executed on a processor within a computer system to configure a steaming engine with stream parameters to define a multidimensional array. The stream parameters define a size for each dimension of the multidimensional array and a pad value indicator. Data is fetched from a memory coupled to the streaming engine responsive to the stream parameters. A stream of vectors is formed for the multidimensional array responsive to the stream parameters from the data fetched from memory. A padded stream vector is formed that includes a specified pad value without accessing the pad value from system memory.
-
公开(公告)号:US20200371798A1
公开(公告)日:2020-11-26
申请号:US16878607
申请日:2020-05-20
Applicant: Texas Instruments Incorporated
Inventor: Asheesh Bhardwaj , Mujibur Rahman , Timothy David Anderson
Abstract: A method is provided that includes performing, by a processor in response to a vector matrix multiply instruction, multiplying an m×n matrix (A matrix) and a n×p matrix (B matrix) to generate elements of an m×p matrix (R matrix), and storing the elements of the R matrix in a storage location specified by the vector matrix multiply instruction.
-
-
-
-
-
-
-
-
-