- 专利标题: Low latency long short-term memory inference with sequence interleaving
-
申请号: US16177218申请日: 2018-10-31
-
公开(公告)号: US11769041B2公开(公告)日: 2023-09-26
- 发明人: Sateesh Lagudu , Lei Zhang , Allen H. Rush
- 申请人: Advanced Micro Devices, Inc. , ATI Technologies ULC
- 申请人地址: US CA Santa Clara
- 专利权人: Advanced Micro Devices, Inc.,ATI Technologies ULC
- 当前专利权人: Advanced Micro Devices, Inc.,ATI Technologies ULC
- 当前专利权人地址: US CA Santa Clara; CA Markham
- 代理机构: KOWERT HOOD MUNYON RANKIN AND GOETZEL PC
- 代理商 Rory D. Rankin
- 主分类号: G06N3/063
- IPC分类号: G06N3/063 ; G06F7/544 ; G06F17/16 ; G06N20/00
摘要:
Systems, apparatuses, and methods for implementing a low latency long short-term memory (LSTM) machine learning engine using sequence interleaving techniques are disclosed. A computing system includes at least a host processing unit, a machine learning engine, and a memory. The host processing unit detects a plurality of sequences which will be processed by the machine learning engine. The host processing unit interleaves the sequences into data blocks and stores the data blocks in the memory. When the machine learning engine receives a given data block, the machine learning engine performs, in parallel, a plurality of matrix multiplication operations on the plurality of sequences in the given data block and a plurality of coefficients. Then, the outputs of the matrix multiplication operations are coupled to one or more LSTM layers.
信息查询