Low latency long short-term memory inference with sequence interleaving

发明授权

US11769041B2 Low latency long short-term memory inference with sequence interleaving 有权

请登陆查看更多内容

专利标题： Low latency long short-term memory inference with sequence interleaving
申请号： US16177218

申请日： 2018-10-31
公开(公告)号： US11769041B2

公开(公告)日： 2023-09-26
发明人: Sateesh Lagudu , Lei Zhang , Allen H. Rush
申请人： Advanced Micro Devices, Inc. , ATI Technologies ULC
申请人地址： US CA Santa Clara
专利权人： Advanced Micro Devices, Inc.,ATI Technologies ULC
当前专利权人： Advanced Micro Devices, Inc.,ATI Technologies ULC
当前专利权人地址： US CA Santa Clara; CA Markham
代理机构： KOWERT HOOD MUNYON RANKIN AND GOETZEL PC
代理商 Rory D. Rankin
主分类号： G06N3/063
IPC分类号： G06N3/063 ; G06F7/544 ; G06F17/16 ; G06N20/00

Low latency long short-term memory inference with sequence interleaving

摘要：

Systems, apparatuses, and methods for implementing a low latency long short-term memory (LSTM) machine learning engine using sequence interleaving techniques are disclosed. A computing system includes at least a host processing unit, a machine learning engine, and a memory. The host processing unit detects a plurality of sequences which will be processed by the machine learning engine. The host processing unit interleaves the sequences into data blocks and stores the data blocks in the memory. When the machine learning engine receives a given data block, the machine learning engine performs, in parallel, a plurality of matrix multiplication operations on the plurality of sequences in the given data block and a plurality of coefficients. Then, the outputs of the matrix multiplication operations are coupled to one or more LSTM layers.

信息查询

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/06	..物理实现，即神经网络、神经元或神经元部分的硬件实现
G06N3/063	...采用电的