Attention neural networks with sparse attention mechanisms

Invention Grant

US11238332B2 Attention neural networks with sparse attention mechanisms 有权

Please log in to see more content

Patent Title: Attention neural networks with sparse attention mechanisms
Application No.: US17341193

Application Date: 2021-06-07
Publication No.: US11238332B2

Publication Date: 2022-02-01
Inventor: Joshua Timothy Ainslie , Santiago Ontañón , Philip Pham , Manzil Zaheer , Guru Guruganesh , Kumar Avinava Dubey , Amr Ahmed
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Agency: Fish & Richardson P.C.
Main IPC: G06N3/04
IPC: G06N3/04 ; G06N3/08 ; G06N3/063 ; G06N20/00

Attention neural networks with sparse attention mechanisms

Abstract:

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing network inputs using an attention neural network that has one or more sparse attention sub-layers. Each sparse attention sub-layer is configured to apply a sparse attention mechanism that attends differently for input positions that are in a first proper subset of the input positions in the input to the sub-layer than for positions that are not in the first proper subset.

Public/Granted literature

US20210383191A1 ATTENTION NEURAL NETWORKS WITH SPARSE ATTENTION MECHANISMS Public/Granted day:2021-12-09

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/04	..体系结构，例如，互连拓扑