Machine-Learned Attention Models Featuring Omnidirectional Processing

Invention Application

US20220245428A1 Machine-Learned Attention Models Featuring Omnidirectional Processing (In force)

Patent Title: Machine-Learned Attention Models Featuring Omnidirectional Processing
Application No.: US17592796

Application Date: 2022-02-04
Publication No.: US20220245428A1

Publication Date: 2022-08-04
Inventor: Yi Tay; Da-Cheng Juan; Dara Bahri; Donald Arthur Metzler, Jr.; Jai Prakash Gupta; Mostafa Dehghani; Phillip Pham; Vamsi Krishna Aribandi; Zhen Qin
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Main IPC: G06N3/04
IPC: G06N3/04; G06N3/10

Abstract:

Provided are machine-learned attention models that feature omnidirectional processing, example implementations of which can be referred to as Omnidirectional Representations from Transformers (OMNINET). In example models described in the present disclosure, instead of maintaining a strictly horizontal receptive field, each token is allowed to attend to some or all of the other tokens across the entire network.
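
The idea of a receptive field that spans the whole network, rather than a single layer, can be illustrated with a minimal sketch. The code below is not taken from the application: the function name `omnidirectional_attention`, the use of plain scaled dot-product attention, and the omission of learned query/key/value projections are all simplifying assumptions made for illustration. It flattens the token representations from every layer into one sequence and lets each of them attend to all of the others.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def omnidirectional_attention(layers):
    """Illustrative attention over tokens from all layers (hypothetical helper).

    layers: list of L layers, each a list of N token vectors of dimension d.
    Returns (outputs, weights), each with L * N rows.
    Assumption: queries, keys and values are the raw vectors themselves;
    a real model would apply learned projections first.
    """
    # Flatten the (layer, token) grid into one sequence of L * N vectors.
    flat = [vec for layer in layers for vec in layer]
    d = len(flat[0])
    scale = 1.0 / math.sqrt(d)

    outputs, weights = [], []
    for q in flat:
        # Score this token against every token in every layer.
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in flat]
        w = softmax(scores)
        # Weighted sum of all token vectors across the network.
        out = [sum(w[j] * flat[j][c] for j in range(len(flat))) for c in range(d)]
        outputs.append(out)
        weights.append(w)
    return outputs, weights


if __name__ == "__main__":
    # Two layers, three tokens each, dimension two (made-up values).
    example = [
        [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]],
        [[0.5, 0.5], [2.0, 0.0], [0.0, 2.0]],
    ]
    outs, attn = omnidirectional_attention(example)
    print(len(outs), len(attn[0]))
```

With L layers of N tokens, each attention row has L * N entries, so the cost of this naive version grows quadratically in L * N. Restricting attention to "some" of the other tokens, as the abstract allows, would reduce that cost.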

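IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computer systems based on specific computational models
G06N3/00	Computer systems based on biological models
G06N3/02	.using neural network models
G06N3/04	..Architectures, e.g. interconnection topology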