Systems and methods for machine-learned models having convolution and attention

Invention Grant

US11755883B2 Systems and methods for machine-learned models having convolution and attention 有权

Please log in to see more content

Patent Title: Systems and methods for machine-learned models having convolution and attention
Application No.: US17827130

Application Date: 2022-05-27
Publication No.: US11755883B2

Publication Date: 2023-09-12
Inventor: Zihang Dai , Hanxiao Liu , Mingxing Tan , Quoc V. Le
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: GOOGLE LLC
Current Assignee: GOOGLE LLC
Current Assignee Address: US CA Mountain View
Agency: Dority & Manning, P.A.
Main IPC: G06N20/00
IPC: G06N20/00 ; G06N3/044 ; G06N3/063 ; G06N3/08

Systems and methods for machine-learned models having convolution and attention

Abstract:

A computer-implemented method for performing computer vision with reduced computational cost and improved accuracy can include obtaining, by a computing system including one or more computing devices, input data comprising an input tensor having one or more dimensions, providing, by the computing system, the input data to a machine-learned convolutional attention network, the machine-learned convolutional attention network including two or more network stages, and, in response to providing the input data to the machine-learned convolutional attention network, receiving, by the computing system, a machine-learning prediction from the machine-learned convolutional attention network. The convolutional attention network can include at least one attention block, wherein the attention block includes a relative attention mechanism, the relative attention mechanism including the sum of a static convolution kernel with an adaptive attention matrix. This provides for improved generalization, capacity, and efficiency of the convolutional attention network relative to some existing models.

Public/Granted literature

US20220383069A1 Systems and Methods for Machine-Learned Models Having Convolution and Attention Public/Granted day:2022-12-01

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N20/00	机器学习