Compute optimizations for low precision machine learning operations

Invention Grant

US12148063B2 Compute optimizations for low precision machine learning operations 有权

Please log in to see more content

Patent Title: Compute optimizations for low precision machine learning operations
Application No.: US17960611

Application Date: 2022-10-05
Publication No.: US12148063B2

Publication Date: 2024-11-19
Inventor: Elmoustapha Ould-Ahmed-Vall , Sara S. Baghsorkhi , Anbang Yao , Kevin Nealis , Xiaoming Chen , Altug Koker , Abhishek R. Appu , John C. Weast , Mike B. Macpherson , Dukhwan Kim , Linda L. Hurd , Ben J. Ashbaugh , Barath Lakshmanan , Liwei Ma , Joydeep Ray , Ping T. Tang , Michael S. Strickland
Applicant: Intel Corporation
Applicant Address: US CA Santa Clara
Assignee: Intel Corporation
Current Assignee: Intel Corporation
Current Assignee Address: US CA Santa Clara
Agency: Jaffery Watson Mendonsa & Hamilton LLP
Main IPC: G06T1/20
IPC: G06T1/20 ; G06F7/483 ; G06F9/30 ; G06F9/38 ; G06F9/50 ; G06N3/044 ; G06N3/045 ; G06N3/063 ; G06N3/084 ; G06N20/00 ; G06T1/60 ; G06F3/14 ; G06T15/00

Compute optimizations for low precision machine learning operations

Abstract:

One embodiment provides a multi-chip module accelerator usable to execute tensor data processing operations a multi-chip module. The multi-chip module may include a memory stack including multiple memory dies and parallel processor circuitry communicatively coupled to the memory stack. The parallel processor circuitry may include multiprocessor cores to execute matrix multiplication and accumulate operations. The matrix multiplication and accumulate operations may include floating-point operations that are configurable to include two-dimensional matrix multiply and accumulate operations involving inputs that have differing floating-point precisions. The floating-point operations may include a first operation at a first precision and a second operation at a second precision. The first operation may include a multiply having at least one 16-bit floating-point input and the second operation may include an accumulate having a 32-bit floating-point input.

Public/Granted literature

US20230061331A1 COMPUTE OPTIMIZATIONS FOR LOW PRECISION MACHINE LEARNING OPERATIONS Public/Granted day:2023-03-02

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06T	一般的图像数据处理或产生
G06T1/00	通用图像数据处理
G06T1/20	.处理器架构; 处理器配置，例如流水线