Adjustable precision for multi-stage compute processes

Invention Grant

US11385863B2 Adjustable precision for multi-stage compute processes 有权

Please log in to see more content

Patent Title: Adjustable precision for multi-stage compute processes
Application No.: US16052218

Application Date: 2018-08-01
Publication No.: US11385863B2

Publication Date: 2022-07-12
Inventor: Sai Rahul Chalamalasetti , Paolo Faraboschi , Martin Foltin , Catherine Graves , Dejan S. Milojicic , Sergey Serebryakov , John Paul Strachan
Applicant: HEWLETT PACKARD ENTERPRISE DEVELOPMENT LP
Applicant Address: US TX Houston
Assignee: HEWLETT PACKARD ENTERPRISE DEVELOPMENT LP
Current Assignee: HEWLETT PACKARD ENTERPRISE DEVELOPMENT LP
Current Assignee Address: US TX Houston
Agency: Nolte Lackenbach Siegel
Main IPC: G06F7/483
IPC: G06F7/483 ; G06N3/08 ; G06N3/063

Adjustable precision for multi-stage compute processes

Abstract:

Disclosed techniques provide for dynamically changing precision of a multi-stage compute process. For example, changing neural network (NN) parameters on a per-layer basis depending on properties of incoming data streams and per-layer performance of an NN among other considerations. NNs include multiple layers that may each be calculated with a different degree of accuracy and therefore, compute resource overhead (e.g., memory, processor resources, etc.). NNs are usually trained with 32-bit or 16-bit floating-point numbers. Once trained, an NN may be deployed in production. One approach to reduce compute overhead is to reduce parameter precision of NNs to 16 or 8 for deployment. The conversion to an acceptable lower precision is usually determined manually before deployment and precision levels are fixed while deployed. Disclosed techniques and implementations address automatic rather than manual determination or precision levels for different stages and dynamically adjusting precision for each stage at run-time.

Public/Granted literature

US20200042287A1 Adjustable Precision for Multi-Stage Compute Processes Public/Granted day:2020-02-06

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F7/00	通过待处理的数据的指令或内容进行运算的数据处理的方法或装置（逻辑电路入H03K19/00）
G06F7/38	.只利用数制表示，例如利用二进制、三进制、十进制表示来完成计算的方法或装置
G06F7/48	..应用非形成接触器件的，例如，电子管、固体器件；应用非特定的器件的
G06F7/483	...用数制的非线性组合表示的数字计算，例如，有理数、对数系统、或浮点数