-
公开(公告)号:US11742875B1
公开(公告)日:2023-08-29
申请号:US17724849
申请日:2022-04-20
Applicant: MediaTek Inc.
Inventor: Hsien-Kai Kuo , Huai-Ting Li , Shou-Yao Tseng , Po-Yu Chen
CPC classification number: H03M7/30 , G06F9/5027 , G06N3/04 , G06N3/063 , G06N3/08
Abstract: Floating-point numbers are compressed for neural network computations. A compressor receives multiple operands, each operand having a floating-point representation of a sign bit, an exponent, and a fraction. The compressor re-orders the operands into a first sequence of consecutive sign bits, a second sequence of consecutive exponents, and a third sequence of consecutive fractions. The compressor then compresses the first sequence, the second sequence, and the third sequence to remove at least duplicate exponents. As a result, the compressor can losslessly generate a compressed data sequence.
-
2.
公开(公告)号:US20240119283A1
公开(公告)日:2024-04-11
申请号:US18377315
申请日:2023-10-06
Applicant: MEDIATEK INC.
Inventor: Jui-Yang Hsu , Cheng-Sheng Chan , Jen-Chieh Tsai , Huai-Ting Li , Bo-Yu Kuo , Yen-Hao Chen , Kai-Ling Huang , Ping-Yuan Tseng , Tao Tu , Sheng-Je Hung
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: A method of performing automatic tuning on a deep learning model includes: utilizing an instruction-based learned cost model to estimate a first type of operational performance metrics based on a tuned configuration of layer fusion and tensor tiling; utilizing statistical data gathered during a compilation process of the deep learning model to determine a second type of operational performance metrics based on the tuned configuration of layer fusion and tensor tiling; performing an auto-tuning process to obtain a plurality of optimal configurations based on the first type of operational performance metrics and the second type of operational performance metrics; and configure the deep learning model according to one of the plurality of optimal configurations.
-