-
公开(公告)号:US11144290B2
公开(公告)日:2021-10-12
申请号:US16570822
申请日:2019-09-13
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Reza Azimi , Cheng Xiang Feng , Kai-Ting Amy Wang , Yaoqing Gao , Ye Tian , Xiang Wang
IPC: G06F8/41
Abstract: A method includes analyzing a dataflow graph representing data dependencies between operators of a dataflow application to identify a plurality of candidate groups of the operators. Based on characteristics of a given hardware accelerator and the operators of a given candidate group of the plurality of candidate groups, determining whether the operators of the given candidate group are to be combined. In response to determining that the operators of the given candidate group are to be combined, retrieving executable binary code segments corresponding to the operators of the given candidate group, generating a unit of binary code including the executable binary code segments and metadata representing an execution control flow among the executable binary code segments, and dispatching the unit of code to the given hardware accelerator for execution of the unit of code.
-
公开(公告)号:US11573777B2
公开(公告)日:2023-02-07
申请号:US17186352
申请日:2021-02-26
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Reza Azimi , Cheng Xiang Feng , Kai-Ting Amy Wang , Yaoqing Gao , Ye Tian , Xiang Wang
IPC: G06F8/41
Abstract: A method includes analyzing a dataflow graph representing data dependencies between operators of a dataflow application to identify a plurality of candidate groups of the operators. Based on characteristics of a given hardware accelerator and the operators of a given candidate group of the plurality of candidate groups, determining whether the operators of the given candidate group are to be combined. In response to determining that the operators of the given candidate group are to be combined, retrieving executable binary code segments corresponding to the operators of the given candidate group, generating a unit of binary code including the executable binary code segments and metadata representing an execution control flow among the executable binary code segments, and dispatching the unit of code to the given hardware accelerator for execution of the unit of code.
-
公开(公告)号:US11816488B2
公开(公告)日:2023-11-14
申请号:US17523560
申请日:2021-11-10
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Henry Fangli Kao , Shehab Yomn Abdellatif Elsayed , Tomasz Sebastian Czajkowski , Reza Azimi , Ehsan Amiri
CPC classification number: G06F9/30181 , G06F9/3016 , G06F9/3802 , G06F9/3832
Abstract: There is provided methods and devices for dynamically simplifying processor instructions. A method includes receiving, at a computing device, processor instructions and determining, by the computing device, if instruction simplification is enabled for an instruction being processed. The method further includes determining, by the computing device, from an instruction simplification table if the instruction is capable of being simplified and scheduling, by the computing device, a simplified instruction based on the determination from the instruction simplification table. A device includes a processor, and a non-transient computer readable memory having stored thereon instructions which when executed by the processor configure the device to execute the methods disclosed herein.
-
4.
公开(公告)号:US11740906B2
公开(公告)日:2023-08-29
申请号:US17677413
申请日:2022-02-22
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Maziar Goudarzi , Zhizhao Qian , Reza Azimi , Billy Mengxuan Cai , Man Pok Ho
IPC: G06F9/38 , G06F8/41 , G06F9/30 , G06F9/32 , G06F12/0862
CPC classification number: G06F9/3814 , G06F8/4452 , G06F9/30047 , G06F9/325 , G06F12/0862
Abstract: A method and hardware system to remove the overhead caused by having stream handling instructions in nested loops. Where code contains inner loops, nested in outer loops, a compiler pass identifies qualified nested streams and generates ISA specific instructions for transferring stream information linking an inner loop stream with an outer loop stream, to hardware components of a co-designed prefetcher. The hardware components include a frontend able to decode and execute instructions for a stream linking information transfer mechanism, a stream engine unit with a streams configuration table (SCT) having a field for allowing a subordinate stream to stay pending for values from its master stream, and a stream prefetch manager with buffers for storing values of current elements of a master stream, and with a nested streams control unit for reconfiguring and iterating the streams.
-
公开(公告)号:US10664278B2
公开(公告)日:2020-05-26
申请号:US15680179
申请日:2017-08-17
Applicant: Huawei Technologies Co., Ltd.
Inventor: Yuanxi Chen , Jack Hon Wai Ng , Craig Davies , Reza Azimi
IPC: G06F9/38 , G06F9/50 , G06F9/4401 , G06F15/82
Abstract: In a distributed computing system comprising multiple processor types, a method of provisioning includes receiving a request from a client device for execution of a function. A first data structure identifies implementations of the function and compatible processor types for each implementation. A second data structure identifies available processors in the system. Compatible processor types matching available processors are candidates for execution of the function. A provisioning instruction is created for allocating resources for execution of the function.
-
-
-
-