-
Publication No.: US11656854B2
Publication date: 2023-05-23
Application No.: US17460749
Filing date: 2021-08-30
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Xun Deng, Kai-Ting Amy Wang
Abstract: Methods and devices are provided for computing a tile size for software optimization. A method includes receiving, by a computing device, information indicative of one or more of a set of loop bounds and a set of data shapes; processing, by the computing device, the information to determine a computation configuration based on the obtained information, the computation configuration implementable by a compiler, said processing including evaluating at least the computation configuration based on a build cost model, the build cost model representative of a data transfer cost and a data efficiency of the computation configuration; and transmitting, by the computing device, instructions directing the compiler to implement the computation configuration.
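Illustration (not from the patent): a minimal sketch of how a tile size might be scored with a cost model that weighs data transfer against data efficiency. The abstract does not define the model, so everything here is an assumption made for illustration: the tiled matrix-multiply loop nest, the function names `candidate_sizes`, `cost` and `choose_tile`, the on-chip buffer capacity `buffer_elems`, and the formula `transfer / efficiency`.

```python
from itertools import product
from math import ceil


def candidate_sizes(bound, step=16):
    """Tile sizes that are multiples of `step`, up to the loop bound rounded up."""
    limit = ceil(bound / step) * step
    return range(step, limit + 1, step)


def cost(bounds, tile, buffer_elems):
    """Score one tile shape (lower is better); None if it does not fit the buffer.

    Hypothetical model: transfer cost is the number of input elements loaded
    over the whole loop nest, and efficiency is the fraction of computed
    elements that are not padding.
    """
    M, N, K = bounds
    tm, tn, tk = tile
    footprint = tm * tk + tk * tn + tm * tn  # A, B and C tiles resident together
    if footprint > buffer_elems:
        return None
    n_tiles = ceil(M / tm) * ceil(N / tn) * ceil(K / tk)
    transfer = n_tiles * (tm * tk + tk * tn)  # A and B elements loaded in total
    padded = ceil(M / tm) * tm * ceil(N / tn) * tn * ceil(K / tk) * tk
    efficiency = (M * N * K) / padded  # 1.0 when the tiles divide the bounds exactly
    return transfer / efficiency


def choose_tile(bounds, buffer_elems, step=16):
    """Exhaustively evaluate the candidates and return the cheapest tile shape."""
    best = None
    for tile in product(*(candidate_sizes(b, step) for b in bounds)):
        score = cost(bounds, tile, buffer_elems)
        if score is not None and (best is None or score < best[0]):
            best = (score, tile)
    return None if best is None else best[1]
```

In this sketch the chosen tuple plays the role of the "computation configuration" that would be handed to a compiler; a real system would likely prune the search rather than enumerate every candidate.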
-
Publication No.: US11144290B2
Publication date: 2021-10-12
Application No.: US16570822
Filing date: 2019-09-13
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Reza Azimi, Cheng Xiang Feng, Kai-Ting Amy Wang, Yaoqing Gao, Ye Tian, Xiang Wang
IPC: G06F8/41
Abstract: A method includes analyzing a dataflow graph representing data dependencies between operators of a dataflow application to identify a plurality of candidate groups of the operators. The method further includes determining, based on characteristics of a given hardware accelerator and the operators of a given candidate group of the plurality of candidate groups, whether the operators of the given candidate group are to be combined. In response to determining that the operators of the given candidate group are to be combined, the method includes retrieving executable binary code segments corresponding to the operators of the given candidate group, generating a unit of binary code including the executable binary code segments and metadata representing an execution control flow among the executable binary code segments, and dispatching the unit of code to the given hardware accelerator for execution of the unit of code.
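Illustration (not from the patent): a minimal sketch of the flow the abstract describes, namely finding candidate operator groups in a dataflow graph, deciding whether to combine a group for a given accelerator, and packaging pre-built binary segments with control-flow metadata. The grouping rule (linear chains only), the `Accelerator` fields, and the metadata layout are assumptions chosen for illustration; the abstract does not specify them.

```python
from dataclasses import dataclass


@dataclass
class Accelerator:
    """Hypothetical accelerator description."""
    supported: set        # operator names the device can execute
    max_code_bytes: int   # capacity for one combined unit of binary code


def candidate_groups(graph):
    """Return maximal linear chains of operators.

    `graph` maps each operator to the list of operators that consume its
    output; every operator must appear as a key, and the graph must be acyclic.
    """
    preds = {node: [] for node in graph}
    for src, dsts in graph.items():
        for dst in dsts:
            preds[dst].append(src)
    groups = []
    for node in graph:
        # Skip nodes that sit in the middle of a chain started elsewhere.
        if len(preds[node]) == 1 and len(graph[preds[node][0]]) == 1:
            continue
        chain = [node]
        while len(graph[chain[-1]]) == 1:
            nxt = graph[chain[-1]][0]
            if len(preds[nxt]) != 1:
                break
            chain.append(nxt)
        if len(chain) > 1:
            groups.append(chain)
    return groups


def should_combine(group, segments, acc):
    """Combine only if every operator is supported and the code fits."""
    if not all(op in acc.supported for op in group):
        return False
    return sum(len(segments[op]) for op in group) <= acc.max_code_bytes


def build_unit(group, segments):
    """Concatenate the segments; metadata lists (operator, offset, length) in execution order."""
    code, metadata, offset = b"", [], 0
    for op in group:
        seg = segments[op]
        metadata.append((op, offset, len(seg)))
        code += seg
        offset += len(seg)
    return code, metadata
```

The returned `(code, metadata)` pair stands in for the unit that would be dispatched to the accelerator; the dispatch step itself is device specific and is left out of the sketch.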
-
Publication No.: US20220164169A1
Publication date: 2022-05-26
Application No.: US17650592
Filing date: 2022-02-10
Applicant: Huawei Technologies Co., Ltd.
Inventor: Jack Lee, Kai-Ting Amy Wang
IPC: G06F8/41
Abstract: A method and system are provided to construct, from a TensorFlow graph, a common intermediate representation that can be converted to a plurality of compiler intermediate representations (IRs), which enables compiler optimization to be applied efficiently.
-
Publication No.: US20190205113A1
Publication date: 2019-07-04
Application No.: US15861132
Filing date: 2018-01-03
Applicant: Huawei Technologies Co., Ltd.
Inventor: Marcus Theodore Chernenko Karpoff, Jose Nelson Amaral, Kai-Ting Amy Wang, Brice Adam Dobry
Abstract: An embodiment method for retiring a dynamically updatable function includes receiving, by a collector-thread, a registration of the function, wherein the registration indicates to the collector-thread addresses of memory locations for counters that count a number of calls currently being made to a previous version of the function by a plurality of execution threads; reading, by the collector-thread, values of the counters; and when the values of all the counters are zero, deleting, by the collector-thread, the function from a storage medium on a device previously executing the previous version of the function.
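Illustration (not from the patent): a minimal sketch of the retirement scheme in the abstract, where a collector reads per-thread call counters and deletes the previous version of a function once every counter is zero. The class names, the dictionary standing in for the storage medium, and the polling loop are assumptions. Python object references replace the memory addresses that the registration carries in the abstract.

```python
import threading


class CallCounter:
    """Count of calls one execution thread currently has in flight
    to the previous version of a function."""

    def __init__(self):
        self.value = 0


class Collector:
    """Hypothetical collector that retires functions once they are idle."""

    def __init__(self, storage, poll_interval=0.01):
        self.storage = storage            # dict standing in for the storage medium
        self.poll_interval = poll_interval
        self.registrations = []           # (function name, list of counters)
        self.lock = threading.Lock()

    def register(self, name, counters):
        """Record where the counters for `name` live."""
        with self.lock:
            self.registrations.append((name, counters))

    def sweep(self):
        """One pass: delete each registered function whose counters are all zero."""
        with self.lock:
            pending = []
            for name, counters in self.registrations:
                if all(c.value == 0 for c in counters):
                    self.storage.pop(name, None)
                else:
                    pending.append((name, counters))
            self.registrations = pending

    def run(self, stop_event):
        """Body for a collector thread: sweep until asked to stop."""
        while not stop_event.is_set():
            self.sweep()
            stop_event.wait(self.poll_interval)
```

A caller would start the loop with `threading.Thread(target=collector.run, args=(stop_event,))`. The sketch does not model how execution threads increment and decrement their counters safely, which a real implementation would have to address.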
-
Publication No.: US20190087321A1
Publication date: 2019-03-21
Application No.: US15710678
Filing date: 2017-09-20
Applicant: Huawei Technologies Co., Ltd.
Inventor: Kai-Ting Amy Wang, Man Pok Ho, Peng Wu, Haichuan Wang
IPC: G06F12/02, G06F12/1009, G06F9/455
CPC classification number: G06F12/0238, G06F9/45516, G06F12/0873, G06F12/1009, G06F16/188, G06F2212/65
Abstract: Embodiments of this disclosure allow non-position-independent-code to be shared between a closed application and a subsequent application without converting the non-position-independent-code into position-independent-code. In particular, embodiment techniques store live data of a closed application during runtime of the closed application, and thereafter page a portion of the live data that is common to both the closed application and a subsequent application back into volatile memory at the same virtual memory address in which the portion of live data was stored during runtime of the closed application so that the paged live data may be re-used to execute the subsequent application in the managed runtime environment. Because the paged live data is stored at the same virtual memory address during the runtimes of both applications, non-position-independent-code can be shared between the applications.
-
Publication No.: US11573777B2
Publication date: 2023-02-07
Application No.: US17186352
Filing date: 2021-02-26
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Reza Azimi, Cheng Xiang Feng, Kai-Ting Amy Wang, Yaoqing Gao, Ye Tian, Xiang Wang
IPC: G06F8/41
Abstract: A method includes analyzing a dataflow graph representing data dependencies between operators of a dataflow application to identify a plurality of candidate groups of the operators. The method further includes determining, based on characteristics of a given hardware accelerator and the operators of a given candidate group of the plurality of candidate groups, whether the operators of the given candidate group are to be combined. In response to determining that the operators of the given candidate group are to be combined, the method includes retrieving executable binary code segments corresponding to the operators of the given candidate group, generating a unit of binary code including the executable binary code segments and metadata representing an execution control flow among the executable binary code segments, and dispatching the unit of code to the given hardware accelerator for execution of the unit of code.
-
Publication No.: US20190087211A1
Publication date: 2019-03-21
Application No.: US15890256
Filing date: 2018-02-06
Applicant: Huawei Technologies Co., Ltd.
Inventor: Kai-Ting Amy Wang, Man Pok Ho, Peng Wu, Haichuan Wang
Abstract: Embodiments of this disclosure allow non-position-independent-code to be shared between a closed application and a subsequent application without converting the non-position-independent-code into position-independent-code. In particular, embodiment techniques store live data of a closed application during runtime of the closed application, and thereafter page a portion of the live data that is common to both the closed application and a subsequent application back into volatile memory at the same virtual memory address in which the portion of live data was stored during runtime of the closed application so that the paged live data may be re-used to execute the subsequent application in the managed runtime environment. Because the paged live data is stored at the same virtual memory address during the runtimes of both applications, non-position-independent-code can be shared between the applications.
-
Publication No.: US11960866B2
Publication date: 2024-04-16
Application No.: US17650592
Filing date: 2022-02-10
Applicant: Huawei Technologies Co., Ltd.
Inventor: Jack Lee, Kai-Ting Amy Wang
CPC classification number: G06F8/41
Abstract: A method and system are provided to construct, from a TensorFlow graph, a common intermediate representation that can be converted to a plurality of compiler intermediate representations (IRs), which enables compiler optimization to be applied efficiently.
-
Publication No.: US11243790B2
Publication date: 2022-02-08
Application No.: US15890256
Filing date: 2018-02-06
Applicant: Huawei Technologies Co., Ltd.
Inventor: Kai-Ting Amy Wang, Man Pok Ho, Peng Wu, Haichuan Wang
Abstract: Embodiments of this disclosure allow non-position-independent-code to be shared between a closed application and a subsequent application without converting the non-position-independent-code into position-independent-code. In particular, embodiment techniques store live data of a closed application during runtime of the closed application, and thereafter page a portion of the live data that is common to both the closed application and a subsequent application back into volatile memory at the same virtual memory address in which the portion of live data was stored during runtime of the closed application so that the paged live data may be re-used to execute the subsequent application in the managed runtime environment. Because the paged live data is stored at the same virtual memory address during the runtimes of both applications, non-position-independent-code can be shared between the applications.
-
Publication No.: US20200233882A1
Publication date: 2020-07-23
Application No.: US16251760
Filing date: 2019-01-18
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Martin Ichilevici de Oliveira, Man Pok Ho, Jose Nelson Amaral, Kai-Ting Amy Wang, Yaoqing Gao, Bryan Chan
Abstract: In some examples, a controller comprises a bucketization logic to receive a bucketization indication from a host processor, and in response to the bucketization indication, partition data stored in a memory of a storage device into buckets, wherein a first bucket of the buckets comprises data items that share a first common characteristic. The bucketization logic is to send data items of the first bucket to the host processor for processing by the host processor using a first code module configured for the first common characteristic of the first bucket.
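Illustration (not from the patent): a minimal sketch of bucketization followed by per-bucket processing. In the abstract the partitioning happens in a storage-device controller and the processing on a host processor; here both are plain functions in one process. The function names, the `key` callback that extracts the common characteristic, and the `modules` mapping are assumptions for illustration.

```python
from collections import defaultdict


def bucketize(items, key):
    """Partition items so that all items in a bucket share the same key(item)."""
    buckets = defaultdict(list)
    for item in items:
        buckets[key(item)].append(item)
    return dict(buckets)


def process(buckets, modules):
    """Run each bucket through the code module specialized for its characteristic."""
    return {char: modules[char](bucket) for char, bucket in buckets.items()}
```

For example, with the item's type as the shared characteristic, `process(bucketize([1, "a", 2.5, 3, "b"], type), {int: sum, str: "".join, float: max})` hands the integers to `sum`, the strings to `"".join`, and the floats to `max`. Each module only ever sees items of one kind, which is what allows it to be specialized.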