-
公开(公告)号:US12164892B2
公开(公告)日:2024-12-10
申请号:US17674903
申请日:2022-02-18
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Mehrnoosh Heidarpour , Henry Fangli Kao , Ehsan Amiri , Congzhe Cao
IPC: G06F8/41
Abstract: A method for fusing functions of a software program is provided. The method includes processing an intermediate representation (IR) of the software program and determining one or more function call candidates capable of fusion based on the processed IR. The method further includes determining similarities between the function call candidates and generating a merged function of the function call candidates based on the determined similarities. The method further includes generating at least one interleaved version of the merged function and determining an optimized function version from among the merged function and the at least one interleaved version. The method further includes generating a fused function based on the determined optimized function version. In addition, a device is provided which includes a processor, and non-transitory computer readable memory having stored thereon machine-readable instructions which when executed by the processor configure the device to execute the methods disclosed herein.
-
公开(公告)号:US11816488B2
公开(公告)日:2023-11-14
申请号:US17523560
申请日:2021-11-10
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Henry Fangli Kao , Shehab Yomn Abdellatif Elsayed , Tomasz Sebastian Czajkowski , Reza Azimi , Ehsan Amiri
CPC classification number: G06F9/30181 , G06F9/3016 , G06F9/3802 , G06F9/3832
Abstract: There is provided methods and devices for dynamically simplifying processor instructions. A method includes receiving, at a computing device, processor instructions and determining, by the computing device, if instruction simplification is enabled for an instruction being processed. The method further includes determining, by the computing device, from an instruction simplification table if the instruction is capable of being simplified and scheduling, by the computing device, a simplified instruction based on the determination from the instruction simplification table. A device includes a processor, and a non-transient computer readable memory having stored thereon instructions which when executed by the processor configure the device to execute the methods disclosed herein.
-
公开(公告)号:US11755299B2
公开(公告)日:2023-09-12
申请号:US17355737
申请日:2021-06-23
Applicant: Huawei Technologies Co., Ltd.
Inventor: Tomasz S. Czajkowski , Ehsan Amiri
IPC: G06F8/41
CPC classification number: G06F8/4441
Abstract: There is provided methods and apparatus to improve runtime by computer programs at compilation time. A compiler analyzes code to be translated into machine executable instructions to identify overloaded functional units of the target processor, and replaces instructions scheduled on the overloaded functional unit to an idle unit using functionally equivalent operations on the idle unit. The replacement instructions may be taken from an instruction replacement library comprising function calls that implement functionality of a functional unit of the target processor on another functional unit of the target processor.
-
公开(公告)号:US20220413818A1
公开(公告)日:2022-12-29
申请号:US17355737
申请日:2021-06-23
Applicant: Huawei Technologies Co., Ltd.
Inventor: Tomasz S. Czajkowski , Ehsan Amiri
IPC: G06F8/41
Abstract: There is provided methods and apparatus to improve runtime by computer programs at compilation time. A compiler analyzes code to be translated into machine executable instructions to identify overloaded functional units of the target processor, and replaces instructions scheduled on the overloaded functional unit to an idle unit using functionally equivalent operations on the idle unit. The replacement instructions may be taken from an instruction replacement library comprising function calls that implement functionality of a functional unit of the target processor on another functional unit of the target processor.
-
-
-