-
公开(公告)号:US20240403598A1
公开(公告)日:2024-12-05
申请号:US18327821
申请日:2023-06-01
Applicant: Microsoft Technology Licensing, LLC
Inventor: Youshan MIAO , Fan YANG , Quanlu ZHANG , Saeed MALEKI , Xu CAO , Yi ZHU , Mao YANG , Lidong ZHOU , Zhiqi LIN
IPC: G06N3/04
Abstract: Embodiments of the present disclosure include techniques for designing and generating a parallelization plan for a neural network so that workloads in the neural network may be split amongst multiple devices. Operators and tensors in the neural network are transformed into a set of functionally equivalent operators and tensors. These functionally equivalent operators and tensors are then scheduled to separate devices for execution.