-
公开(公告)号:US20220374713A1
公开(公告)日:2022-11-24
申请号:US17880070
申请日:2022-08-03
Inventor: Zhihua WU , Dianhai YU , Yulong AO , Weibao GONG
IPC: G06N3/08
Abstract: The present disclosure provides a method and apparatus for performing distributed training on a deep learning model. The method may include: generating a distributed computation view based on data information of a to-be-trained deep learning model; generating a cluster resource view based on property information of a cluster hardware resource corresponding to the to-be-trained deep learning model; determining a target segmentation strategy of a distributed training task based on the distributed computation view and the cluster resource view; and performing distributed training on the to-be-trained deep learning model based on the target segmentation strategy.