-
1.
公开(公告)号:US20240054384A1
公开(公告)日:2024-02-15
申请号:US16945679
申请日:2020-07-31
Applicant: Meta Platforms, Inc.
Inventor: Garret Ray Catron , Man Wang , Nadathur Rajagopalan Satish , Michael Anderson , Ying Zhang , Bertrand Allen Maher
CPC classification number: G06N20/00 , G06F9/52 , G06F9/5061
Abstract: A machine learning model network is analyzed to identify types of operations and dependencies associated with different portions of the machine learning model network, including by classifying at least a portion of the types of operations as being memory bandwidth intensive or compute intensive. The machine learning model network is partitioned across a plurality of different machine learning accelerator hardware units based at least in part on the analysis. Parallelization and pipelining of an execution of the machine learning model network is allowed based on the partitioning.