-
公开(公告)号:US11145024B2
公开(公告)日:2021-10-12
申请号:US16728591
申请日:2019-12-27
Applicant: QUALCOMM Incorporated
Inventor: Balaji Calidas , Joshua Walter Kelly , Avinash Seetharamaiah , Jonnala Gadda Nagendra Kumar , Hitendra Mohan Gangani
Abstract: Methods, systems, and devices for processing are described. A device may parse a set of layers of a deep neural network. The set of layers may be associated with a set of machine learning operations of the deep neural network. The device may determine one or more layer parameters based on the determined set of layers. In some aspects, the device may determine an execution time associated with executing a shader dispatch based on the one or more layer parameters. The device may batch the shader dispatch to a command buffer based on the execution time and process the command buffer based on the batching. The device may determine a target execution time based on an assembly time associated with the command buffer, a processing time associated with the command buffer, a frequency level associated with processing the command buffer, the one or more layer parameters, or some combination thereof.
-
公开(公告)号:US20210201433A1
公开(公告)日:2021-07-01
申请号:US16728591
申请日:2019-12-27
Applicant: QUALCOMM Incorporated
Inventor: Balaji CALIDAS , Joshua Walter Kelly , Avinash Seetharamaiah , Jonnala Gadda Nagendra Kumar , Hitendra Mohan Gangani
Abstract: Methods, systems, and devices for processing are described. A device may parse a set of layers of a deep neural network. The set of layers may be associated with a set of machine learning operations of the deep neural network. The device may determine one or more layer parameters based on the determined set of layers. In some aspects, the device may determine an execution time associated with executing a shader dispatch based on the one or more layer parameters. The device may batch the shader dispatch to a command buffer based on the execution time and process the command buffer based on the batching. The device may determine a target execution time based on an assembly time associated with the command buffer, a processing time associated with the command buffer, a frequency level associated with processing the command buffer, the one or more layer parameters, or some combination thereof.
-