-
Publication No.: US20210240524A1
Publication Date: 2021-08-05
Application No.: US16779275
Filing Date: 2020-01-31
Applicant: QUALCOMM Incorporated
Inventor: Hitendra Mohan GANGANI, Balaji CALIDAS, Murat BALCI
Abstract: The present disclosure relates to methods and apparatus for machine learning processing. For example, disclosed techniques facilitate tile-based GPU machine learning acceleration. Aspects of the present disclosure can determine a tile size based on a memory size of a first memory and a job input size associated with executing a computational job. In some examples, the computational job may be one of a quantity of computational jobs configured to execute a machine learning primitive. Aspects of the present disclosure can also load, based on the tile size, input data associated with a batch of computational jobs from a second memory to the first memory. Further, aspects of the present disclosure can generate batch output data by executing the batch of computational jobs using the input data loaded to the first memory. Additionally, aspects of the present disclosure can store the generated batch output data to the second memory.
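The flow in this abstract (pick a tile size from the fast-memory capacity and the per-job input size, load a batch, execute it, store the results) can be illustrated with a minimal sketch. It is not the patented implementation: the names `tile_size` and `run_tiled`, the use of Python lists to stand in for the first and second memories, and the rule "tile size = how many job inputs fit in the first memory" are all assumptions made for illustration.

```python
def tile_size(first_mem_bytes, job_input_bytes):
    """Hypothetical rule: number of jobs whose inputs fit in the first memory."""
    if job_input_bytes <= 0:
        raise ValueError("job_input_bytes must be positive")
    if job_input_bytes > first_mem_bytes:
        raise ValueError("a single job does not fit in the first memory")
    return first_mem_bytes // job_input_bytes


def run_tiled(inputs, job, first_mem_bytes, job_input_bytes):
    """Execute `job` over `inputs` one batch (tile) at a time.

    `inputs` and the returned list stand in for the second (larger) memory;
    `first_mem` stands in for the first (smaller, faster) memory.
    """
    n = tile_size(first_mem_bytes, job_input_bytes)
    second_mem_out = []
    for start in range(0, len(inputs), n):
        # Load one batch of input data from the second memory to the first.
        first_mem = list(inputs[start:start + n])
        # Generate batch output data by executing the batch of jobs.
        batch_out = [job(x) for x in first_mem]
        # Store the batch output back to the second memory.
        second_mem_out.extend(batch_out)
    return second_mem_out


if __name__ == "__main__":
    # 1024 bytes of fast memory, 256 bytes per job input: 4 jobs per batch.
    print(tile_size(1024, 256))
    print(run_tiled([1, 2, 3, 4, 5], lambda x: x * 2, 1024, 256))
```

On a real GPU the first memory would typically be on-chip and the jobs would run in parallel; the sequential loop here only shows the order of the load, execute, and store steps.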
-
Publication No.: US20220114284A1
Publication Date: 2022-04-14
Application No.: US17070734
Filing Date: 2020-10-14
Applicant: QUALCOMM Incorporated
Inventor: Avinash SEETHARAMAIAH, Murat BALCI, Jonnala Gadda NAGENDRA KUMAR, Nigel POOLE, Abhiraj DESHPANDE
Abstract: Systems, methods, and computer-readable media are provided for signing and executing graphics processing unit (GPU) commands. In some examples, a method can include receiving, by a GPU, one or more commands including one or more verification signatures generated using a processor, each verification signature of the one or more verification signatures including a first value generated based on the one or more commands; generating, by the GPU, one or more additional verification signatures associated with the one or more commands, wherein each verification signature of the one or more additional verification signatures includes a second value generated by the GPU based on the one or more commands; and determining, by the GPU, a validity of the one or more commands based on a comparison of the one or more verification signatures and the one or more additional verification signatures.
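The verify-by-recomputation scheme in this abstract can be sketched as follows. The abstract does not say how the signature values are computed, so the use of HMAC-SHA256 with a shared key is an assumption, as are the function names `sign_commands` and `gpu_validate`; the sketch only shows the shape of the comparison, not the patented method.

```python
import hashlib
import hmac


def sign_commands(key, commands):
    """Processor side: one signature (the "first value") per command."""
    return [hmac.new(key, cmd, hashlib.sha256).digest() for cmd in commands]


def gpu_validate(key, commands, signatures):
    """GPU side: recompute each signature (the "second value") and compare."""
    if len(commands) != len(signatures):
        return False
    recomputed = sign_commands(key, commands)
    # compare_digest is a constant-time comparison from the standard library.
    return all(
        hmac.compare_digest(received, expected)
        for received, expected in zip(signatures, recomputed)
    )


if __name__ == "__main__":
    key = b"hypothetical-shared-key"          # assumption: pre-shared secret
    cmds = [b"DRAW 0 3", b"DISPATCH 8 8 1"]   # made-up command encodings
    sigs = sign_commands(key, cmds)
    print(gpu_validate(key, cmds, sigs))                        # untampered
    print(gpu_validate(key, [b"DRAW 0 9999", cmds[1]], sigs))   # tampered
```

A command altered after signing yields a different recomputed value, so the comparison fails and the command can be rejected before execution.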
-