ACCELERATION OF GPUS IN CLOUD COMPUTING
摘要:
The disclosure relates to technology for acceleration of GPUs in cloud. Instructions for a computational task are accessed. An allocation of data and instructions is calculated based on the data, the instructions, and dynamic GPU resources. The data and the instructions are provided to the GPUs in accordance with the allocation, which includes scheduling a set of instructions for parallel computation of an operation of the computational task on multiple sub-matrices of a data matrix. Separate portions of information are stored into corresponding different regions of non-transitory memory of a processor core to provide concurrent access to the multiple sub-matrices to the processor core. Each sub-matrix corresponds to a portion of the data matrix for which an operation of the computational task is to be performed. Each sub-matrix contains an element in the data matrix in common with another sub-matrix of the data matrix.
信息查询
0/0