Optimized and scalable sparse triangular linear systems on networks of accelerators

Granted invention patent

US10936697B2 Optimized and scalable sparse triangular linear systems on networks of accelerators (in force)


Patent title: Optimized and scalable sparse triangular linear systems on networks of accelerators
Application number: US16044145

Filing date: 2018-07-24
Publication number: US10936697B2

Publication date: 2021-03-02
Inventors: Khaled Hamidouche, Michael W. LeBeane, Nicholas P. Malaya, Joseph L. Greathouse
Applicant: Advanced Micro Devices, Inc.
Applicant address: US CA Santa Clara
Assignee: Advanced Micro Devices, Inc.
Current assignee: Advanced Micro Devices, Inc.
Current assignee address: US CA Santa Clara
Agent: Liang & Cheng, PC
Main classification: G06F17/16
IPC classifications: G06F17/16; G06F9/38; G06F9/30; G06F17/12


Abstract:

A method includes storing a first portion of a sparse triangular matrix in a local memory and launching a kernel for executing a set of workgroups. The first portion includes a plurality of row blocks, and each workgroup in the set of workgroups is associated with one of the plurality of row blocks. The method also includes, for each workgroup in the set of workgroups, solving the row block. The row block is solved by, for each row segment of a first subset of row segments in the row block, calculating a partial sum for the row segment based on one or more matrix elements in the row segment, and writing the partial sum to a remote memory of a first remote processing unit prior to terminating the kernel.
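The partial-sum idea in the abstract can be illustrated with a small simulation. The following is a minimal sketch, not the patented method: it assumes two simulated processing units that split the columns of a lower-triangular matrix at a hypothetical index `split`, it replaces workgroups and row blocks with plain sequential loops, and the names `solve_two_units`, `rows`, and `remote_partial` are invented for illustration. The list `remote_partial` stands in for the remote memory that the first unit writes to before its kernel would terminate.

```python
def solve_two_units(rows, b, split):
    """Solve L x = b for a sparse lower-triangular L.

    rows[i] is a list of (column, value) pairs for row i, diagonal included.
    Assumption: unit 0 owns columns/unknowns [0, split), unit 1 owns the rest.
    """
    n = len(b)
    x = [0.0] * n
    # Stand-in for unit 1's memory; unit 0 writes partial sums into it.
    remote_partial = [0.0] * n

    # Unit 0: forward substitution on its own rows. Because L is lower
    # triangular, these rows only reference columns that unit 0 owns.
    for i in range(split):
        acc = 0.0
        diag = None
        for j, v in rows[i]:
            if j == i:
                diag = v
            else:
                acc += v * x[j]
        x[i] = (b[i] - acc) / diag

    # Unit 0: for each row owned by unit 1, reduce the row segment that
    # falls in unit 0's columns to a single partial sum and "send" it.
    for i in range(split, n):
        remote_partial[i] = sum(v * x[j] for j, v in rows[i] if j < split)

    # Unit 1: finish its rows, starting from the received partial sums.
    for i in range(split, n):
        acc = remote_partial[i]
        diag = None
        for j, v in rows[i]:
            if j == i:
                diag = v
            elif j >= split:
                acc += v * x[j]
        x[i] = (b[i] - acc) / diag
    return x


# Example system whose exact solution is x = [1, 2, 3, 4].
rows = [
    [(0, 2.0)],
    [(0, 1.0), (1, 1.0)],
    [(1, 3.0), (2, 4.0)],
    [(0, 1.0), (2, 2.0), (3, 5.0)],
]
b = [2.0, 3.0, 18.0, 27.0]
x = solve_two_units(rows, b, split=2)
```

In this sketch only one number per remote row crosses between the units, rather than every solved unknown the row depends on, which is the kind of communication saving a partial-sum write suggests.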

Publication / grant documents

US20200034405A1 OPTIMIZED AND SCALABLE SPARSE TRIANGULAR LINEAR SYSTEMS ON NETWORKS OF ACCELERATORS Publication/grant date: 2020-01-30


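IPC classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: see G06N)
G06F17/00	Digital computing or data processing equipment or methods, specially adapted for specific functions (information retrieval, database structures or file system structures therefor: see G06F 16/00)
G06F17/10	. Complex mathematical operations
G06F17/16	.. Matrix or vector calculations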