发明授权
US08117288B2 Optimizing layout of an application on a massively parallel supercomputer
失效
在大型并行超级计算机上优化应用程序的布局
- 专利标题: Optimizing layout of an application on a massively parallel supercomputer
- 专利标题(中): 在大型并行超级计算机上优化应用程序的布局
-
申请号: US10963101申请日: 2004-10-12
-
公开(公告)号: US08117288B2公开(公告)日: 2012-02-14
- 发明人: Gyan V. Bhanot , Alan Gara , Philip Heidelberger , Eoin M. Lawless , James C. Sexton , Robert E. Walkup
- 申请人: Gyan V. Bhanot , Alan Gara , Philip Heidelberger , Eoin M. Lawless , James C. Sexton , Robert E. Walkup
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 代理机构: Fleit Gibbons Gutman Bongini & Bianco PL
- 代理商 Jon A. Gibbons
- 主分类号: G06F15/177
- IPC分类号: G06F15/177
摘要:
A general computer-implement method and apparatus to optimize problem layout on a massively parallel supercomputer is described. The method takes as input the communication matrix of an arbitrary problem in the form of an array whose entries C(i, j) are the amount to data communicated from domain i to domain j. Given C(i, j), first implement a heuristic map is implemented which attempts sequentially to map a domain and its communications neighbors either to the same supercomputer node or to near-neighbor nodes on the supercomputer torus while keeping the number of domains mapped to a supercomputer node constant (as much as possible). Next a Markov Chain of maps is generated from the initial map using Monte Carlo simulation with Free Energy (cost function) F=Σi,jC(i,j)H(i,j)− where H(i,j) is the smallest number of hops on the supercomputer torus between domain i and domain j. On the cases tested, found was that the method produces good mappings and has the potential to be used as a general layout optimization tool for parallel codes. At the moment, the serial code implemented to test the method is un-optimized so that computation time to find the optimum map can be several hours on a typical PC. For production implementation, good parallel code for our algorithm would be required which could itself be implemented on supercomputer.
公开/授权文献
信息查询