Granted invention patent
US07953684B2 Method and system for optimal parallel computing performance (lapsed)

Abstract:
A system and method that optimizes large-message global reduce operations on a non-power-of-two number of processes by consolidating the operation into a limited number of participating processes and then distributing the results back to all processes. The method divides a group of processes into subgroups, performs paired exchange and local reduce operations at some of the processes to obtain half vectors of partial reduce results, consolidates the partial reduce results into a set of remaining processes, performs successive recursive halving and recursive doubling at the set of remaining processes until each process in that set has a half vector of the complete result, and provides the full complete result at every process.
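The general flow described in the abstract can be illustrated with a small single-machine simulation. The sketch below is not the patented method: it follows the widely used textbook scheme for non-power-of-two process counts (pair up the surplus processes, fold each pair into one, run recursive halving and then recursive doubling on the remaining power-of-two set, and copy the result back). It does not reproduce the patent's specific subgroup division or its half-vector bookkeeping. The function name `allreduce_sim`, the use of Python lists as message buffers, and the choice of summation as the reduce operation are all assumptions made for illustration.

```python
def allreduce_sim(vectors):
    """Simulate a sum all-reduce over len(vectors) processes (any count >= 1).

    Illustrative only: lists stand in for per-process buffers, and
    "sending" is modelled by reading another process's list.
    """
    n = len(vectors)
    m = len(vectors[0])
    data = [list(v) for v in vectors]
    p2 = 1 << (n.bit_length() - 1)   # largest power of two <= n
    r = n - p2                       # surplus processes beyond p2
    half = m // 2

    # Phase 1: the first 2*r ranks form (even, odd) pairs. Each pair swaps
    # halves and reduces locally; the odd rank then hands its reduced half
    # to the even rank, which stays active with the pair's full partial sum.
    active = []
    for rank in range(n):
        if rank < 2 * r:
            if rank % 2 == 0:
                odd = rank + 1
                lower = [a + b for a, b in zip(data[rank][:half], data[odd][:half])]
                upper = [a + b for a, b in zip(data[rank][half:], data[odd][half:])]
                data[rank] = lower + upper
                active.append(rank)
        else:
            active.append(rank)
    bufs = [data[rank] for rank in active]   # exactly p2 buffers remain

    # Phase 2: recursive halving. At each step a process keeps one half of
    # its current segment and adds the partner's values for that half.
    seg = [(0, m)] * p2
    d = p2 // 2
    while d >= 1:
        new_bufs = [list(b) for b in bufs]
        new_seg = list(seg)
        for i in range(p2):
            j = i ^ d                         # partner index
            lo, hi = seg[i]
            mid = (lo + hi) // 2
            keep = (lo, mid) if i & d == 0 else (mid, hi)
            for k in range(*keep):
                new_bufs[i][k] = bufs[i][k] + bufs[j][k]
            new_seg[i] = keep
        bufs, seg = new_bufs, new_seg
        d //= 2

    # Phase 3: recursive doubling. Partners exchange their fully reduced
    # segments until every active process holds the whole result.
    d = 1
    while d < p2:
        new_bufs = [list(b) for b in bufs]
        new_seg = list(seg)
        for i in range(p2):
            j = i ^ d
            lo, hi = seg[j]
            new_bufs[i][lo:hi] = bufs[j][lo:hi]
            new_seg[i] = (min(seg[i][0], lo), max(seg[i][1], hi))
        bufs, seg = new_bufs, new_seg
        d *= 2

    # Phase 4: copy the result back to the odd ranks folded away in phase 1.
    result = [None] * n
    for i, rank in enumerate(active):
        result[rank] = bufs[i]
    for rank in range(1, 2 * r, 2):
        result[rank] = list(result[rank - 1])
    return result


if __name__ == "__main__":
    # Six processes (not a power of two), each contributing a vector of length 8.
    inputs = [[rank * 10 + k for k in range(8)] for rank in range(6)]
    for rank, vec in enumerate(allreduce_sim(inputs)):
        print(rank, vec)
```

With six processes, two pairs are folded in phase 1, leaving four active processes for the halving and doubling phases. Every rank ends up with the same element-wise sum of all six input vectors.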