An Efficient Vector Transfer for Sparse Matrix-Vector Multiplication on Distributed Memory Systems
スポンサーリンク
概要
- 論文の詳細を見る
This paper proposes an efficient algorithm to exchange fragments of a vector distributed among processes which repeatedly perform sparse matrix-vector multiplication in parallel and in a block-decomposed manner. The idea to reduce the communication cost for the exchange is to combine non-contiguous fragments of a vector residing in a process and required by another process to multiply the fragments by the submatrix in the receiver process. That is, the sender may send fragments and gaps between them forming a larger fragment, rather than sending each fragment individually or packing fragments into a single vector before sending. The key feature of our algorithm is to find the optimum assortment of combining, individual sending and packing based on dynamic programming from given fragments and non-linear cost functions. Our preliminary evaluation with artificially generated sparse matrices shows the optimum assortment is up to 1.5 times as fast as simple packing.
- 一般社団法人情報処理学会の論文
- 2010-02-15
著者
-
Hiroshi Nagamochi
Graduate School of Informatics, Kyoto University
-
Nagamochi Hiroshi
Kyoto Univ.
-
Hiroshi Nakashima
ACCMS, Kyoto University
-
Iwashita Takeshi
Accms Kyoto University
-
Takeshi Iwashita
Accms Kyoto University
-
Toshiyuki Fukuhara
Graduate School of Informatics, Kyoto University
-
Toshiyuki Fukuhara
Graduate School Of Informatics Kyoto University
-
Hiroshi Nakashima
Accms Kyoto University
-
Takeshi Iwashita
Academic Center for Computing and Media Studies, Kyoto University
関連論文
- Constant Time Generation of Trees with Degree Bounds
- A Memory-Efficient Algorithm and Its Implementation of Variable-Size All-to-All Communication
- k-Edge and 3-Vertex Connectivity Augmentation in an Arbitrary Multigraph
- Augmenting Edge-Connectivity and Vertex-Connectivity Simultaneously
- Enumerating Colored and Rooted Outerplanar Graphs
- BI-CRITERIA FOOD PACKING BY DYNAMIC PROGRAMMING(the 50th Anniversary of the Operations Research Society of Japan)
- APPROXIMATING MINIMUM COST MULTIGRAPHS OF SPECIFIED EDGE-CONNECTIVITY UNDER DEGREE BOUNDS(the 50th Anniversary of the Operations Research Society of Japan)
- An Efficient Vector Transfer for Sparse Matrix-Vector Multiplication on Distributed Memory Systems
- Orthogonal Drawings for Plane Graphs with Specified Face Areas
- Efficient Representation of Constraints and Propagation of Variable-Value Symmetries in Distributed Constraint Reasoning
- Symmetry Breaking by Dominance Detection in Distributed Environments
- Around the Constructive Orbit Problem in Distributed Constraint Programming
- The Next-to-Shortest Path in Undirected Graphs with Nonnegative Weights
- FOREWORD (Special Section on Discrete Mathematics and Its Applications)
- GRAPH ALGORITHMS FOR NETWORK CONNECTIVITY PROBLEMS(Network Design, Control and Optimization)
- Accelerating A* algorithms by sweeping out small-degree nodes
- CONSTANT FACTOR APPROXIMATION ALGORITHMS FOR REPETITIVE ROUTING PROBLEMS OF GRASP-AND-DELIVERY ROBOTS IN PRODUCTION OF PRINTED CIRCUIT BOARDS
- A Simulation-Based Analysis for Worst Case Delay of Single and Multiple Interruptions
- Multilingualization Based on RPC for Job-level Parallel Script Language, Xcrypt
- Breadth-first Search Approach to Enumeration of Tree-like Chemical Compounds
- Breadth-first Search Approach to Enumeration of Tree-like Chemical Compounds
- CONSTANT FACTOR APPROXIMATION ALGORITHMS FOR REPETITIVE ROUTING PROBLEMS OF GRASP-AND-DELIVERY ROBOTS IN PRODUCTION OF PRINTED CIRCUIT BOARDS