Publica
Hier finden Sie wissenschaftliche Publikationen aus den FraunhoferInstituten. Efficient AMG on heterogeneous systems
: Kraus, Jiri; Förster, Malte
:
Preprint urn:nbn:de:0011n1945705 (202 KByte PDF) MD5 Fingerprint: d233d0f2b9fc98e823cdcd21020ad254 Created on: 24.2.2012 
 Keller, R.: Facing the multicorechallenge II : Aspects of new paradigms and technologies in parallel computing; Second Conference for Young Scientists  "Facing the Multicore Challenge II", held at the Karlsruhe Institute of Technology (KIT), September 2830, 2011 Berlin: Springer, 2012 (Lecture Notes in Computer Science 7174) ISBN: 364230396X ISBN: 9783642303968 ISBN: 9783642303975 ISSN: 03029743 pp.133146 
 Conference "Facing the MulticoreChallenge" <2, 2011, Karlsruhe> 

 English 
 Conference Paper, Electronic Publication 
 Fraunhofer SCAI () 
 LAMA; AMG; GPU; CUDA 
Abstract
In many numerical simulation codes the backbone of the application covers the solution of linear systems of equations. Often, being created via a discretization of dierential equations, the corresponding matrices are very sparse. One popular way to solve these sparse linear systems are multigrid methods  in particular AMG  because of their numerical scalability. As the memory bandwidth is usually the bottleneck of linear solvers for sparse systems they especially benefit from high throughput architectures like GPUs. We will show that this is true even for a rather complex hierarchical method like AMG. The presented benchmarks are all based on the new open source library LAMA and compare the run times on different GPUs to those of an efficient OpenMP parallel CPU implementati on. As the memory access pattern is especially crucial for GPUs we have a focus on the performance of different sparse matrix formats.