Bartsch, ValeriaValeriaBartschMachado, RuiRuiMachadoRahn, MirkoMirkoRahnMerten, DirkDirkMertenPfreundt, Franz-JosefFranz-JosefPfreundt2022-03-132022-03-132017https://publica.fraunhofer.de/handle/publica/39901710.1007/978-3-319-64203-1_36Fault tolerance becomes an important feature at large computer systems where the mean time between failure decreases. Checkpointing is a method often used to provide resilience. We present an in-memory checkpointing library based on a PGAS API implemented with GASPI/GPI. It offers a substantial benefit when recovering from failure and leverages existing fault tolerance features of GASPI/GPI. The overhead of the library is negligible when testing it with a simple stencil code and a real life seismic imaging method.en003006519GASPI/GPI in-memory check pointing libraryconference paper