A survey of fault tolerance approaches on different architecture levels

Konferenz: ARCS 2017 - 30th International Conference on Architecture of Computing Systems
03.04.2017 - 06.04.2017 in Vienna, Austria

Tagungsband: ARCS 2017

Seiten: 9Sprache: EnglischTyp: PDF

Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt

Autoren:
Osinski, Lukas; Langer, Tobias; Mottok, Juergen (Laboratory of Safe and Secure Systems - LaS3, University of Applied Sciences Regensburg, Germany)

Inhalt:
In the recent years the development trends for computing platforms moved to multicore systems. Associated with this trend, feature sizes decreased with each new hardware generation and consequently led to a rise of transient and permanent error frequency in memory and CPUs. In this context, researchers presented several approaches which exploit the inherent redundancy of multicore platforms to provide fault tolerance. We present a discussion of fault tolerance approaches based on redundancy at different levels of architecture regarding their sphere of replication, performance as well as error detection and recovery capability.