Determination of the Optimum Degree of Redundancy for Fault-prone Many-Core Systems

Konferenz: Zuverlässigkeit und Entwurf - 6. GMM/GI/ITG-Fachtagung
25.09,2012 - 27.09.2012 in Bremen, Deutschland

Tagungsband: Zuverlässigkeit und Entwurf

Seiten: 8Sprache: EnglischTyp: PDF

Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt

Autoren:
Runge, Armin (University of Würzburg, Am Hubland, 97074 Würzburg, Germany)

Inhalt:
The increasing transistor integration capacity will entail hundreds of processors on a single chip. Further, this will lead to an inherent susceptibility to errors of these systems. To obtain reliable systems again, various redundancy techniques can be applied. Of course, the usage of those techniques involves a significant overhead. Therefore, the identification of the optimal degree of redundancy is an important objective. In this paper we focus on core-level redundancy and checkpointing rollback-recovery. A model to determine the optimal degree of spatial and temporal redundancy regarding the minimal expected execution time will be introduced. Further, we will show that in several cases, the minimal expected execution time is achieved just by a simultaneous combination of both techniques, spatial redundancy and temporal redundancy.