Linköping University: Students Alumni Trade and Industry/Society Internal Search
dimni_SSoCC11

Level of Confidence Study for Roll-back Recovery with Checkpointing

Dimitar Nikolov
 
Urban Ingelsson
Virendra Singh
 
Erik Larsson

The 11th Swedish System-on-Chip Conference, Varberg, Sweden, May 2-3, 2011 (not reviewed, not printed).

ABSTRACT
Increasing soft error rates for semiconductor devices manufactured in later technologies enforces the use of fault tolerant techniques such as Roll-back Recovery with Checkpointing (RRC). However, RRC introduces time overhead that increases the completion (execution) time. For non-real-time systems, research have focused on optimizing RRC and shown that it is possible to find the optimal number of checkpoints such that the average execution time is minimal. While minimal average execution time is important, it is for real-time systems important to provide a high probability of meeting given deadlines. Hence, there is a need of probabilistic guarantees that jobs employing RRC complete before a given deadline. Therefore, in this paper we present a mathematical framework for the evaluation of level of confidence, the probability that a given deadline is met, when RRC is employed.


Related files:
dimni_SSoCC11.pdfAdobe Acrobat portable document


[NISL11] Dimitar Nikolov, Urban Ingelsson, Virendra Singh, Erik Larsson, "Level of Confidence Study for Roll-back Recovery with Checkpointing", The 11th Swedish System-on-Chip Conference, Varberg, Sweden, May 2-3, 2011 (not reviewed, not printed).
( ! ) perl script by Giovanni Squillero with modifications from Gert Jervan   (v3.1, p5.2, September-2002-)