Deviation-Tolerant Computation in Concurrent Failure-Prone HardwarePhillip Stanley-Marbell and Diana Marculescu.Technical report, Electronic Systems group, Technische Universiteit Eindhoven, The Netherlands. ESR-2008-01, ES Reports, ISSN 1574-9517 . January 2008.
Presented are analysis on the relation between faults occurring
in compute hardware or communicated program state (in a multiprocessor
system) and the resulting deviations in values manifested in
source-level program variables. These relations are dependent on
the distributions of values taken on by program variables of different
data types in the absence of faults, and we present detailed
characterizations of these distributions for a large collection of
programs. We show how the analytic derivations, in conjunction with
the empirical characterizations, can enable the implementation of
deviation-tolerant transformations in programs. The work is presented
in the context of a hardware platform we have designed and implemented,
containing 24 processing elements, that manifests tradeoffs between
occurrences of faults in hardware, performance, and energy efficiency.
|