Inproceedings,

Markov Models of Fault-Tolerant Memory Systems Under SEU

.
Proceedings of the Records of the 2004 International Workshop on Memory Technology, Design and Testing, page 38--43. Washington, DC, USA, IEEE Computer Society, (2004)

Abstract

A Single Event Upset (SEU) can affect the correct operation of digital systems, such as memories and processors. This paper proposes novel Markov based models for analyzing the reliability and availability of different fault-tolerant memory arrangements under the operational scenario of an SEU. These arrangements exploit redundancy (either duplex or triplex replication) for dynamic fault-tolerant operation as provided by arbitration (for error detection and output selection) as well as in the presence of dedicated circuitry implementing different correction/detection codes for bit-flips as errors. The primary objective is to preserve either the correctness, or the fail-safe nature of the data output of the memory system for long mission time. It is shown that a duplex memory system encoded with error control codes has a higher reliability than the triplex arrangement. Moreover, the use of a code for single error correction and double error detection (SEC-DED) improves both availability and reliability compared to an error correction code with same error detection capabilities.

Tags

Users

  • @avail_map_stud

Comments and Reviews