A system maintains a copy of data stored in a first memory device in a
redundant distinct second memory device. Upon detecting an uncorrectable
error in the first memory device, the system then relies on the copy of
the data in the second memory device. The system, once it starts relying
on the data in the second memory device, may then test the first memory
device to determine if the uncorrectable error was due to a physical
problem or a transient event. If the first memory device is then found to
be working correctly, it may, in turn, become a redundant memory device
for the second memory device.