A fault isolation technique for checking the accuracy of data packets
transmitted between nodes of a parallel processor. An independent CRC is
kept of all data sent from one processor to another, and of all data
received by one processor from another. At each checkpoint, the CRCs
kept at the two ends are compared. If they do not match, an error has
occurred. The CRCs may be cleared and restarted at each checkpoint. In
the preferred embodiment,
the basic functionality is to calculate a CRC of all packet data that has
been successfully transmitted across a given link. This CRC is computed at
both ends of the link, thereby allowing an independent check on all data
believed to have been correctly transmitted. Preferably, all links have
this CRC coverage, and the CRC used in this link-level check is different
from that used in the packet transfer protocol. This independent check,
if successfully passed, virtually eliminates the possibility that any
data errors were missed during the previous transfer period.
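
The link-level check described above can be illustrated with a minimal sketch. It is not the implementation of the embodiment: the names LinkCrc, checkpoint, and run_period are hypothetical, and CRC-32 from Python's zlib module is assumed purely as a stand-in for the link-level CRC, whose polynomial is not specified here. Each end of a link keeps a running CRC over every payload it believes was transferred correctly; at a checkpoint the two values are compared and then cleared.

```python
import zlib


class LinkCrc:
    """Running CRC kept at one end of a link (hypothetical helper)."""

    def __init__(self) -> None:
        self.value = 0

    def update(self, payload: bytes) -> None:
        # zlib.crc32 takes the previous value as a starting point, so the
        # result covers the concatenation of all payloads seen so far.
        self.value = zlib.crc32(payload, self.value)

    def reset(self) -> None:
        self.value = 0


def checkpoint(sender: LinkCrc, receiver: LinkCrc) -> bool:
    """Compare both ends of the link, then clear them for the next period."""
    ok = sender.value == receiver.value
    sender.reset()
    receiver.reset()
    return ok


def run_period(packets, corrupt_index=None) -> bool:
    """Simulate one transfer period over a single link.

    If corrupt_index is given, that packet arrives with one bit flipped,
    modelling an error that the packet-level protocol failed to catch.
    Packets are assumed to be non-empty.
    """
    tx, rx = LinkCrc(), LinkCrc()
    for i, payload in enumerate(packets):
        tx.update(payload)
        received = payload
        if i == corrupt_index:
            received = bytes([payload[0] ^ 0x01]) + payload[1:]
        rx.update(received)
    return checkpoint(tx, rx)


if __name__ == "__main__":
    packets = [b"abc", b"defg"]
    print(run_period(packets))                   # clean period
    print(run_period(packets, corrupt_index=1))  # one undetected bit flip
```

In this sketch a clean period passes the checkpoint, while a single flipped bit at the receiving end makes the two running CRCs disagree. A real system would use a link-level CRC different from the one in the packet transfer protocol, as the text requires.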