A bus management tool that allows communication to be maintained between a
group of nodes operatively connected on two busses in the presence of
radiation by periodically transmitting a first message from one to
another of the nodes on a first of the busses, determining whether the first
message was received by the other of the nodes on the first bus, and when
it is determined that the first message was not received by the other of
the nodes, transmitting a recovery command to the other of the nodes on a
second of the busses. Methods, systems, and articles of manufacture
consistent with the present invention also provide for a bus recovery
tool on the other node that re-initializes a bus interface circuit
operatively connecting the other node to the first bus in response to the
recovery command.
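
The sequence described above can be illustrated with a minimal simulation sketch. It is not the claimed implementation: the class names (`BusInterface`, `Node`), the function `manage_bus`, the bus labels "A" and "B", the message tuples `("PING",)` and `("RECOVER", bus)`, and the `hung` flag used to model a radiation-induced upset are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class BusInterface:
    """Hypothetical bus interface circuit; `hung` models a radiation upset."""
    hung: bool = False

    def reinitialize(self) -> None:
        # Bus recovery tool action: reset the interface circuit.
        self.hung = False


@dataclass
class Node:
    name: str
    # One interface per bus; labels "A" and "B" are assumptions.
    interfaces: dict = field(
        default_factory=lambda: {"A": BusInterface(), "B": BusInterface()}
    )

    def receive(self, bus: str, message: tuple) -> bool:
        """Return True if the message arrived (acts as the acknowledgement)."""
        if self.interfaces[bus].hung:
            return False  # a hung interface never sees the message
        if message[0] == "RECOVER":
            # Re-initialize the interface named in the recovery command.
            self.interfaces[message[1]].reinitialize()
        return True


def manage_bus(target: Node, first_bus: str = "A", second_bus: str = "B") -> str:
    """One cycle of the bus management tool, run from the sending node."""
    # Step 1: send the first message on the first bus.
    if target.receive(first_bus, ("PING",)):
        return "ok"
    # Step 2: not received, so send a recovery command on the second bus.
    if target.receive(second_bus, ("RECOVER", first_bus)):
        return "recovered"
    return "unreachable"


node = Node("node-2")
node.interfaces["A"].hung = True   # simulate an upset on the first bus
first_cycle = manage_bus(node)     # recovery command goes out on bus B
second_cycle = manage_bus(node)    # first bus is usable again
```

In this sketch `manage_bus` covers a single cycle; the periodic transmission in the description would correspond to calling it on a timer for each remote node.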