An apparatus and method share predictive failure information between
computer systems in a computer network. The shared predictive failure
information allows dynamically rerouting a network request to avoid a
computer system that may fail according to its predictive failure
information. According to a first embodiment, if the requested resource
on the network has predictive failure information that indicates the
resource may soon fail, a message is returned to the requesting computer
with information that includes possible alternative sites from which the
requested resource may be obtained. If there is an alternative site, the
requesting computer system may access the alternative site, thereby
avoiding the computer system that may soon fail. If there is no
alternative site, the requesting computer system may return an error
message, or may simply access the original resource on the chance that it
has not yet failed. According to a second embodiment, a router in the
network may indicate one or more alternative paths to a resource if the
predictive failure information for the router indicates it may soon fail.
The requesting computer system may then access the requested resource via
the alternative path. In this manner, predictive failure information can
be used in rerouting network traffic between computer systems on a
network to minimize the effect of a failing computer system.
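
The requester-side decision of the first embodiment can be sketched as follows. This is a minimal illustration only, not the claimed implementation: the names ResourceResponse, may_soon_fail, alternative_sites, choose_site, and allow_risky_access are all hypothetical, and the sketch assumes the returned message has already been parsed into a simple record.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ResourceResponse:
    """Hypothetical parsed reply from the site hosting the requested resource."""
    site: str                 # the originally requested site
    may_soon_fail: bool       # assumed summary of the site's predictive failure information
    alternative_sites: List[str] = field(default_factory=list)


def choose_site(response: ResourceResponse,
                allow_risky_access: bool = True) -> Optional[str]:
    """Return the site the requester should access, or None to signal an error."""
    if not response.may_soon_fail:
        # No predicted failure: use the original site.
        return response.site
    if response.alternative_sites:
        # Predicted failure and an alternative exists: avoid the failing site.
        return response.alternative_sites[0]
    # Predicted failure and no alternative: either take the chance that the
    # original site has not yet failed, or report an error (None).
    return response.site if allow_risky_access else None
```

For example, choose_site(ResourceResponse("site-a", True, ["site-b"])) selects "site-b", while the same call with no alternatives falls back to "site-a" or to None, depending on allow_risky_access. Taking the first listed alternative is an arbitrary choice made for this sketch.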