A multi-processor system includes a partition including a selected number
of nodes selected from a plurality of nodes provided in a plurality of
node groups, each of the nodes including a computer. A failed node in the
partition notifies a failure to a corresponding service processor of the
node group and other nodes of the partition. The corresponding service
processor and the service processors managing the other nodes notify the
error log information to a service processor manager, which identifies
the location of the failure and indicate the service processors to
recover from the failure.