A computer program product and computer system for error monitoring
partitions in a computer system. Provided to each partition is a
partition status indicator (PSI) denoting a RUNNING or FAIL status of the
partition, and an error log area (ELA) for storing partition error
entries. The ELA includes a partition identifier, an entry status
indicator (ESI) indicating READ/UNREAD status for the error entry, and an
error identifier. An error procedure performed for each first partition
whose partition status indicator indicates the FAIL status includes:
copying each error entry in the ELA of the first partition whose ESI
indicates the UNREAD status into the ELA of a second (running) partition;
setting the ESI to the READ status for each copied error entry in the ELA
of the first partition; and having the ESI set to the UNREAD status for
each copied error entry in the ELA of the second partition.