A system and method to implement a resilient compute center. A plurality
of processing systems is initialized. Each of the processing systems
capable of operation communicates status information about its
operational health to a management module responsible for managing the
processing systems. The management module reinitializing any of the
processing systems, if the management module determines that any of the
processing systems is operating in a degraded state based on the status
information communicated to the management module.