An apparatus in one example comprises a substitute manager component that
upon failure of a first manager component, obtains management
responsibility for a distributed software application from the first
manager component and queries one or more software components of the
distributed software application for state information of the one or more
software components. The substitute manager component employs the state
information to determine whether one or more recovery actions are needed
for the distributed software application to recover from the failure of
the first manager component.