This document describes solutions to reduce the time of reduced data
redundancy following transient disk failures that do not corrupt the
disk. Beneficially, these solutions provide a way to estimate the most
efficient repair strategy for the disk group, which helps to minimize the
amount of time data in a disk group remains unprotected. Merely by way of
example, a threshold value might specify a duration in which a disk
failure should be considered transient, such that if the disk is repaired
within that duration, only the stale extents on the disk need be
recreated. If the disk cannot be repaired within that duration, the
entire contents of the disk might be recreated on one or more other disks
in the group.