A distributed method and system of controlling a communications network
having a plurality of spans of interconnected network elements some of
which include a network element processor distributes network topology
information to respective span databases; stores original fault objects
in the respective span databases; advertises fault objects to other
network element processors in a local span when the original fault
affects network elements other than a network element in which the fault
occurred; advertises alarm objects to other network element processors
that are respectively associated with a circuit affected by the original
faults; stores the advertised fault and alarm objects in the respective
span databases; and performs distributed processing of the advertised
fault and alarm objects with the other network element processors and the
respective span databases. Aggregation of other faults and alarms that
may be occurring on the communications network due to other faults other
than the received fault aids in determining causality of the fault.
Causality may be determined by correlating other faults and alarms with
the received fault. If not a root cause of another fault or alarm, the
received fault is sympathetic to another fault or alarm. Sympathetic
faults are suppressed while root cause faults are promoted to an alarm
and reported to affected network elements. The number of alarms viewed by
a network manager as well as the reporting of alarms and underlying
faults are reduced by performing such distributed alarm correlation and
fault reporting suppression.