The invention relates to processing of electronic text communication distributed
in bulk. In one embodiment, a method for detecting electronic text communication
distributed in bulk is disclosed. After receiving a first electronic text communication,
it is processed with an algorithm to produce a first fingerprint. A time period
is begun for the first electronic text communication. After receiving a second
electronic text communications, it is also processed with the algorithm to produce
a second fingerprint. The first fingerprint to the second fingerprint are compared
to determine if the first electronic text communication is similar to the second
electronic text communication. A count for the first electronic text communication
is updated based upon the comparison. It is determined if the count during the
time period reaches a first threshold.