Potentially identical objects (e.g., files) are located across multiple
computers based on stochastic partitioning of workload. For each of a
plurality of objects stored on a plurality of computers in a network, a
portion of object information corresponding to the object is selected.
The object information can be generated in a variety of manners (e.g.,
based on hashing the object, based on characteristics of the object, and
so forth). Any of a variety of portions of the object information can be
used (e.g., the least significant bits of the object information). A
stochastic partitioning process is then used to identify which of the
plurality of computers to communicate the object information to for
identification of potentially identical objects on the plurality of
computers.