Potentially identical objects (e.g., files) are located across multiple
computers based on stochastic partitioning of workload. For each of a plurality
of objects stored on a plurality of computers in a network, a portion of object
information corresponding to the object is selected. The object information can
be generated in a variety of ways (e.g., by hashing the object or from
characteristics of the object). Any of a variety of portions of
the object information can be used (e.g., the least significant bits of the object
information). A stochastic partitioning process then identifies which of the
plurality of computers should receive the object information, so that
potentially identical objects on the plurality of computers can be identified.
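
The flow described above can be illustrated with a minimal Python sketch. It is not the claimed method: the abstract does not specify the hash function, the portion width, or the partitioning rule, so the sketch assumes SHA-256 for the object information, the 16 least significant bits as the selected portion, and a simple modulo mapping as a stand-in for the stochastic partitioning process. All function names are hypothetical.

```python
import hashlib
from collections import defaultdict


def object_info(data):
    """Object information: assumed here to be a SHA-256 digest, as an integer."""
    return int.from_bytes(hashlib.sha256(data).digest(), "big")


def select_portion(info, bits=16):
    """Select a portion of the object information: its least significant bits."""
    return info & ((1 << bits) - 1)


def target_computer(portion, computers):
    """Stand-in partitioning rule (an assumption): modulo over the computer list.

    Hash bits are roughly uniform, so the workload spreads across computers,
    and identical objects always map to the same computer.
    """
    return computers[portion % len(computers)]


def find_potential_duplicates(stores, computers):
    """Route each object's information to a computer, then report matches.

    `stores` maps a computer name to a dict of {object name: contents}.
    """
    # received[target][info] -> list of (owner, object name) reported to target
    received = defaultdict(lambda: defaultdict(list))
    for owner, objects in stores.items():
        for name, data in objects.items():
            info = object_info(data)
            target = target_computer(select_portion(info), computers)
            received[target][info].append((owner, name))

    # Each target only compares the information that was sent to it.
    matches = []
    for by_info in received.values():
        for holders in by_info.values():
            if len(holders) > 1:
                matches.append(sorted(holders))
    return sorted(matches)


if __name__ == "__main__":
    stores = {
        "A": {"x.txt": b"hello"},
        "B": {"y.txt": b"hello", "z.txt": b"world"},
        "C": {"w.txt": b"something else"},
    }
    print(find_potential_duplicates(stores, ["A", "B", "C"]))
```

Because the target is derived only from the object information, no computer needs a global view: each one checks just the share of information routed to it, and matching entries are only potentially identical until the objects themselves are compared.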